I'm working on optimizing a function which uses floating point instructions.
For bench marking I need to know execution latency of the instructions to know theoretical possible performance.
I have found such manual for A57: http://infocenter.arm.com/help/topic/com.arm.doc.uan0015b/Cortex_A57_Software_Optimization_Guide_external.pdf
But, I don't find any such doc for A53. Am I missing something? Is there any such optimization guide available for A53?