I have been found a lib but there was not
void atan2fv_neon_hfp(float *y, float *x,float *res,int len)
to calculate len floats once.
How can I write a neon version for atan2fv_neon_hfp
?
I have been found a lib but there was not
void atan2fv_neon_hfp(float *y, float *x,float *res,int len)
to calculate len floats once.
How can I write a neon version for atan2fv_neon_hfp
?
Glancing over the implementation of atan2f_neon_hfp it doesn't look like there's anything that wouldn't be possible to trivially vectorize (by loading pairs into d16 and d17, instead of vdup'ing into them), but you'll need to know ARM/NEON assembly. Alternatively, you could just call the scalar version in a loop.