Is there any single instruction or function that can invert the sign of every float inside a __m128?
i.e. a = r0:r1:r2:r3 ===> a = -r0:-r1:-r2:-r3
?
I know this can be done by _mm_sub_ps(_mm_set1_ps(0.0),a)
, but isn't it potentially slow since _mm_set1_ps(0.0)
is a multi-instruction function?