Let's assume I have defined a __m256d x
and that I want to extract the lower 128 bits. I would do:
__m128d xlow = _mm256_castpd256_pd128(x);
However, I recently saw someone do:
__m128d xlow = (__m128d) x;
Is there a preferred method for this cast? Why use the first method?
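
For context, here is a minimal sketch of how I use the first method in a complete function. The name low_half is a made-up helper for illustration, and the target attribute assumes GCC or Clang (it only saves me from passing -mavx for the whole file); running it assumes an AVX-capable CPU.

```c
#include <immintrin.h>

/* Hypothetical helper (not part of any library): loads four doubles into a
   256-bit vector and stores its lower 128 bits (the first two doubles).
   The target attribute is GCC/Clang-specific and enables AVX for this
   function only. */
__attribute__((target("avx")))
void low_half(const double *in, double *out)
{
    __m256d x = _mm256_loadu_pd(in);           /* x = {in[0], in[1], in[2], in[3]} */
    __m128d xlow = _mm256_castpd256_pd128(x);  /* lower half: {in[0], in[1]} */
    _mm_storeu_pd(out, xlow);                  /* write the two low doubles */
}
```

Calling low_half on {1.0, 2.0, 3.0, 4.0} should leave 1.0 and 2.0 in out.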