How can I mask 8 floats in my __m256
variable via bits in my unsigned char
variable? (their values are not known during compilation)
__m256 flts = _mm256_set1_ps(5.0f);
unsigned char = 0b10010111;//just for example, but can be any value during runtime
Desired output would have flts
contain 5, 0, 0, 5, 0, 5, 5, 5
Is there an efficient instruction on Intel Intrinsics Guide?
Processor only supports instructions up to AVX (but not AVX2 or beyond)