After some operations I have an SSE register in which each of the 16 bytes has its lowest bit set if some condition was fulfilled, and is 0 otherwise. I would now like to extract this into a 16-bit bitmask in which bit j is set iff byte j had the value one.
I searched the Intel Intrinsics Guide up and down but couldn't find an intrinsic that does this. Pseudocode for what I want, in the guide's notation (a[i] is bit i of the register, dst is the returned mask):
int _mm_???(__m128i a)
FOR j := 0 to 15
    i := j*8
    IF a[i]
        dst[j] := 1
    ELSE
        dst[j] := 0
    FI
ENDFOR
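
To make the intended semantics concrete, here is a minimal sketch in C. The function names low_bits_scalar and low_bits_sse2 are made up for illustration. The first is a plain scalar reference for the pseudocode above. The second is one possible SSE2 approach, not necessarily the best one: it assumes SSE2 is available and relies on _mm_movemask_epi8, which collects the highest bit of each byte, so the lowest bit is first shifted up into the highest position.

```c
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

/* Scalar reference: bit j of the result is the lowest bit of byte j. */
static int low_bits_scalar(const uint8_t bytes[16])
{
    int mask = 0;
    for (int j = 0; j < 16; ++j)
        mask |= (bytes[j] & 1) << j;
    return mask;
}

/* SSE2 sketch (assumption: SSE2 is available on the target). */
static int low_bits_sse2(__m128i a)
{
    /* Shift each 16-bit lane left by 7: bit 0 of every byte lands in
       bit 7 of that same byte. Bits that cross from the low byte into
       the high byte end up in positions movemask does not read. */
    __m128i shifted = _mm_slli_epi16(a, 7);

    /* Gather bit 7 of each of the 16 bytes into bits 0..15 of an int. */
    return _mm_movemask_epi8(shifted);
}
```

If the bytes were all-ones (0xFF) instead of 1 for a fulfilled condition, the shift would be unnecessary and _mm_movemask_epi8 alone would suffice.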