Discussion about this post

User's avatar
dzaima's avatar

NEON actually supports proper 32-bit int multiplies - vmulq_s32; so does AVX2 - _mm256_mullo_epi32. NEON does also have dynamic shifts via vshlq_s32 & co (that might look like a left-shift instr, but it also does right-shifts with a negative shift amount).

Expand full comment
remageFrs's avatar

There's a small mistake in 'Bit-Hacking the IEEE Float Format' step 4:

"Bitcast the resulting bits to a float and subtract 1.0 to shift the number down into the range: [1.0, 2.0)." - range should be "[0.0, 1.0)", the subtraction of 1.0 shifts _from_ [1.0,2.0) _to_ [0.0,1.0).

Expand full comment
4 more comments...

No posts