Commit Graph

2 Commits

Author SHA1 Message Date
Feoramund
12dd0cb72a Simplify and make simd_util cross-platform
This new algorithm uses a Scalar->Vector->Scalar iteration loop which
requires no masking off of any incomplete data chunks.

Also, the width was reduced to 32 bytes instead of 64, as I found this
to be about as fast as the previous 64-byte x86 version.
2024-08-09 18:54:04 -04:00
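
The Scalar->Vector->Scalar loop described in the commit above can be sketched roughly as follows. This is not the code from the commit, which is not shown here; it is a minimal C illustration under several assumptions. The name index_byte_sketch is hypothetical, the scalar head is assumed to run until the address is 32-byte aligned, and the 32-byte "vector" step is written as a plain fixed-width loop (which a compiler may auto-vectorize) rather than with real SIMD intrinsics.

```c
#include <stddef.h>
#include <stdint.h>

enum { CHUNK = 32 }; /* 32-byte width, as mentioned in the commit message */

/* Hypothetical helper: returns the index of the first byte equal to c,
 * or -1 if c does not occur in data[0..len). */
ptrdiff_t index_byte_sketch(const unsigned char *data, size_t len, unsigned char c) {
    size_t i = 0;

    /* Scalar head (assumption): step one byte at a time until the
     * current address is a multiple of CHUNK. */
    while (i < len && ((uintptr_t)(data + i) % CHUNK) != 0) {
        if (data[i] == c) return (ptrdiff_t)i;
        i++;
    }

    /* Vector body: only whole CHUNK-sized blocks are handled here, so no
     * partial block ever has to be masked off. */
    while (len - i >= CHUNK) {
        unsigned found = 0;
        for (size_t j = 0; j < CHUNK; j++) {
            found |= (unsigned)(data[i + j] == c);
        }
        if (found) {
            /* Locate the first match inside the block. */
            for (size_t j = 0; j < CHUNK; j++) {
                if (data[i + j] == c) return (ptrdiff_t)(i + j);
            }
        }
        i += CHUNK;
    }

    /* Scalar tail: fewer than CHUNK bytes remain. */
    while (i < len) {
        if (data[i] == c) return (ptrdiff_t)i;
        i++;
    }
    return -1;
}
```

Calling index_byte_sketch(buf, n, 'z') on a buffer of any length and alignment goes through the same three phases; short inputs simply skip the middle loop.
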
Feoramund
28c98c2e7a Add tests for vectorized index_* procs
2024-08-06 15:19:05 -04:00