Fprox’s Substack • 269 implied HN points • 25 Jan 26
- Zvabd adds vector integer absolute-value and absolute-difference instructions, plus widened-accumulate variants. It targets DSP use and limits some ops to 8- and 16-bit elements to reduce hardware cost.
- Zvzip provides vzip, vunzip (even/odd), and vpair instructions that interleave and extract paired elements more directly than emulating them with vcompress. The new ops support optional masking.
- Zvdot4a8i defines 4-element 8-bit dot-product vector ops (vector-vector and vector-scalar) that multiply and accumulate 4×8-bit groups into 32-bit results, paving the way for faster matrix-style computations.
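
To make the three summaries above concrete, here is a rough scalar reference model in Python. It is a sketch of the semantics as described in the bullets, not the extensions' actual definitions: the function names are hypothetical, signed 8-bit inputs and wrapping 32-bit accumulation are assumptions, and masking, the widened-accumulate variants, and vpair are left out because the summary does not pin down their behavior.

```python
def abs_diff(a, b):
    """Element-wise absolute difference, in the spirit of the Zvabd ops."""
    return [abs(x - y) for x, y in zip(a, b)]


def zip_interleave(a, b):
    """Interleave two equal-length vectors: a0, b0, a1, b1, ..."""
    out = []
    for x, y in zip(a, b):
        out.extend((x, y))
    return out


def unzip_even(v):
    """Extract the even-indexed elements (0, 2, 4, ...)."""
    return v[0::2]


def unzip_odd(v):
    """Extract the odd-indexed elements (1, 3, 5, ...)."""
    return v[1::2]


def wrap32(x):
    """Wrap an integer to signed 32-bit range (assumed overflow behavior)."""
    return ((x + 2**31) % 2**32) - 2**31


def dot4a8(acc, a, b):
    """Accumulate 4-element 8-bit dot products into 32-bit lanes.

    acc holds one 32-bit value per output lane; a and b each hold
    4 * len(acc) values, assumed here to be signed 8-bit integers.
    """
    out = []
    for i, c in enumerate(acc):
        s = sum(a[4 * i + k] * b[4 * i + k] for k in range(4))
        out.append(wrap32(c + s))
    return out
```

For example, `zip_interleave([0, 2, 4], [1, 3, 5])` gives `[0, 1, 2, 3, 4, 5]`, and `unzip_even` and `unzip_odd` recover the two inputs. The dot-product model shows why the op suits matrix-style kernels: each 32-bit lane absorbs four 8-bit multiply-adds per instruction.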