SVE Instruction List by Dougall Johnson
ST4W (scalar plus scalar): Contiguous store four-word structures from four vectors (scalar index)
ST4W { Zt1.S, Zt2.S, Zt3.S, Zt4.S }, Pg, [Xn, Xm, LSL #2] (SVE (SME
void svst4[_f32](svbool_t pg, float32_t *base, svfloat32x4_t data)
void svst4[_s32](svbool_t pg, int32_t *base, svint32x4_t data)
void svst4[_u32](svbool_t pg, uint32_t *base, svuint32x4_t data)
128-bit SVE
Interleave 32-bit elements from four consecutive registers (2), (3), (4), and (5), and store them to the memory operand (6). If the predicate bit from (1) corresponding to an element in (2), (3), (4), and (5) is zero, those four contiguous stores are skipped, and cannot cause a fault, and the corresponding values in memory are unchanged.
256-bit SVE
Interleave 32-bit elements from four consecutive registers (2), (3), (4), and (5), and store them to the memory operand (6). If the predicate bit from (1) corresponding to an element in (2), (3), (4), and (5) is zero, those four contiguous stores are skipped, and cannot cause a fault, and the corresponding values in memory are unchanged.
512-bit SVE
Interleave 32-bit elements from four consecutive registers (2), (3), (4), and (5), and store them to the memory operand (6). If the predicate bit from (1) corresponding to an element in (2), (3), (4), and (5) is zero, those four contiguous stores are skipped, and cannot cause a fault, and the corresponding values in memory are unchanged.
Larger sizes
1024-bit SVE
Interleave 32-bit elements from four consecutive registers (2), (3), (4), and (5), and store them to the memory operand (6). If the predicate bit from (1) corresponding to an element in (2), (3), (4), and (5) is zero, those four contiguous stores are skipped, and cannot cause a fault, and the corresponding values in memory are unchanged.
2048-bit SVE
Interleave 32-bit elements from four consecutive registers (2), (3), (4), and (5), and store them to the memory operand (6). If the predicate bit from (1) corresponding to an element in (2), (3), (4), and (5) is zero, those four contiguous stores are skipped, and cannot cause a fault, and the corresponding values in memory are unchanged.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.