SVE Instruction List by Dougall Johnson
ST1W (scalar plus immediate, consecutive registers): Contiguous store of words from multiple consecutive vectors (immediate index)
ST1W { Zt1.S, Zt2.S, Zt3.S, Zt4.S }, PNg, [Xn{, #imm, MUL VL}] (SVE2.1 (SME2+S
128-bit SVE
Store 32-bit values from the four consecutive registers (2), (3), (4), and (5) to the memory operand (6). The predicate (1) is first decoded from its predicate-as-counter representation to a quadruple-length predicate. If the predicate bit corresponding to an element is zero, that store is skipped: it cannot cause a fault, and the corresponding value in memory is unchanged. The first register number (2) must be divisible by four.
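The behaviour described above can be approximated with a small scalar model. This is only a sketch under stated assumptions, not the architectural pseudocode: the function name `st1w_x4` and its parameters are hypothetical, the predicate is passed in already decoded to its quadruple-length form (the predicate-as-counter encoding itself is not modelled), the four registers are assumed to land in memory one after another in register order, the immediate is assumed to count in units of one vector length (as `MUL VL` suggests), and memory is assumed to be little-endian.

```python
def st1w_x4(memory, base, zfile, zt, mask, vl_bits=128, imm=0):
    """Scalar model of a four-register contiguous word store (illustrative only).

    memory : bytearray standing in for addressable memory
    base   : byte offset playing the role of Xn
    zfile  : list of vector registers, each a list of 32-bit lane values
    zt     : index of the first source register
    mask   : already-decoded quadruple-length predicate, one bool per element
    imm    : ASSUMPTION: offset in units of one vector length (MUL VL)
    """
    # The source states that the first register number must be divisible by four.
    if zt % 4 != 0:
        raise ValueError("first register number must be divisible by four")

    elems = vl_bits // 32      # 32-bit elements per vector
    vl_bytes = vl_bits // 8    # bytes per vector
    if len(mask) != 4 * elems:
        raise ValueError("mask must hold one bit per element of all four vectors")

    addr = base + imm * vl_bytes
    for r in range(4):                 # four consecutive registers
        for e in range(elems):         # elements within one register
            i = r * elems + e          # ASSUMPTION: register-after-register layout
            if not mask[i]:
                continue               # inactive element: memory is left untouched
            lane = zfile[zt + r][e] & 0xFFFFFFFF
            off = addr + 4 * i
            # ASSUMPTION: little-endian byte order
            memory[off:off + 4] = lane.to_bytes(4, "little")
    return memory
```

For example, with a 128-bit vector length there are 16 elements across the four registers; a mask whose first 10 entries are true writes 40 bytes starting at `base` and leaves the remaining 24 bytes as they were.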
256-bit SVE
Store 32-bit values from the four consecutive registers (2), (3), (4), and (5) to the memory operand (6). The predicate (1) is first decoded from its predicate-as-counter representation to a quadruple-length predicate. If the predicate bit corresponding to an element is zero, that store is skipped: it cannot cause a fault, and the corresponding value in memory is unchanged. The first register number (2) must be divisible by four.
512-bit SVE
Store 32-bit values from the four consecutive registers (2), (3), (4), and (5) to the memory operand (6). The predicate (1) is first decoded from its predicate-as-counter representation to a quadruple-length predicate. If the predicate bit corresponding to an element is zero, that store is skipped: it cannot cause a fault, and the corresponding value in memory is unchanged. The first register number (2) must be divisible by four.
Larger sizes
1024-bit SVE
Store 32-bit values from the four consecutive registers (2), (3), (4), and (5) to the memory operand (6). The predicate (1) is first decoded from its predicate-as-counter representation to a quadruple-length predicate. If the predicate bit corresponding to an element is zero, that store is skipped: it cannot cause a fault, and the corresponding value in memory is unchanged. The first register number (2) must be divisible by four.
2048-bit SVE
Store 32-bit values from the four consecutive registers (2), (3), (4), and (5) to the memory operand (6). The predicate (1) is first decoded from its predicate-as-counter representation to a quadruple-length predicate. If the predicate bit corresponding to an element is zero, that store is skipped: it cannot cause a fault, and the corresponding value in memory is unchanged. The first register number (2) must be divisible by four.
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.