SVE Instruction List by Dougall Johnson
LD1W (scalar plus immediate, consecutive registers): Contiguous load of words to multiple consecutive vectors (immediate index)
LD1W { Zt1.S, Zt2.S }, PNg/Z, [Xn{, #imm, MUL VL}] (SVE2.1 (SME2+S
128-bit SVE
data:image/s3,"s3://crabby-images/324e2/324e28aba6a6c6e8f9abb58cc33e9d695945a7b6" alt=""
Load 32-bit values from the memory operand (1) into the 32-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
256-bit SVE
data:image/s3,"s3://crabby-images/7b5f3/7b5f3637aa2a34d6b180a295abb4dbf486d5f5c6" alt=""
Load 32-bit values from the memory operand (1) into the 32-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
512-bit SVE
data:image/s3,"s3://crabby-images/2c4e4/2c4e47349aff0a22e80404d90d6f56131f529be5" alt=""
Load 32-bit values from the memory operand (1) into the 32-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
Larger sizes
1024-bit SVE
data:image/s3,"s3://crabby-images/25e55/25e55f540b1fbf962fb1ee3cd6620a9db732e222" alt=""
Load 32-bit values from the memory operand (1) into the 32-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
2048-bit SVE
data:image/s3,"s3://crabby-images/858e5/858e53f87a218302f44925cb79aae82a472feda7" alt=""
Load 32-bit values from the memory operand (1) into the 32-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.