SVE Instruction List by Dougall Johnson
LD1B (scalar plus scalar, consecutive registers): Contiguous load of bytes to multiple consecutive vectors (scalar index)
LD1B { Zt1.B, Zt2.B }, PNg/Z, [Xn, Xm] (SVE2.1 (SME2+S
128-bit SVE
data:image/s3,"s3://crabby-images/b5d60/b5d60d729293186958d0676c7fe82f3caacbedab" alt=""
Load 8-bit values from the memory operand (1) into the 8-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
256-bit SVE
data:image/s3,"s3://crabby-images/fa00c/fa00ca563fc46a9bb0e6e73cbfc96c488c4e2b70" alt=""
Load 8-bit values from the memory operand (1) into the 8-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
512-bit SVE
data:image/s3,"s3://crabby-images/490f2/490f22d99546e1ad4373ff5522426868b5b04db8" alt=""
Load 8-bit values from the memory operand (1) into the 8-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
Larger sizes
1024-bit SVE
data:image/s3,"s3://crabby-images/3bbab/3bbab0ee8da98941956b19de9b81fdd1fd9af195" alt=""
Load 8-bit values from the memory operand (1) into the 8-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
2048-bit SVE
data:image/s3,"s3://crabby-images/a2eec/a2eec21bf168ab558cd213149b5569a28dc1fd77" alt=""
Load 8-bit values from the memory operand (1) into the 8-bit elements of two consecutive registers (2) and (3). After decoding the predicate from its predicate-as-counter representation to a double-length predicate, if the predicate bit corresponding to an element is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The first destination register number (2) must be divisible by two.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.