SVE Instruction List by Dougall Johnson
LD1ROB (scalar plus immediate): Contiguous load and replicate thirty-two bytes (immediate index)
LD1ROB { Zt.B }, Pg/Z, [Xn, #imm] (SVE+F64MM+NS
svint8_t svld1ro[_s8](svbool_t pg, const int8_t *base)
svuint8_t svld1ro[_u8](svbool_t pg, const uint8_t *base)
128-bit SVE
This operation is undefined for 128-bit SVE.
256-bit SVE
Load each 8-bit element in the low 256-bit segment of (3) from the memory operand (2), or zero the element if the corresponding predicate bit in (1) is zero, then replicate that 256-bit segment to fill the register, ignoring the predicate. If the predicate bit for an element in the low 256-bit segment of (3) is zero, that element's load is skipped and cannot cause a fault.
512-bit SVE
Load each 8-bit element in the low 256-bit segment of (3) from the memory operand (2), or zero the element if the corresponding predicate bit in (1) is zero, then replicate that 256-bit segment to fill the register, ignoring the predicate. If the predicate bit for an element in the low 256-bit segment of (3) is zero, that element's load is skipped and cannot cause a fault.
Larger sizes
1024-bit SVE
Load each 8-bit element in the low 256-bit segment of (3) from the memory operand (2), or zero the element if the corresponding predicate bit in (1) is zero, then replicate that 256-bit segment to fill the register, ignoring the predicate. If the predicate bit for an element in the low 256-bit segment of (3) is zero, that element's load is skipped and cannot cause a fault.
2048-bit SVE
Load each 8-bit element in the low 256-bit segment of (3) from the memory operand (2), or zero the element if the corresponding predicate bit in (1) is zero, then replicate that 256-bit segment to fill the register, ignoring the predicate. If the predicate bit for an element in the low 256-bit segment of (3) is zero, that element's load is skipped and cannot cause a fault.
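The behaviour described for each vector length can be sketched as a scalar reference model in portable C. This is only an illustration under stated assumptions, not the intrinsic or the architectural pseudocode: the name `ld1rob_model` and the constant `SEGMENT_BYTES` are hypothetical, the predicate is simplified to one byte per 8-bit element (nonzero meaning active), and the vector length is assumed to be a multiple of 256 bits, as in the sizes listed above.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

enum { SEGMENT_BYTES = 32 }; /* one 256-bit segment = thirty-two bytes */

/* Hypothetical scalar model of LD1ROB (not an ACLE function).
 *   zt:       destination register image, vl_bytes bytes long
 *   vl_bytes: vector length in bytes; assumed to be a multiple of 32
 *   pg:       simplified predicate, one byte per element, nonzero = active
 *   base:     memory operand; only bytes with an active predicate are read
 */
static void ld1rob_model(uint8_t *zt, size_t vl_bytes,
                         const uint8_t *pg, const uint8_t *base)
{
    assert(vl_bytes >= SEGMENT_BYTES && vl_bytes % SEGMENT_BYTES == 0);

    /* Step 1: predicated load with zeroing into the low 256-bit segment.
     * An inactive element is never read from memory, so it cannot fault. */
    for (size_t i = 0; i < SEGMENT_BYTES; i++)
        zt[i] = pg[i] ? base[i] : 0;

    /* Step 2: replicate the low segment into every higher segment.
     * The predicate plays no part in this step. */
    for (size_t off = SEGMENT_BYTES; off < vl_bytes; off += SEGMENT_BYTES)
        memcpy(zt + off, zt, SEGMENT_BYTES);
}
```

With a 64-byte `zt` (modelling 512-bit SVE), bytes 32 to 63 end up as a copy of bytes 0 to 31, including the zeros written for inactive elements. With a 32-byte `zt` (256-bit SVE) the second loop does nothing, because the low segment is already the whole register.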
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.