SVE Instruction List by Dougall Johnson
LD1B (scalar plus vector): Gather load unsigned bytes to vector (vector index)
LD1B { Zt.S }, Pg/Z, [Xn, Zm.S, SXTW] (SVE+NS
LD1B { Zt.S }, Pg/Z, [Xn, Zm.S, UXTW] (SVE+NS
svint32_t svld1ub_gather_[s32]offset_s32(svbool_t pg, const uint8_t *base, svint32_t offsets)
svuint32_t svld1ub_gather_[s32]offset_u32(svbool_t pg, const uint8_t *base, svint32_t offsets)
svint32_t svld1ub_gather_[u32]offset_s32(svbool_t pg, const uint8_t *base, svuint32_t offsets)
svuint32_t svld1ub_gather_[u32]offset_u32(svbool_t pg, const uint8_t *base, svuint32_t offsets)
128-bit SVE
data:image/s3,"s3://crabby-images/b736e/b736e3d2483f7795c0411fc785facd8c2dd65f9e" alt=""
Gather (load) and zero extend 8-bit values into the 32-bit elements of (3), from a base address (Xn/base), plus each corresponding sign-or-zero-extended 32-bit offset from (2). If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
256-bit SVE
data:image/s3,"s3://crabby-images/9979b/9979b20dbe22888fb45f8a0640eecba8eeab7b34" alt=""
Gather (load) and zero extend 8-bit values into the 32-bit elements of (3), from a base address (Xn/base), plus each corresponding sign-or-zero-extended 32-bit offset from (2). If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
512-bit SVE
data:image/s3,"s3://crabby-images/90295/90295a3a627f7f7552cb8e747d15ab38886385e9" alt=""
Gather (load) and zero extend 8-bit values into the 32-bit elements of (3), from a base address (Xn/base), plus each corresponding sign-or-zero-extended 32-bit offset from (2). If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
Larger sizes
1024-bit SVE
data:image/s3,"s3://crabby-images/9bd05/9bd05113fab720b25f8a731e4c480fa7ec82b5d0" alt=""
Gather (load) and zero extend 8-bit values into the 32-bit elements of (3), from a base address (Xn/base), plus each corresponding sign-or-zero-extended 32-bit offset from (2). If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
2048-bit SVE
data:image/s3,"s3://crabby-images/f6ee8/f6ee8fbebf60c14a714d6e51ed308d28357cdd0d" alt=""
Gather (load) and zero extend 8-bit values into the 32-bit elements of (3), from a base address (Xn/base), plus each corresponding sign-or-zero-extended 32-bit offset from (2). If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.