SVE Instruction List by Dougall Johnson
LD1D (vector plus immediate): Gather load doublewords to vector (immediate index)
LD1D { Zt.D }, Pg/Z, [Zn.D, #imm] (SVE+NS
svfloat64_t svld1_gather[_u64base]_offset_f64(svbool_t pg, svuint64_t bases, int64_t offset)
svint64_t svld1_gather[_u64base]_offset_s64(svbool_t pg, svuint64_t bases, int64_t offset)
svuint64_t svld1_gather[_u64base]_offset_u64(svbool_t pg, svuint64_t bases, int64_t offset)
128-bit SVE
Gather (load) 64-bit values into (3), from the corresponding 64-bit address from (2), plus an immediate. If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The immediate offset is limited to 0 ≤ imm < 256, and must be divisible by eight.
256-bit SVE
Gather (load) 64-bit values into (3), from the corresponding 64-bit address from (2), plus an immediate. If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The immediate offset is limited to 0 ≤ imm < 256, and must be divisible by eight.
512-bit SVE
Gather (load) 64-bit values into (3), from the corresponding 64-bit address from (2), plus an immediate. If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The immediate offset is limited to 0 ≤ imm < 256, and must be divisible by eight.
Larger sizes
1024-bit SVE
Gather (load) 64-bit values into (3), from the corresponding 64-bit address from (2), plus an immediate. If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The immediate offset is limited to 0 ≤ imm < 256, and must be divisible by eight.
2048-bit SVE
Gather (load) 64-bit values into (3), from the corresponding 64-bit address from (2), plus an immediate. If the predicate bit from (1) corresponding to an element in (3) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero. The immediate offset is limited to 0 ≤ imm < 256, and must be divisible by eight.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.