SVE Instruction List by Dougall Johnson
LD3D (scalar plus immediate): Contiguous load three-doubleword structures to three vectors (immediate index)
LD3D { Zt1.D, Zt2.D, Zt3.D }, Pg/Z, [Xn{, #imm, MUL VL}] (SVE (SME
svfloat64x3_t svld3_vnum[_f64](svbool_t pg, const float64_t *base, int64_t vnum)
svint64x3_t svld3_vnum[_s64](svbool_t pg, const int64_t *base, int64_t vnum)
svuint64x3_t svld3_vnum[_u64](svbool_t pg, const uint64_t *base, int64_t vnum)
128-bit SVE
Load and deinterleave groups of three interleaved 64-bit values from the memory operand (1) into the 64-bit elements of three consecutive registers (2), (3), and (4). If the predicate bit corresponding to an element in (2), (3), and (4) is zero, those three contiguous loads are skipped, and cannot cause a fault, and the elements are set to zero.
256-bit SVE
Load and deinterleave groups of three interleaved 64-bit values from the memory operand (1) into the 64-bit elements of three consecutive registers (2), (3), and (4). If the predicate bit corresponding to an element in (2), (3), and (4) is zero, those three contiguous loads are skipped, and cannot cause a fault, and the elements are set to zero.
512-bit SVE
Load and deinterleave groups of three interleaved 64-bit values from the memory operand (1) into the 64-bit elements of three consecutive registers (2), (3), and (4). If the predicate bit corresponding to an element in (2), (3), and (4) is zero, those three contiguous loads are skipped, and cannot cause a fault, and the elements are set to zero.
Larger sizes
1024-bit SVE
Load and deinterleave groups of three interleaved 64-bit values from the memory operand (1) into the 64-bit elements of three consecutive registers (2), (3), and (4). If the predicate bit corresponding to an element in (2), (3), and (4) is zero, those three contiguous loads are skipped, and cannot cause a fault, and the elements are set to zero.
2048-bit SVE
Load and deinterleave groups of three interleaved 64-bit values from the memory operand (1) into the 64-bit elements of three consecutive registers (2), (3), and (4). If the predicate bit corresponding to an element in (2), (3), and (4) is zero, those three contiguous loads are skipped, and cannot cause a fault, and the elements are set to zero.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.