SVE Instruction List by Dougall Johnson
# LD3H (scalar plus scalar): Contiguous load three-halfword structures to three vectors (scalar index)

LD3H { Zt1.H, Zt2.H, Zt3.H }, Pg/Z, [Xn, Xm, LSL #1] (SVE, SME)

svbfloat16x3_t svld3[_bf16](svbool_t pg, const bfloat16_t *base)

svfloat16x3_t svld3[_f16](svbool_t pg, const float16_t *base)

svint16x3_t svld3[_s16](svbool_t pg, const int16_t *base)

svuint16x3_t svld3[_u16](svbool_t pg, const uint16_t *base)
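As a rough illustration of how these intrinsics are typically used, the sketch below splits an interleaved stream of 16-bit triples into three planar arrays. The helper name `deinterleave3_u16` is hypothetical. The SVE path is compiled only when `__ARM_FEATURE_SVE` is defined, and a plain scalar loop stands in otherwise. Whether the compiler picks the scalar-plus-scalar form of LD3H for the `svld3_u16` call is its own decision.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__ARM_FEATURE_SVE)
#include <arm_sve.h>
#endif

/* Hypothetical helper: src holds n interleaved triples
 * (a0 b0 c0 a1 b1 c1 ...); a, b and c each receive n values. */
void deinterleave3_u16(const uint16_t *src, uint16_t *a, uint16_t *b,
                       uint16_t *c, size_t n)
{
    assert(src && a && b && c);
#if defined(__ARM_FEATURE_SVE)
    for (size_t i = 0; i < n; i += svcnth()) {
        /* Lanes for triples i .. n-1 are active; inactive lanes are
         * neither loaded nor stored. */
        svbool_t pg = svwhilelt_b16((uint64_t)i, (uint64_t)n);
        svuint16x3_t v = svld3_u16(pg, src + 3 * i);
        svst1_u16(pg, a + i, svget3_u16(v, 0));
        svst1_u16(pg, b + i, svget3_u16(v, 1));
        svst1_u16(pg, c + i, svget3_u16(v, 2));
    }
#else
    /* Scalar stand-in that produces the same result without SVE. */
    for (size_t i = 0; i < n; i++) {
        a[i] = src[3 * i + 0];
        b[i] = src[3 * i + 1];
        c[i] = src[3 * i + 2];
    }
#endif
}
```

The loop is vector-length agnostic: `svcnth()` gives the number of 16-bit elements per vector at run time, and the `svwhilelt_b16` predicate handles the tail without a separate scalar epilogue.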

## 128-bit SVE

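Load groups of three interleaved 16-bit values from the memory operand [Xn, Xm, LSL #1] (address Xn + Xm * 2) and deinterleave them into the 16-bit elements of the three consecutive registers Zt1, Zt2, and Zt3: the first value of each group goes to Zt1, the second to Zt2, and the third to Zt3. If the predicate bit in Pg corresponding to an element is zero, the three loads for that group are skipped and cannot cause a fault, and that element is set to zero in Zt1, Zt2, and Zt3. With 128-bit vectors, each register holds 8 halfword elements, so a fully active load reads 24 consecutive halfwords (48 bytes).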
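The behaviour described above can be sketched as a scalar reference model. This is an illustration under stated assumptions, not the architectural pseudocode: the function name `ld3h_model` and its parameters are hypothetical, the predicate is simplified to one `bool` per 16-bit element, and `nelem` stands for the number of halfword elements per vector (vector length in bits divided by 16).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model of LD3H (scalar plus scalar).
 * xn    : base address
 * xm    : index in halfwords (the LSL #1 scales it to bytes)
 * pg    : one flag per 16-bit element (simplified predicate)
 * nelem : elements per vector, 8 for 128-bit up to 128 for 2048-bit */
void ld3h_model(const void *xn, uint64_t xm, const bool *pg, size_t nelem,
                uint16_t *zt1, uint16_t *zt2, uint16_t *zt3)
{
    assert(nelem >= 8 && nelem <= 128);
    const unsigned char *addr = (const unsigned char *)xn + (xm << 1);

    for (size_t e = 0; e < nelem; e++) {
        if (pg[e]) {
            /* Active element: read one group of three halfwords. */
            uint16_t group[3];
            memcpy(group, addr + e * sizeof group, sizeof group);
            zt1[e] = group[0];
            zt2[e] = group[1];
            zt3[e] = group[2];
        } else {
            /* Inactive element: memory is not touched, so no fault can
             * occur, and the destination elements are zeroed (Pg/Z). */
            zt1[e] = 0;
            zt2[e] = 0;
            zt3[e] = 0;
        }
    }
}
```

Because inactive groups are never read, a predicate that covers only the valid tail of a buffer keeps the model (like the instruction) from reading past its end.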

## 256-bit SVE

With 256-bit vectors, each of the three destination registers holds 16 halfword elements, so a fully active load reads 48 consecutive halfwords (96 bytes). Deinterleaving and predication behave as described for 128-bit SVE.

## 512-bit SVE

With 512-bit vectors, each of the three destination registers holds 32 halfword elements, so a fully active load reads 96 consecutive halfwords (192 bytes). Deinterleaving and predication behave as described for 128-bit SVE.

## Larger sizes

## 1024-bit SVE

With 1024-bit vectors, each of the three destination registers holds 64 halfword elements, so a fully active load reads 192 consecutive halfwords (384 bytes). Deinterleaving and predication behave as described for 128-bit SVE.

## 2048-bit SVE

With 2048-bit vectors, each of the three destination registers holds 128 halfword elements, so a fully active load reads 384 consecutive halfwords (768 bytes). Deinterleaving and predication behave as described for 128-bit SVE.

Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.