SVE Instruction List by Dougall Johnson
LD1B (scalar plus scalar, single register): Contiguous load unsigned bytes to vector (scalar index)
LD1B { Zt.H }, Pg/Z, [Xn, Xm] (SVE (SME
svint16_t svld1ub_s16(svbool_t pg, const uint8_t *base)
svuint16_t svld1ub_u16(svbool_t pg, const uint8_t *base)
128-bit SVE
Load 8-bit values from the memory operand (1) and zero extend them, writing the results to the 16-bit elements of (2). If the predicate bit corresponding to an element in (2) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
256-bit SVE
Load 8-bit values from the memory operand (1) and zero extend them, writing the results to the 16-bit elements of (2). If the predicate bit corresponding to an element in (2) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
512-bit SVE
Load 8-bit values from the memory operand (1) and zero extend them, writing the results to the 16-bit elements of (2). If the predicate bit corresponding to an element in (2) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
Larger sizes
1024-bit SVE
Load 8-bit values from the memory operand (1) and zero extend them, writing the results to the 16-bit elements of (2). If the predicate bit corresponding to an element in (2) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
2048-bit SVE
Load 8-bit values from the memory operand (1) and zero extend them, writing the results to the 16-bit elements of (2). If the predicate bit corresponding to an element in (2) is zero, that load is skipped, and cannot cause a fault, and the element is set to zero.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.