SVE Instruction List by Dougall Johnson
UDOT (4-way, indexed): Unsigned integer indexed dot product
UDOT Zda.D, Zn.H, Zm.H[imm] (SVE (SME
svuint64_t svdot_lane[_u64](svuint64_t op1, svuint16_t op2, svuint16_t op3, uint64_t imm_index)
128-bit SVE
For each group of four unsigned 16-bit integers from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit integer accumulator from (3), setting (4) to the total. Within each 128-bit segment, the group of four values used from (1) is specified by imm
.
256-bit SVE
For each group of four unsigned 16-bit integers from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit integer accumulator from (3), setting (4) to the total. Within each 128-bit segment, the group of four values used from (1) is specified by imm
.
512-bit SVE
For each group of four unsigned 16-bit integers from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit integer accumulator from (3), setting (4) to the total. Within each 128-bit segment, the group of four values used from (1) is specified by imm
.
Larger sizes
1024-bit SVE
For each group of four unsigned 16-bit integers from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit integer accumulator from (3), setting (4) to the total. Within each 128-bit segment, the group of four values used from (1) is specified by imm
.
2048-bit SVE
For each group of four unsigned 16-bit integers from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit integer accumulator from (3), setting (4) to the total. Within each 128-bit segment, the group of four values used from (1) is specified by imm
.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.