SVE Instruction List by Dougall Johnson
FDOT (2-way, indexed, FP16 to FP32): Half-precision dot product by indexed element to single-precision
FDOT Zda.S, Zn.H, Zm.H[imm] (SVE2.1 or SME2)
svfloat32_t svdot_lane[_f32_f16](svfloat32_t zda, svfloat16_t zn, svfloat16_t zm, uint64_t imm_idx)
128-bit SVE

For each pair of 16-bit floats from (1) and (2), compute the dot product, then add the result to the corresponding 32-bit float accumulator from (3), setting (4) to the total. Within each 128-bit segment, the pair of values used from (1) is specified by imm. See the documentation for the exact order of operations.
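
The per-segment indexing can be easier to follow as scalar code. The following is a minimal sketch, not the architectural pseudocode: `fdot_lane_model` is a hypothetical helper name, the FP16 inputs are held in `float` for portability, and the exact order of operations and intermediate rounding are not modelled. It assumes the indexed operand is `zm` (as in the `Zm.H[imm]` syntax) and that `imm_idx` is in the range 0 to 3.

```c
#include <assert.h>
#include <stddef.h>

/* Scalar model of FDOT (2-way, indexed, FP16 to FP32).
 * Hypothetical helper for illustration; not an ACLE intrinsic.
 *   vl_bits  vector length in bits (a multiple of 128)
 *   zn, zm   vl_bits/16 elements each (FP16 values stored as float)
 *   zda, out vl_bits/32 elements each (FP32 accumulators)
 *   imm_idx  selects one of the four 16-bit pairs in each 128-bit
 *            segment of zm
 * Rounding and order of operations are NOT modelled. */
static void fdot_lane_model(size_t vl_bits, float *out, const float *zda,
                            const float *zn, const float *zm,
                            unsigned imm_idx)
{
    assert(vl_bits % 128 == 0 && imm_idx < 4);
    size_t n32 = vl_bits / 32;              /* number of 32-bit accumulators */
    for (size_t e = 0; e < n32; e++) {
        size_t seg = e / 4;                 /* four accumulators per segment */
        size_t p = 2 * (seg * 4 + imm_idx); /* first half of the chosen zm pair */
        out[e] = zda[e] + zn[2 * e] * zm[p] + zn[2 * e + 1] * zm[p + 1];
    }
}
```

With a 256-bit vector and `imm_idx` of 1, accumulators 0 to 3 all use `zm[2]` and `zm[3]`, while accumulators 4 to 7 use `zm[10]` and `zm[11]`: the same pair position, taken from each accumulator's own 128-bit segment.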
256-bit, 512-bit, 1024-bit, and 2048-bit SVE

The same description applies at every vector length: the operation is repeated for each 128-bit segment of the vector.
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.