SVE Instruction List by Dougall Johnson
FDOT (vectors): Half-precision floating-point dot product
FDOT Zda.S, Zn.H, Zm.H (SVE2.1 (SME2
128-bit SVE
For each pair of 16-bit floats from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit float accumulator from (3), setting (4) to the total. See
the documentation for the exact order of operations.
256-bit SVE
For each pair of 16-bit floats from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit float accumulator from (3), setting (4) to the total. See
the documentation for the exact order of operations.
512-bit SVE
For each pair of 16-bit floats from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit float accumulator from (3), setting (4) to the total. See
the documentation for the exact order of operations.
Larger sizes
1024-bit SVE
For each pair of 16-bit floats from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit float accumulator from (3), setting (4) to the total. See
the documentation for the exact order of operations.
2048-bit SVE
For each pair of 16-bit floats from (1) and (2), compute the dot-product, then add the result to the corresponding 32-bit float accumulator from (3), setting (4) to the total. See
the documentation for the exact order of operations.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.