SVE Instruction List by Dougall Johnson
See "FMMLA" in the exploration tools

FMMLA: Floating-point matrix multiply-accumulate

FMMLA Zda.D, Zn.D, Zm.D (SVE+F64MM+NS
svfloat64_t svmmla[_f64](svfloat64_t op1, svfloat64_t op2, svfloat64_t op3)

128-bit SVE

This operation is undefined for 128-bit SVE.

256-bit SVE

Within each 256-bit segment, interpreting the 64-bit floats from (1), (2) and (3) as 2-by-2 matrices, multiply (1) by (2), add the resulting 2-by-2 matrix to (3), and write the result to (4). See the documentation for the exact order of operations.

512-bit SVE

Within each 256-bit segment, interpreting the 64-bit floats from (1), (2) and (3) as 2-by-2 matrices, multiply (1) by (2), add the resulting 2-by-2 matrix to (3), and write the result to (4). See the documentation for the exact order of operations.

Larger sizes

1024-bit SVE

Within each 256-bit segment, interpreting the 64-bit floats from (1), (2) and (3) as 2-by-2 matrices, multiply (1) by (2), add the resulting 2-by-2 matrix to (3), and write the result to (4). See the documentation for the exact order of operations.

2048-bit SVE

Within each 256-bit segment, interpreting the 64-bit floats from (1), (2) and (3) as 2-by-2 matrices, multiply (1) by (2), add the resulting 2-by-2 matrix to (3), and write the result to (4). See the documentation for the exact order of operations.

Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.