SVE Instruction List by Dougall Johnson
FMLALB (vectors): Half-precision floating-point multiply-add long to single-precision (bottom)
FMLALB Zda.S, Zn.H, Zm.H (SVE2 (SME
svfloat32_t svmlalb[_f32](svfloat32_t op1, svfloat16_t op2, svfloat16_t op3)
128-bit SVE
For each even 16-bit float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
256-bit SVE
For each even 16-bit float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
512-bit SVE
For each even 16-bit float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
Larger sizes
1024-bit SVE
For each even 16-bit float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
2048-bit SVE
For each even 16-bit float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.