SVE Instruction List by Dougall Johnson
BFMLALT (vectors): BFloat16 floating-point multiply-add long to single-precision (top)
BFMLALT Zda.S, Zn.H, Zm.H (SVE+BF16 (SME+BF16
svfloat32_t svbfmlalt[_f32](svfloat32_t op1, svbfloat16_t op2, svbfloat16_t op3)
128-bit SVE
data:image/s3,"s3://crabby-images/5c421/5c421706012f1e3aedcdc9f05dd0b56c8c0ec126" alt=""
For each odd BFloat16 float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
256-bit SVE
data:image/s3,"s3://crabby-images/2c41c/2c41cf625b3e9195f33dde2d0df4e28291564c6a" alt=""
For each odd BFloat16 float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
512-bit SVE
data:image/s3,"s3://crabby-images/c1c06/c1c0650c57166aab6154b188c06ca68ebd938fbd" alt=""
For each odd BFloat16 float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
Larger sizes
1024-bit SVE
data:image/s3,"s3://crabby-images/5c67b/5c67b896687f58a60e550ccb4e7144e37c3fd832" alt=""
For each odd BFloat16 float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
2048-bit SVE
data:image/s3,"s3://crabby-images/ff073/ff07387c767f2049e5ea875af621b221b46bf287" alt=""
For each odd BFloat16 float calculate (1) * (2), and add that to the 32-bit float from (3), then set (4) to the result.
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.