See "BFMLALT (indexed)" in the exploration tools

BFMLALT (indexed): BFloat16 multiply-add by indexed element to single-precision (top)

BFMLALT Zda.S, Zn.H, Zm.H[imm] (SVE+BF16 (SME+BF16

svfloat32_t svbfmlalt_lane[_f32](svfloat32_t op1, svbfloat16_t op2, svbfloat16_t op3, uint64_t imm_index)

128-bit SVE

Multiply each odd BFloat16 from (2), with the BFloat16 specified by imm from (1), then add this to the 32-bit float from (3), and set (4) to the result.

256-bit SVE

Multiply each odd BFloat16 from (2), with the BFloat16 specified by imm from (1), then add this to the 32-bit float from (3), and set (4) to the result.

512-bit SVE

Multiply each odd BFloat16 from (2), with the BFloat16 specified by imm from (1), then add this to the 32-bit float from (3), and set (4) to the result.

Larger sizes

1024-bit SVE

Multiply each odd BFloat16 from (2), with the BFloat16 specified by imm from (1), then add this to the 32-bit float from (3), and set (4) to the result.

2048-bit SVE

Multiply each odd BFloat16 from (2), with the BFloat16 specified by imm from (1), then add this to the 32-bit float from (3), and set (4) to the result.

Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.