SVE Instruction List by Dougall Johnson
BFCLAMP: BFloat16 floating-point clamp to minimum/maximum number
BFCLAMP Zd.H, Zn.H, Zm.H (SVE2+B16B16 (SME2+B16B16
128-bit SVE
data:image/s3,"s3://crabby-images/2e7e8/2e7e8c20bc760880f9035befe8c74fdda75fb7de" alt=""
For each BFloat16, clamp (1) to be between the lower-bound (2), and the upper-bound (3), and write the result to (4).
(4) = min( max( (2), (1) ), (3) )
256-bit SVE
data:image/s3,"s3://crabby-images/65a2b/65a2ba571ec02b329865a202fabaf8d21433591b" alt=""
For each BFloat16, clamp (1) to be between the lower-bound (2), and the upper-bound (3), and write the result to (4).
(4) = min( max( (2), (1) ), (3) )
512-bit SVE
data:image/s3,"s3://crabby-images/032a3/032a371347a26c7ed79162d7989935941ca15d60" alt=""
For each BFloat16, clamp (1) to be between the lower-bound (2), and the upper-bound (3), and write the result to (4).
(4) = min( max( (2), (1) ), (3) )
Larger sizes
1024-bit SVE
data:image/s3,"s3://crabby-images/59a47/59a47b3f5af17057457e8901fe912aa0922a78e8" alt=""
For each BFloat16, clamp (1) to be between the lower-bound (2), and the upper-bound (3), and write the result to (4).
(4) = min( max( (2), (1) ), (3) )
2048-bit SVE
data:image/s3,"s3://crabby-images/fc31f/fc31fff25ac0a803d8066a1f7802e9112afa0790" alt=""
For each BFloat16, clamp (1) to be between the lower-bound (2), and the upper-bound (3), and write the result to (4).
(4) = min( max( (2), (1) ), (3) )
Report mistakes or give feedback
Inspired by and based on the x86/x64 SIMD Instruction List by Daytime.