Apple Microarchitecture Research by Dougall Johnson

M1/A14 P-core (Firestorm): Overview | Base Instructions | SIMD and FP Instructions
M1/A14 E-core (Icestorm):  Overview | Base Instructions | SIMD and FP Instructions

MRS (SSBS)

Test 1: uops

Code:

  mrs x0, s3_3_c4_c2_6

(no loop instructions)

1000 unrolls and 1 iteration

Retires: 1.000

Issues: 1.000

Integer unit issues: 1.000

Load/store unit issues: 0.000

SIMD/FP unit issues: 0.000

retire uop (01)cycle (02)031e3f51schedule uop (52)schedule int uop (53)6d6emap stall dispatch (70)map rewind (75)map stall (76)8283flush restart other nonspec (84)85inst all (8c)l1d cache writeback (a8)accfd5map dispatch bubble (d6)ddfetch restart (de)e0? int output thing (e9)? int retires (ef)f5f6f7f8fd
100410348010192610001000103410348653882103416411100100732261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100103731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410347010192610001000103410348653882103416411100110731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100110731261110311000100010351035103510351035

Test 2: throughput

Count: 8

Code:

  mrs x0, s3_3_c4_c2_6
  mrs x1, s3_3_c4_c2_6
  mrs x2, s3_3_c4_c2_6
  mrs x3, s3_3_c4_c2_6
  mrs x4, s3_3_c4_c2_6
  mrs x5, s3_3_c4_c2_6
  mrs x6, s3_3_c4_c2_6
  mrs x7, s3_3_c4_c2_6

(fused SUBS/B.cc loop)

100 unrolls and 100 iterations

Result (median cycles for code divided by count): 1.0004

retire uop (01)cycle (02)03mmu table walk data (08)09l2 tlb miss data (0b)18191e1f3f51schedule uop (52)schedule int uop (53)dispatch int uop (56)int uops in schedulers (59)60696a6b6d6emap stall dispatch (70)map rewind (75)map stall (76)dispatch uop (78)map int uop (7c)map int uop inputs (7f)8283flush restart other nonspec (84)85inst all (8c)inst branch (8d)inst branch taken (90)inst branch cond (94)inst int alu (97)9fl1d tlb access (a0)l1d tlb miss (a1)ld unit uop (a6)st unit uop (a7)l1d cache writeback (a8)a9acc2cfd5map dispatch bubble (d6)dbddfetch restart (de)e0? int output thing (e9)ea? int retires (ef)f5f6f7f8fd
802048003962100000120800202680100801001005000497695508003580035699663699841002002008003516411802011009910010010000001000511022502280032800000801008003680036800368003680036
8020480035620000000080020788010080100100500049769550800358003569966369984100200200800351641180201100991001001000000001980511022502280077800000801008003680036800368003680036
8020480035621000001208002026801008010010050004977840080741809147028087706601252042028096116417180201100991001001002000001204305471227302381223803800801008140681410811878148881135
8020481444631100019176800203938035080351124638049777450810968113970356997084512520220081051164251802011009910010010000004036080511022502280032800000801008003680036800368003680036
802048003562000000120800202680175801001005000497695508003580035699663699841002002008003516411802011009910010010000001000515922502280032800000801008003680036800368003680036
80204800356200000000800202680100801001005000497695508003580035699663699841002002008003516411802011009910010010000000000511022502280032800000801008003680036800368003680036
80204800356200000000800202680100801001005001497695508003580035699663699841002002008003516411802011009910010010000000000511022502280032800000801008003680036800368003680036
80204800356200000000800202680100801001005000497695508003580035699663699841002002008003516411802011009910010010000001000511022502280032800000801008003680036800718003680036
80204800356200000000800202680100801001005000497695508003580035699663699841002002008003516411802011009910010010000000000511022502280032800000801008003680036800368003680036
80204800356200000000800202680173801001165000497713008003580035699663699841002002008016516441802011009910010010000200414252514825702280165800650801008016980169801678021380036

1000 unrolls and 10 iterations

Result (median cycles for code divided by count): 1.0004

retire uop (01)cycle (02)03l2 tlb miss instruction (0a)1e3f51schedule uop (52)schedule int uop (53)dispatch int uop (56)int uops in schedulers (59)60696a6b6d6emap stall dispatch (70)map rewind (75)map stall (76)dispatch uop (78)map int uop (7c)map int uop inputs (7f)8283flush restart other nonspec (84)85inst all (8c)inst branch (8d)inst branch taken (90)inst branch cond (94)inst int alu (97)9fl1d cache writeback (a8)accdcfd5map dispatch bubble (d6)ddfetch restart (de)e0? int output thing (e9)? int retires (ef)f5f6f7f8fd
8002480040621012800202680010800411050049769550800358003569988370006102020800351641180021109101010100502315251168003280000800108003680036800438003680036
80024800356200980020268001080010105004976955080035800356998837000610202080035164118002110910101000050205255138003280000800108003680036800368003680036
8002480035620008002026800108001010500497695508003580035699883700061020208003516411800211091010100305020625878003280000800108003680036800368003680036
80024800356210080020268001080010105004976955080035800356998837000610202080035164118002110910101000050201025668003280000800108003680036800368003680036
800248003562001280020268001080010105004976955080035800356998837000610202080035164118002110910101003050201025878003280000800108003680036800368003680036
80024800356210080020528001080010105014976955080035800356998837000610202080035164118002110910101000050201125668003280000800108003680036800368003680036
80024800356200080020268001080010105014976955080035800356998837000610202080035164118002110910101000050206256128003280000800108003680036800368003680036
8002480035620038002026800108001010501497695508003580035699883700061020208003516411800211091010100005020825778003280000800108003680036800368003680036
8002480035620008002026800108001010500497695508003580035699883700061020208003516411800211091010100005020725778003280000800108003680036800368003680036
8002480035620008002026800108001010500497695508003580035699883700061020208003516421800211091010100005020625658003280000800108003680036800368003680036