Apple Microarchitecture Research by Dougall Johnson

M1/A14 P-core (Firestorm): Overview | Base Instructions | SIMD and FP Instructions
M1/A14 E-core (Icestorm):  Overview | Base Instructions | SIMD and FP Instructions

MRS (DIT)

Test 1: uops

Code:

  mrs x0, s3_3_c4_c2_5

(no loop instructions)

1000 unrolls and 1 iteration

Retires: 1.000

Issues: 1.000

Integer unit issues: 1.000

Load/store unit issues: 0.000

SIMD/FP unit issues: 0.000

retire uop (01)cycle (02)031e3f51schedule uop (52)schedule int uop (53)6d6emap stall dispatch (70)map rewind (75)map stall (76)8283flush restart other nonspec (84)85inst all (8c)l1d cache writeback (a8)accfd5map dispatch bubble (d6)ddfetch restart (de)e0? int output thing (e9)? int retires (ef)f5f6f7f8fd
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100110731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
1004103481210192610001000103410348653882103416411100160731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100130731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100100731261110311000100010351035103510351035
100410348010192610001000103410348653882103416411100110731261110311000100010351035103510351035

Test 2: throughput

Count: 8

Code:

  mrs x0, s3_3_c4_c2_5
  mrs x1, s3_3_c4_c2_5
  mrs x2, s3_3_c4_c2_5
  mrs x3, s3_3_c4_c2_5
  mrs x4, s3_3_c4_c2_5
  mrs x5, s3_3_c4_c2_5
  mrs x6, s3_3_c4_c2_5
  mrs x7, s3_3_c4_c2_5

(fused SUBS/B.cc loop)

100 unrolls and 100 iterations

Result (median cycles for code divided by count): 1.0004

retire uop (01)cycle (02)03mmu table walk data (08)l2 tlb miss data (0b)18191e1f3f51schedule uop (52)schedule int uop (53)dispatch int uop (56)int uops in schedulers (59)60696a6b6d6emap stall dispatch (70)map rewind (75)map stall (76)dispatch uop (78)7amap int uop (7c)map int uop inputs (7f)8283flush restart other nonspec (84)85inst all (8c)inst branch (8d)inst branch taken (90)inst branch cond (94)inst int alu (97)9fl1d tlb access (a0)l1d tlb miss (a1)st unit uop (a7)l1d cache writeback (a8)a9acc2branch mispred nonspec (cb)cfd5map dispatch bubble (d6)ddfetch restart (de)e0? int output thing (e9)? int retires (ef)f5f6f7f8fd
8020480062621000090080020268010080100100500149769550800358003569966369984100020020080035164118020110099100100100000000005110325228003280000801008003680036800368003680036
8020480035620000087080020268010080100100500049769550800358003569966369984100020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036
8020480035620000075080020268010080100100500049769550800358003569966369984124020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036
80204800356200000477080020268010080100100500049769550800358003569966369984100020020080035164118020110099100100100000000015110225228003280000801008003680036800368003680036
80204800356200000453080020268010080100123500049769550800358003569966369984100020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036
80204800356200000270080020268010080100100500049769550800358003569966369984100020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036
8020480035620000069080020268010080100100500049769550800358003569966369984100020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036
8020480035620000069080020268010080100100500049769550800358003569966369984100020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036
80204800356200000255080020268010080100100500049769550800358003569966369984100020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036
80204800356200000483080020268010080100100500049769550800358003569966369984100020020080035164118020110099100100100000000005110225228003280000801008003680036800368003680036

1000 unrolls and 10 iterations

Result (median cycles for code divided by count): 1.0004

retire uop (01)cycle (02)03mmu table walk data (08)09l2 tlb miss instruction (0a)18191e1f3a3f51schedule uop (52)schedule int uop (53)dispatch int uop (56)int uops in schedulers (59)60696a6b6d6emap stall dispatch (70)map rewind (75)map stall (76)dispatch uop (78)map int uop (7c)map int uop inputs (7f)8283flush restart other nonspec (84)85inst all (8c)inst branch (8d)inst branch taken (90)inst branch cond (94)inst int alu (97)9fl1d tlb access (a0)l1d tlb miss (a1)l1d cache writeback (a8)accfd0d5map dispatch bubble (d6)ddfetch restart (de)e0? int output thing (e9)eaeb? int retires (ef)f5f6f7f8fd
8002480050642000002700800202680010800101050149769550800358003569988370006102020800351641180021109101010001125020016251411800328000000800108003680036800368003680036
8002480035621000000008002026800108001010501497695508003580035699883700061020208003516411800211091010102024075020012251310800328000000800108003680036800368003680036
80024800356200000100080020268001080010105014976955080035800806998837000610202080035164118002110910101000305020014251216800328000000800108003680036800368003680036
8002480035620000000008006326800108001010501497695508003580035699883700061020208003516411800211091010100000502001625128800328000000800108008080036800368003680036
80024800356200000000080020268001080010105014976955080035800356998837000610202080035164118002110910101000005020012251111800328000000800108003680036800368003680036
80024800356200000000080020268001080010125014976999080035800356998837000610202080035164118002110910101000005020011251215800328000000800108003680036800368003680036
80024800356200000000080020268001080010105004976955080035800356998837000610202080035164118002110910101000105020014251314800328000000800108003680036800368003680036
80024800356210000000080020268001080010105014976955080035800356998837000610202080035164118002110910101000005020015251414800328000000800108003680036800368003680080
80024800356210000000080020268001080010105014976955080035800356998837000610202080035164118002110910101000005020010251613800328000000800108003680036800368003680036
8002480035621000000008002026800108001010500497695508003580035699883700061020208003516411800211091010100010502001425914800328000000800108007980079801238003680036