Running benchmarks...
  Threads: 4
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.08354
  GOPS:         88.5986
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.07723
  GOPS:         178.235
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.0773
  GOPS:         356.447
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  40000000
  Duration (s): 1.26349
  GOPS:         121.568
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 5.13899
  GOPS:         22.4169
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 5.13884
  GOPS:         11.2087
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 1.14319
  GOPS:         358.297
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 4.57175
  GOPS:         358.375
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 4.57238
  GOPS:         358.325
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 4.57192
  GOPS:         358.362
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 4.57289
  GOPS:         179.143
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 4.57106
  GOPS:         336.027
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 4.57208
  GOPS:         358.349
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  60000000
  Duration (s): 4.11463
  GOPS:         238.913
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 6.85733
  GOPS:         286.712
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 12.3421
  GOPS:         318.598
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 23.3123
  GOPS:         337.347
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 45.2637
  GOPS:         347.489
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 5.48339
  GOPS:         358.552
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 5.48532
  GOPS:         358.426
Determining FP64 SME FMOPA performance ...
  Repetitions:  25000000
  Duration (s): 4.57049
  GOPS:         89.6183
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 5.48652
  GOPS:         716.694
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 5.48577
  GOPS:         358.396
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 4.57397
  GOPS:         179.101
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 4.57326
  GOPS:         89.5641
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 4.57263
  GOPS:         179.153