Running benchmarks...
  Threads: 10
  QoS: User Interactive
Determining FP64 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.87125
  GOPS:         320.642
Determining FP32 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.88566
  GOPS:         636.381
Determining FP16 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.87755
  GOPS:         1278.26
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  100000000
  Duration (s): 2.3594
  GOPS:         406.883
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 8.9904
  GOPS:         53.3903
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 8.99671
  GOPS:         26.6764
Determining FP32 AMX performance...
  Repetitions:  300000000
  Duration (s): 13.1391
  GOPS:         2338.06
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  35000000
  Duration (s): 2.74377
  GOPS:         2089.97
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  75000000
  Duration (s): 5.50479
  GOPS:         2232.24
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 8.77161
  GOPS:         2334.8
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 10.6605
  GOPS:         960.56
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 10.8684
  GOPS:         1766.59
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  125000000
  Duration (s): 10.8573
  GOPS:         1886.28
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  125000000
  Duration (s): 4.13304
  GOPS:         1238.8
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  125000000
  Duration (s): 7.02505
  GOPS:         1457.64
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  125000000
  Duration (s): 12.3968
  GOPS:         1652.04
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  100000000
  Duration (s): 18.7359
  GOPS:         1748.94
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  50000000
  Duration (s): 18.4225
  GOPS:         1778.69
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 11.9837
  GOPS:         2050.79
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  75000000
  Duration (s): 12.4546
  GOPS:         1973.25
Determining FP64 SME FMOPA performance ...
  Repetitions:  125000000
  Duration (s): 12.052
  GOPS:         424.824
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 11.8746
  GOPS:         4139.24
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 12.2951
  GOPS:         1998.85
Determining FP32 SME FMLA performance...
  Repetitions:  125000000
  Duration (s): 10.5841
  GOPS:         483.746
Determining FP64 SME FMLA performance...
  Repetitions:  150000000
  Duration (s): 12.6645
  GOPS:         242.568
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  100000000
  Duration (s): 12.9843
  GOPS:         630.916