Running benchmarks...
  Threads: 3
  QoS: User Interactive
Determining FP64 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.23928
  GOPS:         145.245
Determining FP32 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.2133
  GOPS:         296.712
Determining FP16 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.21288
  GOPS:         593.628
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  100000000
  Duration (s): 1.52908
  GOPS:         188.349
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 4.6463
  GOPS:         30.9924
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 4.64632
  GOPS:         15.4961
Determining FP32 AMX performance...
  Repetitions:  300000000
  Duration (s): 4.64576
  GOPS:         1983.74
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  35000000
  Duration (s): 1.16908
  GOPS:         1471.51
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  75000000
  Duration (s): 1.85972
  GOPS:         1982.23
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 3.09866
  GOPS:         1982.79
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 3.09867
  GOPS:         991.392
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 3.09859
  GOPS:         1858.91
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  125000000
  Duration (s): 3.09875
  GOPS:         1982.74
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  125000000
  Duration (s): 1.19605
  GOPS:         1284.23
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  125000000
  Duration (s): 1.93824
  GOPS:         1584.95
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  125000000
  Duration (s): 3.48643
  GOPS:         1762.26
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  100000000
  Duration (s): 5.26856
  GOPS:         1865.86
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  50000000
  Duration (s): 5.11346
  GOPS:         1922.46
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 3.71849
  GOPS:         1982.74
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  75000000
  Duration (s): 3.71865
  GOPS:         1982.66
Determining FP64 SME FMOPA performance...
  Repetitions:  125000000
  Duration (s): 3.09878
  GOPS:         495.68
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 3.71668
  GOPS:         3967.41
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 3.71832
  GOPS:         1982.83
Determining FP32 SME FMLA performance...
  Repetitions:  125000000
  Duration (s): 3.09865
  GOPS:         495.7
Determining FP64 SME FMLA performance...
  Repetitions:  150000000
  Duration (s): 3.71847
  GOPS:         247.844
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  100000000
  Duration (s): 4.02857
  GOPS:         610.043