Running benchmarks...
  Threads: 1
  QoS: User Interactive
Determining FP64 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.125
  GOPS:         53.3332
Determining FP32 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.12252
  GOPS:         106.902
Determining FP16 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.15416
  GOPS:         207.943
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  100000000
  Duration (s): 1.48089
  GOPS:         64.826
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 1.52802
  GOPS:         31.4132
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 1.52811
  GOPS:         15.7057
Determining FP32 AMX performance...
  Repetitions:  300000000
  Duration (s): 1.53135
  GOPS:         2006.07
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  35000000
  Duration (s): 1.14117
  GOPS:         502.503
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  75000000
  Duration (s): 1.2227
  GOPS:         1004.99
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 1.01983
  GOPS:         2008.19
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 1.01882
  GOPS:         1005.08
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 1.01884
  GOPS:         1884.49
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  125000000
  Duration (s): 1.01871
  GOPS:         2010.39
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  125000000
  Duration (s): 1.17964
  GOPS:         434.032
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  125000000
  Duration (s): 1.2112
  GOPS:         845.441
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  125000000
  Duration (s): 1.27496
  GOPS:         1606.32
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  100000000
  Duration (s): 1.73278
  GOPS:         1891.07
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  50000000
  Duration (s): 1.68299
  GOPS:         1947.01
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 1.22256
  GOPS:         2010.21
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  75000000
  Duration (s): 1.22219
  GOPS:         2010.81
Determining FP64 SME FMOPA performance ...
  Repetitions:  125000000
  Duration (s): 1.01885
  GOPS:         502.525
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 1.22265
  GOPS:         4020.14
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 1.22249
  GOPS:         2010.33
Determining FP32 SME FMLA performance...
  Repetitions:  125000000
  Duration (s): 1.02024
  GOPS:         501.844
Determining FP64 SME FMLA performance...
  Repetitions:  150000000
  Duration (s): 1.22298
  GOPS:         251.19
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  100000000
  Duration (s): 1.32524
  GOPS:         618.151