Running benchmarks...
  Threads: 2
  QoS: User Interactive
Determining FP64 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.22089
  GOPS:         98.2891
Determining FP32 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.18768
  GOPS:         202.075
Determining FP16 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.19762
  GOPS:         400.795
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  100000000
  Duration (s): 1.5124
  GOPS:         126.951
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 3.09814
  GOPS:         30.9864
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 3.09839
  GOPS:         15.4919
Determining FP32 AMX performance...
  Repetitions:  300000000
  Duration (s): 3.09713
  GOPS:         1983.77
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  35000000
  Duration (s): 1.16395
  GOPS:         985.332
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  75000000
  Duration (s): 1.25468
  GOPS:         1958.75
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 2.06593
  GOPS:         1982.64
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 2.06589
  GOPS:         991.338
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 2.06591
  GOPS:         1858.74
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  125000000
  Duration (s): 2.06587
  GOPS:         1982.7
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  125000000
  Duration (s): 1.19773
  GOPS:         854.953
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  125000000
  Duration (s): 1.29268
  GOPS:         1584.31
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  125000000
  Duration (s): 2.32475
  GOPS:         1761.91
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  100000000
  Duration (s): 3.51261
  GOPS:         1865.73
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  50000000
  Duration (s): 3.40894
  GOPS:         1922.47
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 2.47913
  GOPS:         1982.63
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  75000000
  Duration (s): 2.479
  GOPS:         1982.73
Determining FP64 SME FMOPA performance...
  Repetitions:  125000000
  Duration (s): 2.06591
  GOPS:         495.664
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 2.47933
  GOPS:         3964.94
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 2.47899
  GOPS:         1982.75
Determining FP32 SME FMLA performance...
  Repetitions:  125000000
  Duration (s): 2.06613
  GOPS:         495.613
Determining FP64 SME FMLA performance...
  Repetitions:  150000000
  Duration (s): 2.47892
  GOPS:         247.85
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  100000000
  Duration (s): 2.68551
  GOPS:         610.089