Running benchmarks...
  Threads: 8
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.4984
  GOPS:         128.137
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.47843
  GOPS:         259.735
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.48216
  GOPS:         518.162
Determining BF16-BF16-FP32 Neon BFMMLA performance...
  Repetitions:  40000000
  Duration (s): 1.7223
  GOPS:         178.366
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 10.2792
  GOPS:         22.4141
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 10.2801
  GOPS:         11.2061
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 2.28629
  GOPS:         358.31
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 9.14371
  GOPS:         358.366
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 9.14624
  GOPS:         358.267
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 9.14562
  GOPS:         358.292
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 9.14308
  GOPS:         179.196
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 9.14477
  GOPS:         335.93
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 9.14364
  GOPS:         358.369
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  60000000
  Duration (s): 8.23116
  GOPS:         238.858
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 13.7159
  GOPS:         286.686
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 24.6901
  GOPS:         318.522
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 46.6365
  GOPS:         337.26
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 90.5466
  GOPS:         347.415
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 10.9758
  GOPS:         358.257
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 10.9712
  GOPS:         358.407
Determining FP64 SME FMOPA performance...
  Repetitions:  25000000
  Duration (s): 9.14605
  GOPS:         89.5687
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 10.9718
  GOPS:         716.775
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 10.9701
  GOPS:         358.444
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 9.1464
  GOPS:         179.131
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 9.14582
  GOPS:         89.571
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 9.14601
  GOPS:         179.138