Running benchmarks...
  Threads: 9
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.67344
  GOPS:         129.075
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.66599
  GOPS:         259.305
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.64886
  GOPS:         523.999
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  40000000
  Duration (s): 1.93677
  GOPS:         178.442
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 11.5558
  GOPS:         22.4304
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 11.5556
  GOPS:         11.2153
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 2.57124
  GOPS:         358.426
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 10.281
  GOPS:         358.565
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 10.2852
  GOPS:         358.416
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 10.2832
  GOPS:         358.489
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 10.2823
  GOPS:         179.259
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 10.2817
  GOPS:         336.13
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 10.2836
  GOPS:         358.473
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  60000000
  Duration (s): 9.25602
  GOPS:         238.962
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 15.4225
  GOPS:         286.832
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 27.7628
  GOPS:         318.676
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 52.446
  GOPS:         337.389
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 101.817
  GOPS:         347.579
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 12.34
  GOPS:         358.482
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 12.337
  GOPS:         358.571
Determining FP64 SME FMOPA performance ...
  Repetitions:  25000000
  Duration (s): 10.2828
  GOPS:         89.6251
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 12.3398
  GOPS:         716.978
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 12.3345
  GOPS:         358.643
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 10.285
  GOPS:         179.212
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 10.2871
  GOPS:         89.5877
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 10.2824
  GOPS:         179.258