Running benchmarks...
  Threads: 3
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.08588
  GOPS:         66.3059
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.08125
  GOPS:         133.179
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.07765
  GOPS:         267.248
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  40000000
  Duration (s): 1.26583
  GOPS:         91.0077
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 3.85718
  GOPS:         22.3998
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 3.85223
  GOPS:         11.2143
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 0.857413
  GOPS:         358.287
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 3.42891
  GOPS:         358.365
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 3.42846
  GOPS:         358.412
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 3.42916
  GOPS:         358.338
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 3.42866
  GOPS:         179.195
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 3.42775
  GOPS:         336.081
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 3.42905
  GOPS:         358.35
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  60000000
  Duration (s): 3.08622
  GOPS:         238.894
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 5.14302
  GOPS:         286.711
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 9.25888
  GOPS:         318.518
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 17.4918
  GOPS:         337.2
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 33.9534
  GOPS:         347.431
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 4.11359
  GOPS:         358.461
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 4.11393
  GOPS:         358.431
Determining FP64 SME FMOPA performance ...
  Repetitions:  25000000
  Duration (s): 3.42872
  GOPS:         89.5961
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 4.11561
  GOPS:         716.569
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 4.11351
  GOPS:         358.467
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 3.43052
  GOPS:         179.098
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 3.43201
  GOPS:         89.5103
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 3.42921
  GOPS:         179.166