Running benchmarks...
  Threads: 6
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.12376
  GOPS:         128.141
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.11409
  GOPS:         258.507
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.10899
  GOPS:         519.39
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  40000000
  Duration (s): 1.29549
  GOPS:         177.847
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 7.71007
  GOPS:         22.4122
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 7.70984
  GOPS:         11.2065
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 1.7147
  GOPS:         358.314
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 6.85666
  GOPS:         358.425
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 6.8579
  GOPS:         358.36
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 6.86087
  GOPS:         358.205
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 6.85774
  GOPS:         179.185
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 6.86004
  GOPS:         335.858
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 6.8572
  GOPS:         358.397
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  60000000
  Duration (s): 6.17205
  GOPS:         238.909
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 10.2883
  GOPS:         286.647
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 18.5193
  GOPS:         318.492
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 34.9833
  GOPS:         337.203
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 67.8986
  GOPS:         347.474
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 8.22676
  GOPS:         358.479
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 8.22562
  GOPS:         358.529
Determining FP64 SME FMOPA performance...
  Repetitions:  25000000
  Duration (s): 6.8577
  GOPS:         89.5927
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 8.22796
  GOPS:         716.854
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 8.22637
  GOPS:         358.496
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 6.8615
  GOPS:         179.086
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 6.85982
  GOPS:         89.565
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 6.86952
  GOPS:         178.877