Running benchmarks...
  Threads: 7
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.30823
  GOPS:         128.418
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.30352
  GOPS:         257.764
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.29996
  GOPS:         516.941
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  40000000
  Duration (s): 1.51096
  GOPS:         177.901
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 8.9941
  GOPS:         22.4147
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 8.99267
  GOPS:         11.2091
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 2.00057
  GOPS:         358.297
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 8.00087
  GOPS:         358.361
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 8.00099
  GOPS:         358.356
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 7.99928
  GOPS:         358.432
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 8.00354
  GOPS:         179.121
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 8.00164
  GOPS:         335.931
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 8.00391
  GOPS:         358.225
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  60000000
  Duration (s): 7.20409
  GOPS:         238.798
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 12.0006
  GOPS:         286.705
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 21.601
  GOPS:         318.563
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 40.8035
  GOPS:         337.288
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 79.2119
  GOPS:         347.487
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 9.60102
  GOPS:         358.362
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 9.59689
  GOPS:         358.516
Determining FP64 SME FMOPA performance...
  Repetitions:  25000000
  Duration (s): 8.00247
  GOPS:         89.5723
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 9.60274
  GOPS:         716.596
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 9.5996
  GOPS:         358.415
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 8.00475
  GOPS:         179.094
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 8.01166
  GOPS:         89.4696
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 8.00335
  GOPS:         179.125