Running benchmarks...
  Threads: 10
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.85909
  GOPS:         129.095
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.84292
  GOPS:         260.456
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.84078
  GOPS:         521.519
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  40000000
  Duration (s): 2.15185
  GOPS:         178.451
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 12.8423
  GOPS:         22.4258
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 12.8402
  GOPS:         11.2147
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 2.85654
  GOPS:         358.475
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 11.4248
  GOPS:         358.518
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 11.4288
  GOPS:         358.391
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 11.4254
  GOPS:         358.499
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 11.4236
  GOPS:         179.278
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 11.4243
  GOPS:         336.126
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 11.4244
  GOPS:         358.531
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  60000000
  Duration (s): 10.2807
  GOPS:         239.05
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 17.1378
  GOPS:         286.804
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 30.8495
  GOPS:         318.657
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 58.2759
  GOPS:         337.375
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 113.134
  GOPS:         347.567
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 13.7086
  GOPS:         358.548
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 13.7091
  GOPS:         358.534
Determining FP64 SME FMOPA performance ...
  Repetitions:  25000000
  Duration (s): 11.4278
  GOPS:         89.6061
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 13.7071
  GOPS:         717.177
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 13.7345
  GOPS:         357.873
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 11.4325
  GOPS:         179.139
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 11.4341
  GOPS:         89.5565
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 11.427
  GOPS:         179.225