Running benchmarks...
  Threads: 5
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.09936
  GOPS:         109.155
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.08423
  GOPS:         221.356
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.08271
  GOPS:         443.334
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  40000000
  Duration (s): 1.27598
  GOPS:         150.473
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 6.42966
  GOPS:         22.3962
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 6.4233
  GOPS:         11.2092
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 1.42994
  GOPS:         358.058
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 5.7142
  GOPS:         358.405
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 5.71339
  GOPS:         358.456
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 5.71659
  GOPS:         358.256
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 5.71459
  GOPS:         179.19
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 5.7127
  GOPS:         336.093
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 5.71656
  GOPS:         358.257
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  60000000
  Duration (s): 5.14434
  GOPS:         238.864
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 8.57488
  GOPS:         286.605
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 15.4287
  GOPS:         318.575
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 29.1443
  GOPS:         337.301
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 56.5715
  GOPS:         347.539
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 6.85621
  GOPS:         358.449
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 6.85859
  GOPS:         358.324
Determining FP64 SME FMOPA performance...
  Repetitions:  25000000
  Duration (s): 5.71353
  GOPS:         89.6118
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 6.85588
  GOPS:         716.932
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 6.85642
  GOPS:         358.438
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 5.71869
  GOPS:         179.062
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 5.71624
  GOPS:         89.5694
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 5.713
  GOPS:         179.24