Running benchmarks...
  Threads: 2
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.06597
  GOPS:         45.0293
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.05831
  GOPS:         90.7108
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.05577
  GOPS:         181.857
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  40000000
  Duration (s): 1.23993
  GOPS:         61.939
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 2.57003
  GOPS:         22.4122
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 2.57311
  GOPS:         11.1927
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 0.572868
  GOPS:         357.499
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 2.28651
  GOPS:         358.275
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 2.28621
  GOPS:         358.322
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 2.28777
  GOPS:         358.079
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 2.2879
  GOPS:         179.029
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 2.2862
  GOPS:         335.928
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 2.28639
  GOPS:         358.295
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  60000000
  Duration (s): 2.05921
  GOPS:         238.693
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 3.43202
  GOPS:         286.431
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 6.17318
  GOPS:         318.487
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 11.6637
  GOPS:         337.127
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 22.6499
  GOPS:         347.213
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 2.74753
  GOPS:         357.79
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 2.74697
  GOPS:         357.863
Determining FP64 SME FMOPA performance...
  Repetitions:  25000000
  Duration (s): 2.28753
  GOPS:         89.5289
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 2.74636
  GOPS:         715.885
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 2.74526
  GOPS:         358.087
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 2.28845
  GOPS:         178.986
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 2.28785
  GOPS:         89.5164
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 2.28732
  GOPS:         179.074