Running benchmarks...
  Threads: 1
  QoS: Utility
Determining FP64 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.06191
  GOPS:         22.6007
Determining FP32 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.05276
  GOPS:         45.5943
Determining FP16 Neon FMLA performance...
  Repetitions:  200000000
  Duration (s): 1.05424
  GOPS:         91.0612
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  40000000
  Duration (s): 1.23542
  GOPS:         31.0826
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 1.28804
  GOPS:         22.3596
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  30000000
  Duration (s): 1.29008
  GOPS:         11.1621
Determining FP32 AMX performance...
  Repetitions:  10000000
  Duration (s): 0.286918
  GOPS:         356.896
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  25000000
  Duration (s): 1.1461
  GOPS:         357.386
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  25000000
  Duration (s): 1.14618
  GOPS:         357.362
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 1.14816
  GOPS:         356.744
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 1.14803
  GOPS:         178.393
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  25000000
  Duration (s): 1.1458
  GOPS:         335.137
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  25000000
  Duration (s): 1.14602
  GOPS:         357.41
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  60000000
  Duration (s): 1.03206
  GOPS:         238.125
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  60000000
  Duration (s): 1.71934
  GOPS:         285.877
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  60000000
  Duration (s): 3.09505
  GOPS:         317.617
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  60000000
  Duration (s): 5.84381
  GOPS:         336.438
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  60000000
  Duration (s): 11.3437
  GOPS:         346.638
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  15000000
  Duration (s): 1.3748
  GOPS:         357.521
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  15000000
  Duration (s): 1.37698
  GOPS:         356.955
Determining FP64 SME FMOPA performance...
  Repetitions:  25000000
  Duration (s): 1.14557
  GOPS:         89.3877
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 1.37362
  GOPS:         715.657
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  15000000
  Duration (s): 1.37437
  GOPS:         357.633
Determining FP32 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 1.14787
  GOPS:         178.417
Determining FP64 SME FMLA performance...
  Repetitions:  50000000
  Duration (s): 1.1474
  GOPS:         89.2454
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  25000000
  Duration (s): 1.14632
  GOPS:         178.659