Running benchmarks...
  Threads: 9
  QoS: User Interactive
Determining FP64 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.80968
  GOPS:         298.394
Determining FP32 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.77895
  GOPS:         607.099
Determining FP16 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.79452
  GOPS:         1203.66
Determining BF16-BF16-FP32 BFMMLA Neon performance...
  Repetitions:  100000000
  Duration (s): 2.25427
  GOPS:         383.273
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 8.09675
  GOPS:         53.3547
Determining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 8.10346
  GOPS:         26.6553
Determining FP32 AMX performance...
  Repetitions:  300000000
  Duration (s): 11.8014
  GOPS:         2342.77
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  35000000
  Duration (s): 2.43775
  GOPS:         2117.1
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  75000000
  Duration (s): 4.74829
  GOPS:         2329.09
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 7.91487
  GOPS:         2328.78
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 7.87465
  GOPS:         1170.34
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 8.00335
  GOPS:         2159.1
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  125000000
  Duration (s): 8.46592
  GOPS:         2177.2
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  125000000
  Duration (s): 3.8162
  GOPS:         1207.48
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  125000000
  Duration (s): 6.28543
  GOPS:         1466.25
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  125000000
  Duration (s): 11.123
  GOPS:         1657.11
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  100000000
  Duration (s): 16.7068
  GOPS:         1765.22
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  50000000
  Duration (s): 15.9868
  GOPS:         1844.72
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 9.9443
  GOPS:         2224.23
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  75000000
  Duration (s): 9.93328
  GOPS:         2226.7
Determining FP64 SME FMOPA performance...
  Repetitions:  125000000
  Duration (s): 9.5411
  GOPS:         482.963
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 9.53337
  GOPS:         4640.2
Determining I16-I16-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 9.60938
  GOPS:         2301.75
Determining FP32 SME FMLA performance...
  Repetitions:  125000000
  Duration (s): 7.96512
  GOPS:         578.522
Determining FP64 SME FMLA performance...
  Repetitions:  150000000
  Duration (s): 9.48289
  GOPS:         291.557
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  100000000
  Duration (s): 9.74967
  GOPS:         756.211