Running benchmarks...
  Threads: 7
  QoS: User Interactive
Determining FP64 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.61738
  GOPS:         259.679
Determining FP32 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.58763
  GOPS:         529.091
Determining FP16 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.60471
  GOPS:         1046.92
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  100000000
  Duration (s): 2.03984
  GOPS:         329.438
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 6.28609
  GOPS:         53.4513
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 6.28495
  GOPS:         26.7305
Determining FP32 AMX performance...
  Repetitions:  300000000
  Duration (s): 9.17896
  GOPS:         2342.75
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  35000000
  Duration (s): 1.88734
  GOPS:         2126.85
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  75000000
  Duration (s): 3.7914
  GOPS:         2268.71
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 6.12363
  GOPS:         2341.09
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 7.15587
  GOPS:         1001.7
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 7.3021
  GOPS:         1840.57
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  125000000
  Duration (s): 7.30732
  GOPS:         1961.87
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  125000000
  Duration (s): 2.76897
  GOPS:         1294.35
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  125000000
  Duration (s): 4.70676
  GOPS:         1522.92
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  125000000
  Duration (s): 8.33433
  GOPS:         1720.11
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  100000000
  Duration (s): 12.5833
  GOPS:         1822.86
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  50000000
  Duration (s): 12.0775
  GOPS:         1899.2
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 7.63261
  GOPS:         2253.91
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  75000000
  Duration (s): 7.6472
  GOPS:         2249.61
Determining FP64 SME FMOPA performance ...
  Repetitions:  125000000
  Duration (s): 7.22957
  GOPS:         495.742
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 7.39
  GOPS:         4655.81
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 7.34915
  GOPS:         2340.84
Determining FP32 SME FMLA performance...
  Repetitions:  125000000
  Duration (s): 5.95553
  GOPS:         601.793
Determining FP64 SME FMLA performance...
  Repetitions:  150000000
  Duration (s): 7.26557
  GOPS:         295.971
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  100000000
  Duration (s): 7.51114
  GOPS:         763.453