Running benchmarks...
  Threads: 5
  QoS: User Interactive
Determining FP64 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.38183
  GOPS:         217.104
Determining FP32 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.35508
  GOPS:         442.778
Determining FP16 Neon FMLA performance...
  Repetitions:  500000000
  Duration (s): 1.3543
  GOPS:         886.064
Determining BF16-BF16-FP32 BFMMLA Neon performance
  Repetitions:  100000000
  Duration (s): 1.69766
  GOPS:         282.742
Determining FP32 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 4.51523
  GOPS:         53.1535
Detemining FP64 SSVE FMLA (Z accumulation) performance...
  Repetitions:  50000000
  Duration (s): 4.49983
  GOPS:         26.6677
Determining FP32 AMX performance...
  Repetitions:  300000000
  Duration (s): 6.5576
  GOPS:         2342.32
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  35000000
  Duration (s): 1.34372
  GOPS:         2133.77
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  75000000
  Duration (s): 2.62583
  GOPS:         2339.83
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 4.37394
  GOPS:         2341.14
Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 4.37364
  GOPS:         1170.65
Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)...
  Repetitions:  125000000
  Duration (s): 4.37294
  GOPS:         2195.32
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  125000000
  Duration (s): 4.37335
  GOPS:         2341.45
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)..
  Repetitions:  125000000
  Duration (s): 1.64047
  GOPS:         1560.53
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  125000000
  Duration (s): 2.7345
  GOPS:         1872.37
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  125000000
  Duration (s): 4.91936
  GOPS:         2081.57
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  100000000
  Duration (s): 7.43451
  GOPS:         2203.78
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  50000000
  Duration (s): 7.22862
  GOPS:         2266.54
Determining FP16-FP16-FP32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 5.24685
  GOPS:         2341.98
Determining BF16-BF16-FP32 SME BFMOPA performance...
  Repetitions:  75000000
  Duration (s): 5.24715
  GOPS:         2341.84
Determining FP64 SME FMOPA performance ...
  Repetitions:  125000000
  Duration (s): 4.37376
  GOPS:         585.309
Determining I8-I8-I32 SME SMOPA performance...
  Repetitions:  75000000
  Duration (s): 5.24896
  GOPS:         4682.07
Determining I16-I16-I32 SME FMOPA performance...
  Repetitions:  75000000
  Duration (s): 5.24749
  GOPS:         2341.69
Determining FP32 SME FMLA performance...
  Repetitions:  125000000
  Duration (s): 3.79528
  GOPS:         674.522
Determining FP64 SME FMLA performance...
  Repetitions:  150000000
  Duration (s): 4.55381
  GOPS:         337.3
Determining BF16-BF16-FP32 SME BFDOT performance...
  Repetitions:  100000000
  Duration (s): 5.18924
  GOPS:         789.326