Running benchmarks...
  Threads: 3
  QoS: User Interactive
Determining FP32 Neon performance...
  Repetitions:  1000000000
  Total time:  2.457719
  GFLOPS: 292.954565
Determining FP32 SSVE performance...
  Repetitions:  100000000
  Total time:  9.293505
  GFLOPS: 30.989385
Determining FP32 AMX performance...
  Repetitions:  100000000
  Total time:  1.548561
  GFLOPS: 1983.777197
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  250000000
  Total time:  8.410678
  GFLOPS: 1460.999934
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  250000000
  Total time:  6.205980
  GFLOPS: 1980.025717
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  250000000
  Total time:  6.197202
  GFLOPS: 1982.830316
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  250000000
  Total time:  6.197384
  GFLOPS: 1982.772086
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  250000000
  Total time:  2.391891
  GFLOPS: 1284.339462
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  250000000
  Total time:  3.875066
  GFLOPS: 1585.521382
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  250000000
  Total time:  6.972680
  GFLOPS: 1762.306602
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  250000000
  Total time:  13.173500
  GFLOPS: 1865.563442
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  250000000
  Total time:  25.564691
  GFLOPS: 1922.651833
Determining FP32 SME BFMOPA performance (widening)...
  Repetitions:  250000000
  Total time:  12.395301
  GFLOPS: 1982.686826
Determining FP32 SME BFMOPA performance (widening)...
  Repetitions:  250000000
  Total time:  12.394469
  GFLOPS: 1982.819917