Running benchmarks...
  Threads: 1
  QoS: User Interactive
Determining FP32 Neon performance...
  Repetitions:  1000000000
  Total time:  2.158828
  GFLOPS: 111.171432
Determining FP32 SSVE performance...
  Repetitions:  100000000
  Total time:  3.062510
  GFLOPS: 31.346836
Determining FP32 AMX performance...
  Repetitions:  100000000
  Total time:  0.511152
  GFLOPS: 2003.317995
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  250000000
  Total time:  8.156035
  GFLOPS: 502.204809
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  250000000
  Total time:  4.076806
  GFLOPS: 1004.708097
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  250000000
  Total time:  2.038202
  GFLOPS: 2009.614356
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  250000000
  Total time:  2.040132
  GFLOPS: 2007.713226
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  250000000
  Total time:  2.359423
  GFLOPS: 434.004415
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  250000000
  Total time:  2.423596
  GFLOPS: 845.025326
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  250000000
  Total time:  2.550567
  GFLOPS: 1605.917429
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  250000000
  Total time:  4.334013
  GFLOPS: 1890.165073
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  250000000
  Total time:  8.413994
  GFLOPS: 1947.232194
Determining FP32 SME BFMOPA performance (widening)...
  Repetitions:  250000000
  Total time:  4.076279
  GFLOPS: 2009.675981
Determining FP32 SME BFMOPA performance (widening)...
  Repetitions:  250000000
  Total time:  4.076515
  GFLOPS: 2009.559636