Running benchmarks...
  Threads: 2
  QoS: User Interactive
Determining FP32 Neon performance...
  Repetitions:  1000000000
  Total time:  2.324612
  GFLOPS: 206.486072
Determining FP32 SSVE performance...
  Repetitions:  100000000
  Total time:  6.195969
  GFLOPS: 30.987889
Determining FP32 AMX performance...
  Repetitions:  100000000
  Total time:  1.032406
  GFLOPS: 1983.715709
Determining FP32 SME FMOPA performance (1 tile)...
  Repetitions:  250000000
  Total time:  8.339321
  GFLOPS: 982.334173
Determining FP32 SME FMOPA performance (2 tiles)...
  Repetitions:  250000000
  Total time:  4.177101
  GFLOPS: 1961.168763
Determining FP32 SME FMOPA performance (4 tiles)...
  Repetitions:  250000000
  Total time:  4.131399
  GFLOPS: 1982.863432
Determining FP32 SME FMOPA performance (4 tiles, reordering)...
  Repetitions:  250000000
  Total time:  4.132010
  GFLOPS: 1982.570226
Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block)...
  Repetitions:  250000000
  Total time:  2.392870
  GFLOPS: 855.875998
Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)...
  Repetitions:  250000000
  Total time:  2.585106
  GFLOPS: 1584.461140
Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)...
  Repetitions:  250000000
  Total time:  4.649751
  GFLOPS: 1761.814772
Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)...
  Repetitions:  250000000
  Total time:  8.780381
  GFLOPS: 1865.978253
Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)...
  Repetitions:  250000000
  Total time:  17.042490
  GFLOPS: 1922.723733
Determining FP32 SME BFMOPA performance (widening)...
  Repetitions:  250000000
  Total time:  8.262835
  GFLOPS: 1982.854553
Determining FP32 SME BFMOPA performance (widening)...
  Repetitions:  250000000
  Total time:  8.263026
  GFLOPS: 1982.808719