|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
-35.08% |
27.158 |
17.630 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC31
|
-34.28% |
50.030 |
32.878 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
-34.26% |
51.460 |
33.831 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC32
|
-34.25% |
51.460 |
33.832 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
-33.82% |
97.201 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint8_t_
|
-33.58% |
28614.397 |
19004.484 |
7.568 |
-0.02% |
7.568 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
-33.33% |
27.874 |
18.583 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
-33.33% |
49.315 |
32.877 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC15
|
-33.33% |
26.445 |
17.630 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
-33.33% |
95.056 |
63.374 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC64
|
-33.33% |
96.481 |
64.325 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC16
|
-33.33% |
27.874 |
18.584 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC63
|
-33.33% |
95.053 |
63.374 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1BigLoopWithReductionTC3
|
-32.05% |
27.875 |
18.940 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC2
|
-31.14% |
7.862 |
5.414 |
0.061 |
-0.77% |
0.061 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
-30.86% |
7.862 |
5.435 |
0.067 |
0.00% |
0.067 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
-30.43% |
16.438 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8
|
-26.08% |
16.438 |
12.151 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
-25.00% |
11.435 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
SingleSource/Benchmarks/BenchmarkGame/puzzle
Profile
|
-24.87% |
1.120 |
0.842 |
0.002 |
-1.25% |
0.002 |
|
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-ary3
Profile
|
-24.56% |
6.027 |
4.546 |
0.046 |
-0.95% |
0.046 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
-23.08% |
9.291 |
7.147 |
0.001 |
-0.04% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
-22.21% |
192.260 |
149.567 |
0.139 |
-0.11% |
0.139 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/999
|
-21.74% |
2169.108 |
1697.500 |
2.351 |
-0.04% |
2.351 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128
|
-21.74% |
192.972 |
151.029 |
2.891 |
-1.50% |
2.891 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int64_t_
|
-21.66% |
863071.605 |
676109.284 |
502.659 |
-0.04% |
502.659 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/999
|
-21.39% |
1100.670 |
865.207 |
2.191 |
0.13% |
2.191 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_int64_t_
|
-21.33% |
1293268.022 |
1017419.214 |
1340.917 |
0.25% |
1340.917 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256
|
-20.47% |
576.060 |
458.161 |
2.419 |
-0.16% |
2.419 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC4
|
-20.00% |
10.721 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
-19.67% |
305.174 |
245.160 |
0.887 |
-0.00% |
0.887 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/51
|
-19.56% |
131.507 |
105.778 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
-19.27% |
193.690 |
156.366 |
0.830 |
-0.56% |
0.830 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/51
|
-19.26% |
77.900 |
62.894 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
-18.42% |
54.320 |
44.311 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/28
|
-18.26% |
82.194 |
67.181 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
-18.21% |
192.258 |
157.241 |
0.014 |
-0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
-16.68% |
14313.837 |
11926.749 |
0.442 |
-0.01% |
0.442 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint64_t_
|
-16.65% |
14302.861 |
11921.299 |
6.411 |
-0.01% |
6.411 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/16
|
-16.46% |
56.463 |
47.170 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/171
|
-16.35% |
0.494 |
0.413 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/CoyoteBench/huffbench
Profile
|
-16.29% |
38.293 |
32.057 |
0.015 |
0.05% |
0.015 |
|
SingleSource/Benchmarks/Shootout/Shootout-sieve
Profile
|
-15.78% |
8.469 |
7.133 |
0.019 |
-0.67% |
0.019 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/10
|
-14.75% |
43.597 |
37.164 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3
|
-14.28% |
10.006 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
-13.80% |
41.454 |
35.734 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/5001
|
-13.78% |
14.304 |
12.333 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC64
|
-13.46% |
74.329 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_INIT3_RAW/171
|
-13.06% |
0.618 |
0.537 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/171
|
-12.65% |
4.214 |
3.681 |
0.000 |
0.01% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/171
|
-12.64% |
4.214 |
3.681 |
0.001 |
-0.03% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_MULADDSUB_LAMBDA/44217
|
-12.42% |
347.821 |
304.635 |
6.379 |
-6.92% |
6.379 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int64_t_
|
-12.12% |
781298.324 |
686620.588 |
264.864 |
-0.03% |
264.864 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC1
|
-11.11% |
6.432 |
5.718 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
-10.99% |
138.651 |
123.411 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
-10.75% |
285.883 |
255.162 |
0.005 |
0.00% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
-10.61% |
140.084 |
125.225 |
0.048 |
0.01% |
0.048 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/5001
|
-10.42% |
208.821 |
187.068 |
2.396 |
0.28% |
2.396 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
-10.35% |
283.019 |
253.725 |
0.009 |
0.00% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
-10.20% |
69.327 |
62.256 |
0.172 |
-0.02% |
0.172 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
-9.99% |
71.468 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128
|
-9.91% |
153.311 |
138.118 |
5.099 |
0.03% |
5.099 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16
|
-9.23% |
46.455 |
42.168 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_int32_t_
|
-9.18% |
438586.943 |
398346.439 |
833.187 |
-0.17% |
833.187 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
-9.09% |
7.862 |
7.147 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
-9.06% |
170.099 |
154.681 |
1.792 |
-1.38% |
1.792 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
-8.84% |
35.021 |
31.925 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_uint8_t_
|
-8.80% |
439477.387 |
400794.505 |
717.544 |
-0.64% |
717.544 |
|
SingleSource/Benchmarks/Shootout/Shootout-matrix
Profile
|
-8.51% |
4.822 |
4.411 |
0.016 |
-0.16% |
0.016 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
-8.48% |
604378.893 |
553103.886 |
6409.603 |
-1.32% |
6409.603 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
-8.30% |
232.289 |
212.998 |
1.307 |
-0.00% |
1.307 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/1000
|
-8.30% |
2868135.246 |
2630078.947 |
5682.292 |
-0.01% |
5682.292 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/1000
|
-8.30% |
2868258.197 |
2630195.489 |
75.459 |
-0.00% |
75.459 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/1000
|
-8.30% |
2868184.426 |
2630135.338 |
55.789 |
-0.00% |
55.789 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/1000
|
-8.30% |
2868143.443 |
2630135.338 |
75.000 |
-0.01% |
75.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/1000
|
-8.28% |
2868295.082 |
2630718.045 |
96.889 |
-0.00% |
96.889 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
-8.27% |
95.058 |
87.194 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
-8.27% |
37.165 |
34.092 |
0.043 |
-0.00% |
0.043 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
-8.22% |
2883.889 |
2646.706 |
0.063 |
-0.01% |
0.063 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_autovec_int32_t_
|
-8.19% |
594789.744 |
546057.188 |
603.889 |
0.25% |
603.889 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/999
|
-8.18% |
2883.132 |
2647.274 |
0.044 |
-0.00% |
0.044 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/10
|
-8.17% |
35.022 |
32.162 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
-8.12% |
1461.635 |
1342.986 |
0.043 |
-0.01% |
0.043 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/999
|
-8.03% |
1460.934 |
1343.659 |
0.019 |
-0.00% |
0.019 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, First>
|
-8.01% |
10462.558 |
9624.323 |
4.391 |
0.00% |
4.391 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
-8.00% |
759.730 |
698.976 |
0.021 |
-0.01% |
0.021 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
-7.96% |
17.868 |
16.447 |
0.233 |
-0.03% |
0.233 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
-7.94% |
152.946 |
140.806 |
1.010 |
0.01% |
1.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
-7.87% |
63.613 |
58.606 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/256
|
-7.72% |
759.039 |
700.415 |
2.462 |
0.00% |
2.462 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
-7.68% |
9.291 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/32
|
-7.58% |
94382.904 |
87230.587 |
0.348 |
-0.00% |
0.348 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/32
|
-7.58% |
94376.837 |
87226.972 |
2.572 |
-0.00% |
2.572 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/32
|
-7.57% |
94372.623 |
87224.576 |
4.047 |
-0.01% |
4.047 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/32
|
-7.57% |
94378.185 |
87230.184 |
2.818 |
0.00% |
2.818 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/32
|
-7.57% |
94375.624 |
87229.159 |
1.901 |
-0.01% |
1.901 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
-7.40% |
39529.930 |
36605.104 |
0.955 |
0.00% |
0.955 |
|
MultiSource/Benchmarks/TSVC/ControlLoops-dbl/ControlLoops-dbl
Profile
|
-7.32% |
9.789 |
9.072 |
0.271 |
-0.62% |
0.271 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
-7.31% |
400.970 |
371.663 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/256
|
-7.15% |
400.251 |
371.646 |
0.014 |
-0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
-7.14% |
20.012 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk
Profile
|
-6.99% |
38.545 |
35.850 |
0.477 |
-3.14% |
0.477 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_uint8_t_
|
-6.98% |
576375.732 |
536131.741 |
367.238 |
-0.01% |
367.238 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
-6.90% |
165.815 |
154.378 |
0.004 |
-0.01% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/51
|
-6.87% |
166.524 |
155.090 |
0.007 |
-0.01% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_uint8_t_
|
-6.67% |
580039.363 |
541332.551 |
1157.155 |
0.32% |
1157.155 |
|
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-moments
Profile
|
-6.67% |
0.272 |
0.254 |
0.005 |
-1.46% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
-6.67% |
10.721 |
10.006 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/5001
|
-6.49% |
120.619 |
112.789 |
0.843 |
0.38% |
0.843 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
-6.43% |
100.059 |
93.627 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/44217
|
-6.30% |
5416.777 |
5075.674 |
10.632 |
-0.36% |
10.632 |
|
MultiSource/Benchmarks/SciMark2-C/scimark2
Profile
|
-6.25% |
55.529 |
52.061 |
0.011 |
-0.02% |
0.011 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/5001
|
-6.18% |
119.842 |
112.439 |
0.303 |
-0.37% |
0.303 |
|
SingleSource/Benchmarks/BenchmarkGame/n-body
Profile
|
-6.16% |
2.635 |
2.473 |
0.001 |
0.03% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/51
|
-6.07% |
94.349 |
88.624 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
-6.06% |
23.586 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
-6.06% |
23.585 |
22.156 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
-6.06% |
23.586 |
22.156 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
-6.06% |
23.586 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
-6.06% |
23.586 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
-6.06% |
23.585 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/44217
|
-6.06% |
3728.665 |
3502.786 |
32.141 |
-1.15% |
32.141 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC1
|
-6.04% |
13.209 |
12.411 |
0.049 |
0.01% |
0.049 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/44217
|
-6.00% |
5399.496 |
5075.599 |
9.680 |
0.25% |
9.680 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8
|
-5.88% |
12.150 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC4
|
-5.88% |
12.150 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
-5.87% |
16580.694 |
15607.947 |
0.242 |
-0.01% |
0.242 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Mid>
|
-5.86% |
16580.356 |
15608.143 |
0.525 |
-0.01% |
0.525 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/28
|
-5.68% |
62.895 |
59.321 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/28
|
-5.67% |
100.774 |
95.057 |
0.003 |
-0.01% |
0.003 |
|
MultiSource/Benchmarks/TSVC/Symbolics-dbl/Symbolics-dbl
Profile
|
-5.48% |
11.080 |
10.472 |
0.096 |
-1.16% |
0.096 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16
|
-5.44% |
65.757 |
62.180 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/128
|
-5.39% |
2920.642 |
2763.091 |
0.614 |
-0.01% |
0.614 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/256
|
-5.38% |
418.458 |
395.955 |
0.010 |
-0.01% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/16
|
-5.38% |
66.468 |
62.894 |
0.002 |
-0.01% |
0.002 |
|
MultiSource/Benchmarks/Olden/tsp/tsp
Profile
|
-5.05% |
3.264 |
3.100 |
0.007 |
0.07% |
0.007 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/44217
|
-5.05% |
3746.210 |
3557.127 |
8.667 |
0.11% |
8.667 |
|
SingleSource/Benchmarks/McGill/queens
Profile
|
-4.94% |
3.395 |
3.227 |
0.000 |
-0.02% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/44217
|
-4.81% |
1605.157 |
1527.941 |
5.990 |
0.00% |
5.990 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/64
|
-4.81% |
725.227 |
690.349 |
0.235 |
0.01% |
0.235 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/256
|
-4.80% |
11953.034 |
11379.869 |
23.480 |
-0.18% |
23.480 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/16
|
-4.69% |
45.741 |
43.596 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/32
|
-4.62% |
178.952 |
170.677 |
0.115 |
-0.00% |
0.115 |
|
MultiSource/Benchmarks/TSVC/StatementReordering-flt/StatementReordering-flt
Profile
|
-4.37% |
8.618 |
8.241 |
0.078 |
-0.17% |
0.078 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, First>
|
-4.16% |
8791.326 |
8425.639 |
0.269 |
-0.00% |
0.269 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_uint8_t_
|
-4.02% |
283155.754 |
271758.849 |
1364.208 |
-0.18% |
1364.208 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
-3.93% |
157.893 |
151.696 |
0.079 |
0.02% |
0.079 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
-3.85% |
37.166 |
35.735 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC7
|
-3.84% |
18.582 |
17.869 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Mid>
|
-3.80% |
25375.711 |
24411.648 |
0.255 |
0.00% |
0.255 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
-3.78% |
37.880 |
36.450 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/1024/1024
|
-3.61% |
55301.462 |
53306.846 |
17.797 |
-0.32% |
17.797 |
|
SingleSource/Benchmarks/Misc/mandel
Profile
|
-3.55% |
1.328 |
1.281 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/28
|
-3.51% |
40.740 |
39.310 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/44217
|
-3.45% |
1592.993 |
1538.066 |
4.510 |
0.36% |
4.510 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
-3.45% |
20.726 |
20.012 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/10
|
-3.39% |
42.169 |
40.737 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
-3.33% |
43919.317 |
42454.810 |
0.587 |
-0.00% |
0.587 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC1
|
-3.25% |
13.194 |
12.765 |
0.005 |
0.01% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/10
|
-3.23% |
22.156 |
21.441 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/28
|
-3.23% |
22.156 |
21.441 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/16
|
-3.23% |
22.156 |
21.440 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/256
|
-3.23% |
22.157 |
21.442 |
0.014 |
0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/51
|
-3.23% |
22.156 |
21.442 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
-3.23% |
30250.281 |
29274.415 |
0.885 |
-0.00% |
0.885 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/999
|
-3.23% |
22.156 |
21.441 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/TSVC/InductionVariable-flt/InductionVariable-flt
Profile
|
-3.20% |
10.295 |
9.965 |
0.121 |
-0.44% |
0.121 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
-3.12% |
31227.060 |
30251.426 |
0.578 |
0.00% |
0.578 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
-3.11% |
5862.854 |
5680.603 |
4.222 |
-0.01% |
4.222 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
-3.02% |
13805.175 |
13387.954 |
0.341 |
-0.01% |
0.341 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
-3.02% |
24160.333 |
23430.278 |
3.827 |
-0.00% |
3.827 |
|
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_PRESSURE_CALC_LAMBDA/171
|
-3.00% |
1.359 |
1.319 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
-2.94% |
48.600 |
47.170 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/10
|
-2.90% |
49.316 |
47.886 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
-2.84% |
12815.770 |
12451.508 |
0.256 |
-0.00% |
0.256 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
-2.78% |
21079.553 |
20494.335 |
34.170 |
-0.01% |
34.170 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_1D_RAW/5001
|
-2.68% |
14.485 |
14.096 |
0.052 |
0.02% |
0.052 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
-2.64% |
157.924 |
153.760 |
1.424 |
0.01% |
1.424 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
-2.63% |
18530.589 |
18042.433 |
0.358 |
-0.01% |
0.358 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/16
|
-2.60% |
55.033 |
53.602 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
-2.53% |
155.456 |
151.525 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/999
|
-2.52% |
762.507 |
743.297 |
0.030 |
-0.00% |
0.030 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
-2.49% |
16732.414 |
16315.883 |
0.391 |
-0.00% |
0.391 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC7
|
-2.49% |
61.478 |
59.948 |
0.009 |
-0.01% |
0.009 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/16
|
-2.49% |
43.719 |
42.632 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
-2.48% |
14.432 |
14.075 |
0.057 |
-0.53% |
0.057 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_uint8_t_
|
-2.47% |
282328.595 |
275349.269 |
1287.666 |
-0.12% |
1287.666 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
-2.44% |
29.304 |
28.589 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, None>
|
-2.39% |
15377.501 |
15010.658 |
0.456 |
-0.01% |
0.456 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int64_t_
|
-2.28% |
933550.802 |
912281.782 |
3583.421 |
-0.26% |
3583.421 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/16
|
-2.27% |
31.448 |
30.732 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/Trimaran/enc-3des/enc-3des
Profile
|
-2.22% |
3.143 |
3.073 |
0.000 |
-0.04% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/16
|
-2.22% |
32.162 |
31.448 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/Pathfinder/PathFinder
Profile
|
-2.22% |
7.208 |
7.047 |
0.021 |
-2.35% |
0.021 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC7
|
-2.19% |
62.335 |
60.972 |
0.077 |
-0.22% |
0.077 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/10
|
-2.08% |
34.307 |
33.592 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/CoMD/CoMD
Profile
|
-2.06% |
5.063 |
4.959 |
0.006 |
0.02% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_int64_t_
|
-2.03% |
424488.795 |
415884.892 |
2032.672 |
-0.27% |
2032.672 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC15
|
-2.00% |
35.736 |
35.022 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC1
|
-1.95% |
13.167 |
12.910 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_autovec_int64_t_
|
-1.90% |
1944447.222 |
1907498.638 |
13737.120 |
0.13% |
13737.120 |
|
SingleSource/Benchmarks/Misc/perlin
Profile
|
-1.90% |
7.274 |
7.136 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10
|
-1.89% |
37.882 |
37.165 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC16
|
-1.88% |
37.880 |
37.166 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC16
|
-1.88% |
37.880 |
37.166 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/28
|
-1.79% |
40.025 |
39.310 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/Polybench/stencils/seidel-2d/seidel-2d
Profile
|
-1.78% |
163.947 |
161.027 |
0.126 |
0.01% |
0.126 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/28
|
-1.77% |
80.763 |
79.336 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/5001
|
-1.74% |
178.842 |
175.732 |
0.169 |
-0.05% |
0.169 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
Profile
|
-1.72% |
2.466 |
2.424 |
0.000 |
-0.03% |
0.000 |
|
SingleSource/Benchmarks/Misc/flops-8
Profile
|
-1.71% |
1.768 |
1.738 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/16
|
-1.67% |
42.883 |
42.168 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/44217
|
-1.62% |
129.303 |
127.206 |
0.296 |
-0.09% |
0.296 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/16
|
-1.60% |
45.030 |
44.311 |
0.002 |
-0.00% |
0.002 |
|
SingleSource/Benchmarks/CoyoteBench/lpbench
Profile
|
-1.54% |
7.741 |
7.622 |
0.002 |
0.24% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/51
|
-1.53% |
93.630 |
92.200 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/999
|
-1.52% |
1480.513 |
1457.991 |
0.070 |
-0.01% |
0.070 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint8_t>/28
|
-1.45% |
96.455 |
95.056 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/5001
|
-1.36% |
178.267 |
175.841 |
0.136 |
0.04% |
0.136 |
|
MultiSource/Benchmarks/Bullet/bullet
Profile
|
-1.31% |
14.704 |
14.511 |
0.048 |
-0.06% |
0.048 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/51
|
-1.28% |
55.748 |
55.033 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_GEN_LIN_RECUR_RAW/44217
|
-1.20% |
512.222 |
506.089 |
0.470 |
-0.01% |
0.470 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/28
|
-1.19% |
60.037 |
59.323 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256
|
-1.18% |
460.719 |
455.271 |
0.014 |
-0.01% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
-1.13% |
305.808 |
302.356 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_1D_RAW/5001
|
-1.13% |
128.443 |
126.995 |
0.436 |
0.50% |
0.436 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
-1.11% |
64.326 |
63.611 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
-1.10% |
130.079 |
128.647 |
0.004 |
-0.01% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint8_t_To_uint64_t_
|
-1.09% |
21719.074 |
21481.608 |
16.438 |
-0.08% |
16.438 |
|
MultiSource/Benchmarks/TSVC/IndirectAddressing-dbl/IndirectAddressing-dbl
Profile
|
-1.08% |
11.721 |
11.594 |
0.031 |
-0.43% |
0.031 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC31
|
-1.02% |
70.041 |
69.329 |
0.000 |
0.00% |
0.000 |