|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
-35.09% |
27.160 |
17.630 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC31
|
-34.28% |
50.029 |
32.878 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
-34.26% |
51.461 |
33.831 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC32
|
-34.25% |
51.458 |
33.832 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
-33.82% |
97.199 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint8_t_
|
-33.54% |
28596.846 |
19004.484 |
7.568 |
-0.02% |
7.568 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC64
|
-33.33% |
96.488 |
64.325 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC63
|
-33.33% |
95.060 |
63.374 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
-33.33% |
27.874 |
18.583 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC15
|
-33.33% |
26.445 |
17.630 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
-33.33% |
49.315 |
32.877 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
-33.33% |
95.057 |
63.374 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC16
|
-33.33% |
27.874 |
18.584 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1BigLoopWithReductionTC3
|
-32.05% |
27.874 |
18.940 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC2
|
-31.14% |
7.862 |
5.414 |
0.061 |
-0.77% |
0.061 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
-30.86% |
7.862 |
5.435 |
0.067 |
0.00% |
0.067 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
-30.43% |
16.438 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
0.00% |
0.000 |
|
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-ary3
Profile
|
-26.21% |
6.161 |
4.546 |
0.046 |
-0.95% |
0.046 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8
|
-26.08% |
16.438 |
12.151 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128
|
-25.36% |
202.166 |
150.901 |
4.516 |
-0.03% |
4.516 |
|
SingleSource/Benchmarks/BenchmarkGame/puzzle
Profile
|
-25.00% |
1.122 |
0.842 |
0.002 |
-1.25% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
-25.00% |
11.435 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
-23.07% |
9.291 |
7.147 |
0.001 |
-0.04% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
-22.21% |
192.260 |
149.567 |
0.139 |
-0.11% |
0.139 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int64_t_
|
-21.88% |
865429.455 |
676109.284 |
502.659 |
-0.04% |
502.659 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/999
|
-21.74% |
2169.189 |
1697.500 |
2.351 |
-0.04% |
2.351 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128
|
-21.74% |
192.971 |
151.029 |
2.891 |
-1.50% |
2.891 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/999
|
-21.39% |
1100.660 |
865.207 |
2.191 |
0.13% |
2.191 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_int64_t_
|
-20.98% |
1287583.794 |
1017419.214 |
1340.917 |
0.25% |
1340.917 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256
|
-20.47% |
576.058 |
458.161 |
2.419 |
-0.16% |
2.419 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC4
|
-20.00% |
10.720 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
-19.67% |
305.189 |
245.160 |
0.887 |
-0.00% |
0.887 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/51
|
-19.56% |
131.507 |
105.778 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
-19.27% |
193.687 |
156.366 |
0.830 |
-0.56% |
0.830 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/51
|
-19.27% |
77.904 |
62.894 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
-18.42% |
54.319 |
44.311 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/28
|
-18.26% |
82.192 |
67.181 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
-18.21% |
192.260 |
157.241 |
0.014 |
-0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
-16.68% |
14314.105 |
11926.749 |
0.442 |
-0.01% |
0.442 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint64_t_
|
-16.65% |
14302.397 |
11921.299 |
6.411 |
-0.01% |
6.411 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/16
|
-16.46% |
56.462 |
47.170 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/171
|
-16.35% |
0.494 |
0.413 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/CoyoteBench/huffbench
Profile
|
-16.31% |
38.303 |
32.057 |
0.015 |
0.05% |
0.015 |
|
SingleSource/Benchmarks/Shootout/Shootout-sieve
Profile
|
-15.17% |
8.408 |
7.133 |
0.019 |
-0.67% |
0.019 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/10
|
-14.76% |
43.598 |
37.164 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3
|
-14.28% |
10.006 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
-13.80% |
41.454 |
35.734 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/5001
|
-13.78% |
14.304 |
12.333 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC64
|
-13.46% |
74.329 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_INIT3_RAW/171
|
-13.06% |
0.618 |
0.537 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/171
|
-12.64% |
4.213 |
3.681 |
0.000 |
0.01% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/171
|
-12.63% |
4.213 |
3.681 |
0.001 |
-0.03% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int64_t_
|
-12.11% |
781243.848 |
686620.588 |
264.864 |
-0.03% |
264.864 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
-11.14% |
285.528 |
253.725 |
0.009 |
0.00% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC1
|
-11.11% |
6.433 |
5.718 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
-11.00% |
138.657 |
123.411 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
-10.75% |
285.890 |
255.162 |
0.005 |
0.00% |
0.005 |
|
MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_MULADDSUB_LAMBDA/44217
|
-10.66% |
340.974 |
304.635 |
6.379 |
-6.92% |
6.379 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
-10.61% |
140.082 |
125.225 |
0.048 |
0.01% |
0.048 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
-10.20% |
69.325 |
62.256 |
0.172 |
-0.02% |
0.172 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/5001
|
-10.11% |
208.117 |
187.068 |
2.396 |
0.28% |
2.396 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
-10.00% |
71.471 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128
|
-9.92% |
153.320 |
138.118 |
5.099 |
0.03% |
5.099 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16
|
-9.23% |
46.457 |
42.168 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_int32_t_
|
-9.18% |
438626.098 |
398346.439 |
833.187 |
-0.17% |
833.187 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
-9.09% |
7.862 |
7.147 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
-9.03% |
170.041 |
154.681 |
1.792 |
-1.38% |
1.792 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
-8.87% |
233.724 |
212.998 |
1.307 |
-0.00% |
1.307 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
-8.84% |
35.021 |
31.925 |
0.001 |
-0.00% |
0.001 |
|
SingleSource/Benchmarks/Shootout/Shootout-matrix
Profile
|
-8.51% |
4.822 |
4.411 |
0.016 |
-0.16% |
0.016 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_uint8_t_
|
-8.40% |
437565.625 |
400794.505 |
717.544 |
-0.64% |
717.544 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/1000
|
-8.30% |
2868143.443 |
2630078.947 |
5682.292 |
-0.01% |
5682.292 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/1000
|
-8.30% |
2868200.820 |
2630135.338 |
55.789 |
-0.00% |
55.789 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/1000
|
-8.30% |
2868209.016 |
2630195.489 |
75.459 |
-0.00% |
75.459 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/1000
|
-8.30% |
2868098.361 |
2630135.338 |
75.000 |
-0.01% |
75.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/1000
|
-8.28% |
2868245.902 |
2630718.045 |
96.889 |
-0.00% |
96.889 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
-8.27% |
95.059 |
87.194 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
-8.27% |
37.165 |
34.092 |
0.043 |
-0.00% |
0.043 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
-8.23% |
2884.078 |
2646.706 |
0.063 |
-0.01% |
0.063 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/999
|
-8.18% |
2883.169 |
2647.274 |
0.044 |
-0.00% |
0.044 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/10
|
-8.16% |
35.021 |
32.162 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
-8.13% |
602082.830 |
553103.886 |
6409.603 |
-1.32% |
6409.603 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
-8.12% |
1461.619 |
1342.986 |
0.043 |
-0.01% |
0.043 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_autovec_int32_t_
|
-8.04% |
593799.658 |
546057.188 |
603.889 |
0.25% |
603.889 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/999
|
-8.02% |
1460.876 |
1343.659 |
0.019 |
-0.00% |
0.019 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, First>
|
-8.01% |
10462.190 |
9624.323 |
4.391 |
0.00% |
4.391 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
-7.99% |
759.709 |
698.976 |
0.021 |
-0.01% |
0.021 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
-7.96% |
17.868 |
16.447 |
0.233 |
-0.03% |
0.233 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
-7.94% |
152.951 |
140.806 |
1.010 |
0.01% |
1.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
-7.87% |
63.610 |
58.606 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/256
|
-7.72% |
759.031 |
700.415 |
2.462 |
0.00% |
2.462 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
-7.69% |
9.291 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/32
|
-7.58% |
94376.567 |
87224.576 |
4.047 |
-0.01% |
4.047 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/32
|
-7.58% |
94376.534 |
87226.972 |
2.572 |
-0.00% |
2.572 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/32
|
-7.57% |
94377.107 |
87229.159 |
1.901 |
-0.01% |
1.901 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/32
|
-7.57% |
94377.562 |
87230.587 |
0.348 |
-0.00% |
0.348 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/32
|
-7.57% |
94376.702 |
87230.184 |
2.818 |
0.00% |
2.818 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
-7.40% |
39530.156 |
36605.104 |
0.955 |
0.00% |
0.955 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
-7.31% |
400.968 |
371.663 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/256
|
-7.15% |
400.253 |
371.646 |
0.014 |
-0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
-7.14% |
20.012 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
-6.90% |
165.813 |
154.378 |
0.004 |
-0.01% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/51
|
-6.87% |
166.532 |
155.090 |
0.007 |
-0.01% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_uint8_t_
|
-6.75% |
574917.706 |
536131.741 |
367.238 |
-0.01% |
367.238 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
-6.66% |
10.720 |
10.006 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/5001
|
-6.66% |
120.465 |
112.439 |
0.303 |
-0.37% |
0.303 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/44217
|
-6.49% |
5427.752 |
5075.599 |
9.680 |
0.25% |
9.680 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/5001
|
-6.45% |
120.560 |
112.789 |
0.843 |
0.38% |
0.843 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/44217
|
-6.43% |
5424.727 |
5075.674 |
10.632 |
-0.36% |
10.632 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
-6.43% |
100.063 |
93.627 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_uint8_t_
|
-6.27% |
577567.251 |
541332.551 |
1157.155 |
0.32% |
1157.155 |
|
MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk
Profile
|
-6.23% |
38.234 |
35.850 |
0.477 |
-3.14% |
0.477 |
|
MultiSource/Benchmarks/SciMark2-C/scimark2
Profile
|
-6.23% |
55.519 |
52.061 |
0.011 |
-0.02% |
0.011 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/44217
|
-6.20% |
3734.170 |
3502.786 |
32.141 |
-1.15% |
32.141 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/51
|
-6.06% |
94.345 |
88.624 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
-6.06% |
23.586 |
22.156 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
-6.06% |
23.586 |
22.156 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
-6.06% |
23.586 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC1
|
-6.06% |
13.212 |
12.411 |
0.049 |
0.01% |
0.049 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
-6.06% |
23.586 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
-6.06% |
23.585 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
-6.06% |
23.586 |
22.156 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/BenchmarkGame/n-body
Profile
|
-6.00% |
2.631 |
2.473 |
0.001 |
0.03% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8
|
-5.88% |
12.151 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC4
|
-5.88% |
12.150 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
-5.87% |
16580.728 |
15607.947 |
0.242 |
-0.01% |
0.242 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Mid>
|
-5.87% |
16580.880 |
15608.143 |
0.525 |
-0.01% |
0.525 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/44217
|
-5.86% |
1623.064 |
1527.941 |
5.990 |
0.00% |
5.990 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/28
|
-5.68% |
62.897 |
59.321 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/28
|
-5.68% |
100.777 |
95.057 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16
|
-5.44% |
65.755 |
62.180 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/128
|
-5.40% |
2920.742 |
2763.091 |
0.614 |
-0.01% |
0.614 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/16
|
-5.38% |
66.467 |
62.894 |
0.002 |
-0.01% |
0.002 |
|
MultiSource/Benchmarks/TSVC/Symbolics-dbl/Symbolics-dbl
Profile
|
-5.32% |
11.061 |
10.472 |
0.096 |
-1.16% |
0.096 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/44217
|
-5.28% |
3755.549 |
3557.127 |
8.667 |
0.11% |
8.667 |
|
MultiSource/Benchmarks/Olden/tsp/tsp
Profile
|
-5.05% |
3.264 |
3.100 |
0.007 |
0.07% |
0.007 |
|
SingleSource/Benchmarks/McGill/queens
Profile
|
-4.95% |
3.395 |
3.227 |
0.000 |
-0.02% |
0.000 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/64
|
-4.81% |
725.225 |
690.349 |
0.235 |
0.01% |
0.235 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/16
|
-4.69% |
45.742 |
43.596 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/32
|
-4.63% |
178.964 |
170.677 |
0.115 |
-0.00% |
0.115 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/256
|
-4.62% |
11930.678 |
11379.869 |
23.480 |
-0.18% |
23.480 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/44217
|
-4.45% |
1609.772 |
1538.066 |
4.510 |
0.36% |
4.510 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, First>
|
-4.23% |
8797.489 |
8425.639 |
0.269 |
-0.00% |
0.269 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
-3.94% |
157.912 |
151.696 |
0.079 |
0.02% |
0.079 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_IMP_HYDRO_2D_RAW/44217
|
-3.93% |
7522.946 |
7227.418 |
88.620 |
0.54% |
88.620 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
-3.85% |
37.166 |
35.735 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC7
|
-3.84% |
18.583 |
17.869 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Mid>
|
-3.80% |
25376.042 |
24411.648 |
0.255 |
0.00% |
0.255 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
-3.78% |
37.880 |
36.450 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/1024/1024
|
-3.57% |
55280.462 |
53306.846 |
17.797 |
-0.32% |
17.797 |
|
SingleSource/Benchmarks/Misc/mandel
Profile
|
-3.55% |
1.328 |
1.281 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/28
|
-3.51% |
40.738 |
39.310 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
-3.45% |
20.727 |
20.012 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/10
|
-3.39% |
42.168 |
40.737 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC1
|
-3.35% |
13.207 |
12.765 |
0.005 |
0.01% |
0.005 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
-3.34% |
43920.755 |
42454.810 |
0.587 |
-0.00% |
0.587 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
-3.23% |
30251.577 |
29274.415 |
0.885 |
-0.00% |
0.885 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/16
|
-3.23% |
22.156 |
21.440 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/28
|
-3.23% |
22.156 |
21.441 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/10
|
-3.23% |
22.156 |
21.441 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/51
|
-3.22% |
22.156 |
21.442 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/999
|
-3.22% |
22.156 |
21.441 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/256
|
-3.22% |
22.157 |
21.442 |
0.014 |
0.00% |
0.014 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
-3.12% |
31226.243 |
30251.426 |
0.578 |
0.00% |
0.578 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_uint8_t_
|
-3.11% |
280483.494 |
271758.849 |
1364.208 |
-0.18% |
1364.208 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
-3.11% |
5862.793 |
5680.603 |
4.222 |
-0.01% |
4.222 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
-3.03% |
13805.708 |
13387.954 |
0.341 |
-0.01% |
0.341 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
-3.02% |
24161.127 |
23430.278 |
3.827 |
-0.00% |
3.827 |
|
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_PRESSURE_CALC_LAMBDA/171
|
-3.01% |
1.360 |
1.319 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
-2.95% |
48.601 |
47.170 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/10
|
-2.90% |
49.314 |
47.886 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
-2.84% |
12815.669 |
12451.508 |
0.256 |
-0.00% |
0.256 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
-2.78% |
21081.253 |
20494.335 |
34.170 |
-0.01% |
34.170 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
-2.75% |
14.472 |
14.075 |
0.057 |
-0.53% |
0.057 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/999
|
-2.72% |
764.074 |
743.297 |
0.030 |
-0.00% |
0.030 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
-2.64% |
18531.660 |
18042.433 |
0.358 |
-0.01% |
0.358 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
-2.63% |
157.916 |
153.760 |
1.424 |
0.01% |
1.424 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_1D_RAW/5001
|
-2.62% |
14.476 |
14.096 |
0.052 |
0.02% |
0.052 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/16
|
-2.60% |
55.035 |
53.602 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
-2.52% |
155.446 |
151.525 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/16
|
-2.51% |
43.730 |
42.632 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC7
|
-2.49% |
61.481 |
59.948 |
0.009 |
-0.01% |
0.009 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
-2.49% |
16732.258 |
16315.883 |
0.391 |
-0.00% |
0.391 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
-2.44% |
29.303 |
28.589 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, None>
|
-2.39% |
15377.801 |
15010.658 |
0.456 |
-0.01% |
0.456 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/16
|
-2.27% |
31.447 |
30.732 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/16
|
-2.22% |
32.162 |
31.448 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC7
|
-2.18% |
62.333 |
60.972 |
0.077 |
-0.22% |
0.077 |
|
MultiSource/Benchmarks/Trimaran/enc-3des/enc-3des
Profile
|
-2.10% |
3.139 |
3.073 |
0.000 |
-0.04% |
0.000 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/512/512
|
-2.10% |
14069.380 |
13773.765 |
100.601 |
-0.16% |
100.601 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/10
|
-2.08% |
34.306 |
33.592 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/256
|
-2.01% |
404.092 |
395.955 |
0.010 |
-0.01% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC15
|
-2.00% |
35.736 |
35.022 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC1
|
-1.96% |
13.168 |
12.910 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int64_t_
|
-1.94% |
930313.333 |
912281.782 |
3583.421 |
-0.26% |
3583.421 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int32_t_
|
-1.91% |
547755.102 |
537306.748 |
2707.010 |
0.26% |
2707.010 |
|
SingleSource/Benchmarks/Misc/perlin
Profile
|
-1.90% |
7.274 |
7.136 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10
|
-1.89% |
37.880 |
37.165 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/TSVC/GlobalDataFlow-dbl/GlobalDataFlow-dbl
Profile
|
-1.89% |
11.634 |
11.414 |
0.079 |
-3.21% |
0.079 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC16
|
-1.88% |
37.880 |
37.166 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC16
|
-1.88% |
37.879 |
37.166 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256
|
-1.85% |
463.839 |
455.271 |
0.014 |
-0.01% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/28
|
-1.78% |
40.023 |
39.310 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/28
|
-1.77% |
80.763 |
79.336 |
0.001 |
-0.00% |
0.001 |
|
SingleSource/Benchmarks/Polybench/stencils/seidel-2d/seidel-2d
Profile
|
-1.74% |
163.880 |
161.027 |
0.126 |
0.01% |
0.126 |
|
SingleSource/Benchmarks/CoyoteBench/lpbench
Profile
|
-1.73% |
7.756 |
7.622 |
0.002 |
0.24% |
0.002 |
|
MultiSource/Benchmarks/TSVC/NodeSplitting-flt/NodeSplitting-flt
Profile
|
-1.73% |
11.193 |
10.999 |
0.048 |
-0.29% |
0.048 |
|
SingleSource/Benchmarks/Misc/flops-8
Profile
|
-1.70% |
1.768 |
1.738 |
0.000 |
-0.00% |
0.000 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/Pathfinder/PathFinder
Profile
|
-1.69% |
7.169 |
7.047 |
0.021 |
-2.35% |
0.021 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/5001
|
-1.67% |
178.719 |
175.732 |
0.169 |
-0.05% |
0.169 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/16
|
-1.67% |
42.883 |
42.168 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/44217
|
-1.66% |
129.356 |
127.206 |
0.296 |
-0.09% |
0.296 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
Profile
|
-1.63% |
2.464 |
2.424 |
0.000 |
-0.03% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/16
|
-1.59% |
45.027 |
44.311 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/51
|
-1.52% |
93.627 |
92.200 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/5001
|
-1.43% |
178.398 |
175.841 |
0.136 |
0.04% |
0.136 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_uint8_t_
|
-1.39% |
279217.461 |
275349.269 |
1287.666 |
-0.12% |
1287.666 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/51
|
-1.28% |
55.746 |
55.033 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_GEN_LIN_RECUR_RAW/44217
|
-1.21% |
512.273 |
506.089 |
0.470 |
-0.01% |
0.470 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/28
|
-1.19% |
60.035 |
59.323 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int32_t_
|
-1.16% |
545633.540 |
539325.329 |
2145.721 |
-0.10% |
2145.721 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
-1.11% |
64.324 |
63.611 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
-1.10% |
305.725 |
302.356 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
-1.10% |
130.079 |
128.647 |
0.004 |
-0.01% |
0.004 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/CoMD/CoMD
Profile
|
-1.09% |
5.013 |
4.959 |
0.006 |
0.02% |
0.006 |
|
MultiSource/Benchmarks/Bullet/bullet
Profile
|
-1.03% |
14.662 |
14.511 |
0.048 |
-0.06% |
0.048 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC31
|
-1.02% |
70.043 |
69.329 |
0.000 |
0.00% |
0.000 |