|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
-35.08% |
27.159 |
17.630 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC31
|
-34.28% |
50.030 |
32.878 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC32
|
-34.26% |
51.460 |
33.831 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
-34.26% |
51.459 |
33.831 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
-33.82% |
97.202 |
64.328 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint8_t_
|
-33.49% |
28597.132 |
19019.038 |
4.558 |
0.05% |
4.558 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC16
|
-33.33% |
27.874 |
18.583 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC63
|
-33.33% |
95.056 |
63.371 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
-33.33% |
95.058 |
63.372 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC64
|
-33.33% |
96.488 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC15
|
-33.33% |
26.444 |
17.630 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
-33.33% |
49.314 |
32.877 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
-33.33% |
27.874 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1BigLoopWithReductionTC3
|
-32.05% |
27.874 |
18.939 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
-31.11% |
7.862 |
5.416 |
0.012 |
-0.06% |
0.012 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC2
|
-30.97% |
7.862 |
5.427 |
0.017 |
-0.53% |
0.017 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
-30.43% |
16.439 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8
|
-26.08% |
16.438 |
12.150 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
-25.00% |
11.436 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-ary3
Profile
|
-24.75% |
6.076 |
4.572 |
0.086 |
-0.39% |
0.086 |
|
SingleSource/Benchmarks/BenchmarkGame/puzzle
Profile
|
-23.81% |
1.120 |
0.853 |
0.003 |
0.13% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
-23.06% |
9.291 |
7.149 |
0.002 |
-0.02% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
-22.24% |
192.258 |
149.492 |
0.260 |
-0.16% |
0.260 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int64_t_
|
-21.89% |
865250.000 |
675824.494 |
679.083 |
-0.08% |
679.083 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128
|
-21.85% |
192.975 |
150.812 |
3.439 |
-1.64% |
3.439 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/999
|
-21.71% |
2169.127 |
1698.245 |
3.875 |
-0.00% |
3.875 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/999
|
-21.50% |
1100.689 |
864.094 |
0.914 |
0.00% |
0.914 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_int64_t_
|
-21.02% |
1292127.306 |
1020575.182 |
1215.321 |
-0.10% |
1215.321 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256
|
-20.34% |
576.066 |
458.869 |
1.997 |
-0.00% |
1.997 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC4
|
-20.00% |
10.721 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
-19.83% |
305.172 |
244.654 |
0.240 |
-0.21% |
0.240 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/51
|
-19.56% |
131.509 |
105.784 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/51
|
-19.26% |
77.903 |
62.899 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
-19.05% |
193.690 |
156.795 |
0.640 |
-0.29% |
0.640 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
-18.42% |
54.319 |
44.314 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/28
|
-18.26% |
82.194 |
67.186 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
-18.21% |
192.257 |
157.240 |
0.020 |
-0.00% |
0.020 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint64_t_
|
-16.64% |
14301.798 |
11921.400 |
0.203 |
-0.01% |
0.203 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
-16.61% |
14302.528 |
11927.447 |
2.362 |
-0.00% |
2.362 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/16
|
-16.45% |
56.463 |
47.173 |
0.001 |
-0.00% |
0.001 |
|
SingleSource/Benchmarks/CoyoteBench/huffbench
Profile
|
-16.30% |
38.278 |
32.038 |
0.011 |
-0.01% |
0.011 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
-15.50% |
166.593 |
140.776 |
5.022 |
-0.01% |
5.022 |
|
SingleSource/Benchmarks/Shootout/Shootout-sieve
Profile
|
-14.85% |
8.422 |
7.172 |
0.028 |
-0.13% |
0.028 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/10
|
-14.75% |
43.597 |
37.167 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3
|
-14.28% |
10.006 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
-13.79% |
41.453 |
35.737 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC64
|
-13.46% |
74.331 |
64.327 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/171
|
-12.68% |
4.215 |
3.680 |
0.012 |
-0.01% |
0.012 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/171
|
-12.65% |
4.213 |
3.680 |
0.010 |
-0.06% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int64_t_
|
-12.01% |
780434.152 |
686716.389 |
839.167 |
-0.01% |
839.167 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
-11.35% |
170.119 |
150.817 |
5.449 |
-3.84% |
5.449 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC1
|
-11.11% |
6.433 |
5.718 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
-11.00% |
138.659 |
123.409 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/5001
|
-10.85% |
208.079 |
185.510 |
1.432 |
-0.47% |
1.432 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
-10.75% |
285.892 |
255.160 |
0.004 |
0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
-10.62% |
140.088 |
125.212 |
0.011 |
-0.00% |
0.011 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
-10.35% |
283.031 |
253.729 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
-10.17% |
69.328 |
62.277 |
0.166 |
0.01% |
0.166 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
-10.00% |
71.472 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128
|
-9.99% |
153.327 |
138.016 |
5.245 |
-0.04% |
5.245 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/5001
|
-9.81% |
207.107 |
186.782 |
2.899 |
0.33% |
2.899 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_autovec_int32_t_
|
-9.57% |
595358.247 |
538369.279 |
4730.956 |
-1.16% |
4730.956 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_BAND_LIN_EQ_RAW/171
|
-9.54% |
0.247 |
0.224 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_int32_t_
|
-9.49% |
439487.790 |
397800.683 |
951.431 |
-0.31% |
951.431 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16
|
-9.23% |
46.457 |
42.169 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
-9.09% |
7.862 |
7.147 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC1
|
-9.07% |
5.138 |
4.672 |
0.051 |
-0.49% |
0.051 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_BAND_LIN_EQ_LAMBDA/171
|
-9.01% |
0.246 |
0.224 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_uint8_t_
|
-9.00% |
437688.555 |
398296.359 |
1029.035 |
0.08% |
1029.035 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
-8.84% |
35.020 |
31.924 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
-8.66% |
37.164 |
33.944 |
0.097 |
-0.44% |
0.097 |
|
SingleSource/Benchmarks/Shootout/Shootout-matrix
Profile
|
-8.52% |
4.822 |
4.411 |
0.014 |
-0.17% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/1000
|
-8.30% |
2868278.689 |
2630225.564 |
10.780 |
0.00% |
10.780 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/1000
|
-8.30% |
2868139.344 |
2630112.782 |
4551.290 |
-0.01% |
4551.290 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/1000
|
-8.30% |
2868180.328 |
2630210.526 |
10.780 |
-0.00% |
10.780 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/1000
|
-8.29% |
2868176.230 |
2630274.436 |
39.943 |
-0.00% |
39.943 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/1000
|
-8.28% |
2868319.672 |
2630725.564 |
37.885 |
-0.00% |
37.885 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
-8.27% |
95.061 |
87.197 |
0.055 |
-0.00% |
0.055 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/10
|
-8.23% |
35.022 |
32.139 |
0.020 |
-0.08% |
0.020 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
-8.23% |
2883.974 |
2646.726 |
0.053 |
-0.01% |
0.053 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/999
|
-8.18% |
2883.198 |
2647.213 |
0.107 |
-0.00% |
0.107 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
-8.12% |
1461.654 |
1342.993 |
0.017 |
-0.00% |
0.017 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
-8.02% |
231.578 |
212.995 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/999
|
-8.02% |
1460.912 |
1343.677 |
0.044 |
-0.00% |
0.044 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, First>
|
-8.00% |
10461.888 |
9624.665 |
4.922 |
-0.00% |
4.922 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
-7.99% |
759.729 |
699.013 |
0.022 |
-0.00% |
0.022 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
-7.87% |
63.611 |
58.607 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/256
|
-7.72% |
759.021 |
700.433 |
0.019 |
0.00% |
0.019 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
-7.69% |
9.291 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/32
|
-7.59% |
94392.231 |
87228.372 |
2.736 |
-0.01% |
2.736 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/32
|
-7.58% |
94377.916 |
87228.689 |
2.138 |
-0.00% |
2.138 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/32
|
-7.57% |
94377.781 |
87229.312 |
3.393 |
0.00% |
3.393 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/32
|
-7.57% |
94376.298 |
87229.590 |
1.557 |
-0.01% |
1.557 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/32
|
-7.57% |
94374.006 |
87230.405 |
1.782 |
-0.00% |
1.782 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
-7.39% |
39528.289 |
36606.997 |
0.214 |
0.00% |
0.214 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_uint8_t_
|
-7.37% |
575742.475 |
533292.438 |
1349.572 |
-0.25% |
1349.572 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
-7.31% |
400.969 |
371.669 |
0.004 |
0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
-7.14% |
17.868 |
16.592 |
0.158 |
0.71% |
0.158 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/256
|
-7.14% |
400.236 |
371.651 |
0.024 |
0.00% |
0.024 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
-7.14% |
20.012 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
-6.89% |
165.815 |
154.383 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/51
|
-6.86% |
166.528 |
155.102 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/5001
|
-6.67% |
120.151 |
112.133 |
0.561 |
-0.64% |
0.561 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
-6.67% |
10.721 |
10.006 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/44217
|
-6.65% |
5422.597 |
5061.891 |
7.529 |
-0.63% |
7.529 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_IMP_HYDRO_2D_RAW/44217
|
-6.55% |
7699.000 |
7194.979 |
161.962 |
0.09% |
161.962 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
-6.43% |
100.063 |
93.629 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_uint8_t_
|
-6.41% |
578842.546 |
541724.165 |
398.271 |
-0.25% |
398.271 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/44217
|
-6.23% |
5415.608 |
5078.348 |
11.500 |
-0.30% |
11.500 |
|
MultiSource/Benchmarks/SciMark2-C/scimark2
Profile
|
-6.17% |
55.502 |
52.076 |
0.007 |
0.00% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
-6.06% |
23.586 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
-6.06% |
23.586 |
22.156 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
-6.06% |
23.586 |
22.157 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
-6.06% |
23.586 |
22.157 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/51
|
-6.06% |
94.342 |
88.627 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
-6.06% |
23.586 |
22.157 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
-6.05% |
23.585 |
22.157 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC1
|
-6.05% |
13.210 |
12.410 |
0.266 |
0.01% |
0.266 |
|
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_PRESSURE_CALC_RAW/171
|
-6.02% |
1.317 |
1.238 |
0.000 |
-0.00% |
0.000 |
|
SingleSource/Benchmarks/BenchmarkGame/n-body
Profile
|
-6.01% |
2.631 |
2.473 |
0.001 |
0.03% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/5001
|
-5.96% |
119.915 |
112.764 |
0.264 |
0.36% |
0.264 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8
|
-5.89% |
12.150 |
11.435 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC4
|
-5.88% |
12.150 |
11.435 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
-5.87% |
16580.729 |
15608.207 |
0.220 |
-0.00% |
0.220 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Mid>
|
-5.86% |
16580.759 |
15608.709 |
0.114 |
-0.00% |
0.114 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/28
|
-5.68% |
62.896 |
59.323 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/28
|
-5.67% |
100.776 |
95.060 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/44217
|
-5.50% |
3720.766 |
3516.156 |
21.407 |
-0.77% |
21.407 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/44217
|
-5.45% |
3735.814 |
3532.157 |
8.389 |
-0.59% |
8.389 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16
|
-5.43% |
65.754 |
62.182 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC1
|
-5.41% |
5.128 |
4.850 |
0.076 |
-1.51% |
0.076 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/128
|
-5.41% |
2920.733 |
2762.775 |
0.158 |
-0.02% |
0.158 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/16
|
-5.38% |
66.469 |
62.894 |
0.002 |
-0.01% |
0.002 |
|
MultiSource/Benchmarks/Olden/tsp/tsp
Profile
|
-5.28% |
3.269 |
3.096 |
0.008 |
-0.03% |
0.008 |
|
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_INIT3_RAW/5001
|
-5.04% |
20.377 |
19.349 |
0.241 |
-0.21% |
0.241 |
|
MultiSource/Applications/obsequi/Obsequi
Profile
|
-4.96% |
4.427 |
4.208 |
0.009 |
-0.00% |
0.009 |
|
SingleSource/Benchmarks/McGill/queens
Profile
|
-4.94% |
3.395 |
3.227 |
0.000 |
-0.02% |
0.000 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/64
|
-4.88% |
725.423 |
690.048 |
0.102 |
-0.04% |
0.102 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/44217
|
-4.86% |
1616.526 |
1537.927 |
3.263 |
-0.05% |
3.263 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/16
|
-4.68% |
45.742 |
43.600 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/32
|
-4.62% |
178.952 |
170.678 |
0.116 |
-0.00% |
0.116 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/256
|
-4.50% |
11932.847 |
11395.311 |
20.769 |
-0.05% |
20.769 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
-4.48% |
596584.116 |
569862.266 |
241.606 |
0.59% |
241.606 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/44217
|
-4.47% |
1609.491 |
1537.606 |
10.066 |
0.33% |
10.066 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, First>
|
-4.16% |
8790.942 |
8425.610 |
0.334 |
-0.00% |
0.334 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/1024/1024
|
-4.14% |
55433.000 |
53140.692 |
53.009 |
-0.63% |
53.009 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_uint8_t_
|
-4.09% |
280958.149 |
269480.539 |
1305.429 |
-0.30% |
1305.429 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
-3.96% |
157.912 |
151.653 |
1.326 |
-0.01% |
1.326 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Mid>
|
-3.85% |
25375.467 |
24399.658 |
5.523 |
-0.04% |
5.523 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC7
|
-3.84% |
18.582 |
17.868 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
-3.84% |
37.164 |
35.737 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
-3.77% |
37.880 |
36.451 |
0.014 |
-0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/256
|
-3.60% |
410.774 |
395.973 |
0.003 |
-0.00% |
0.003 |
|
SingleSource/Benchmarks/Misc/mandel
Profile
|
-3.56% |
1.328 |
1.281 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/28
|
-3.51% |
40.740 |
39.311 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
-3.45% |
20.727 |
20.012 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/10
|
-3.39% |
42.168 |
40.740 |
0.001 |
-0.00% |
0.001 |
|
SingleSource/Benchmarks/Misc/flops-8
Profile
|
-3.35% |
1.766 |
1.707 |
0.015 |
-1.76% |
0.015 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
-3.33% |
43918.497 |
42457.451 |
1.147 |
-0.00% |
1.147 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC1
|
-3.29% |
13.202 |
12.768 |
0.006 |
0.04% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/999
|
-3.23% |
22.157 |
21.442 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/51
|
-3.23% |
22.157 |
21.442 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/16
|
-3.22% |
22.157 |
21.442 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/10
|
-3.22% |
22.156 |
21.442 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/256
|
-3.22% |
22.157 |
21.443 |
0.003 |
0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/28
|
-3.22% |
22.156 |
21.442 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
-3.22% |
30249.708 |
29275.640 |
0.567 |
0.00% |
0.567 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
-3.20% |
14.475 |
14.011 |
0.006 |
-0.98% |
0.006 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_1D_RAW/5001
|
-3.20% |
14.474 |
14.011 |
0.060 |
-0.53% |
0.060 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
-3.12% |
31226.579 |
30251.783 |
4.114 |
-0.00% |
4.114 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
-3.11% |
5862.908 |
5680.691 |
0.092 |
-0.01% |
0.092 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
-3.02% |
13805.597 |
13388.078 |
0.513 |
-0.01% |
0.513 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
-3.02% |
24160.994 |
23430.460 |
0.328 |
-0.00% |
0.328 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
-2.94% |
48.600 |
47.173 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_ADI_RAW/5001
|
-2.94% |
325.337 |
315.787 |
1.235 |
-0.73% |
1.235 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/10
|
-2.90% |
49.316 |
47.887 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
-2.84% |
12815.587 |
12452.139 |
0.138 |
0.00% |
0.138 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
-2.78% |
21080.106 |
20494.935 |
0.497 |
-0.00% |
0.497 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int32_t_
|
-2.71% |
545051.242 |
530296.128 |
4213.282 |
-1.05% |
4213.282 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
-2.69% |
157.924 |
153.668 |
0.057 |
-0.03% |
0.057 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
-2.63% |
18530.223 |
18043.415 |
0.321 |
-0.00% |
0.321 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_uint8_t_
|
-2.60% |
279538.183 |
272273.083 |
1681.930 |
-0.35% |
1681.930 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/16
|
-2.59% |
55.032 |
53.606 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
-2.53% |
155.450 |
151.521 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC7
|
-2.52% |
61.477 |
59.926 |
0.004 |
-0.04% |
0.004 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/16
|
-2.50% |
43.729 |
42.634 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
-2.48% |
16732.270 |
16316.517 |
0.296 |
-0.00% |
0.296 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/999
|
-2.44% |
761.952 |
743.342 |
0.007 |
0.00% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
-2.43% |
29.302 |
28.590 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, None>
|
-2.38% |
15377.455 |
15011.560 |
0.195 |
0.00% |
0.195 |
|
MultiSource/Benchmarks/Trimaran/enc-3des/enc-3des
Profile
|
-2.37% |
3.149 |
3.074 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/16
|
-2.27% |
31.448 |
30.733 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/16
|
-2.22% |
32.163 |
31.448 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/5001
|
-2.13% |
179.345 |
175.519 |
0.111 |
0.02% |
0.111 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC7
|
-2.13% |
62.338 |
61.009 |
0.027 |
-0.16% |
0.027 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
Profile
|
-2.09% |
2.469 |
2.417 |
0.003 |
-0.31% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/10
|
-2.08% |
34.305 |
33.593 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC15
|
-2.00% |
35.735 |
35.021 |
0.000 |
-0.00% |
0.000 |
|
SingleSource/Benchmarks/Misc/perlin
Profile
|
-1.90% |
7.274 |
7.136 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC16
|
-1.89% |
37.879 |
37.165 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC16
|
-1.88% |
37.880 |
37.167 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int32_t_
|
-1.88% |
548367.508 |
538058.457 |
3709.485 |
-0.33% |
3709.485 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC1
|
-1.87% |
13.158 |
12.912 |
0.005 |
0.00% |
0.005 |
|
SingleSource/Benchmarks/Polybench/stencils/seidel-2d/seidel-2d
Profile
|
-1.87% |
164.031 |
160.970 |
0.138 |
-0.02% |
0.138 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/CoMD/CoMD
Profile
|
-1.82% |
5.036 |
4.945 |
0.013 |
-0.26% |
0.013 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/28
|
-1.78% |
40.024 |
39.310 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/28
|
-1.76% |
80.760 |
79.335 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/CoyoteBench/lpbench
Profile
|
-1.68% |
7.734 |
7.604 |
0.006 |
0.01% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/16
|
-1.66% |
42.883 |
42.169 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int64_t_
|
-1.55% |
930527.926 |
916064.304 |
939.708 |
-0.14% |
939.708 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/51
|
-1.53% |
93.631 |
92.201 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/999
|
-1.45% |
1479.539 |
1458.076 |
0.030 |
-0.00% |
0.030 |
|
MultiSource/Benchmarks/Bullet/bullet
Profile
|
-1.37% |
14.735 |
14.534 |
0.019 |
0.09% |
0.019 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/5001
|
-1.35% |
178.444 |
176.036 |
0.092 |
0.07% |
0.092 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/51
|
-1.27% |
55.746 |
55.036 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint8_t>/256
|
-1.19% |
419.551 |
414.557 |
1.345 |
0.00% |
1.345 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/28
|
-1.19% |
60.037 |
59.325 |
0.006 |
-0.00% |
0.006 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_GEN_LIN_RECUR_RAW/44217
|
-1.18% |
512.291 |
506.234 |
0.116 |
0.02% |
0.116 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_atan_novec_double_
|
-1.15% |
960.091 |
949.031 |
1.026 |
-0.11% |
1.026 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
-1.12% |
305.769 |
302.359 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
-1.11% |
64.325 |
63.613 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
-1.09% |
130.076 |
128.652 |
0.002 |
-0.00% |
0.002 |
|
MultiSource/Applications/siod/siod
Profile
|
-1.06% |
5.461 |
5.403 |
0.020 |
-0.20% |
0.020 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_uint8_t_
|
-1.05% |
335759.170 |
332226.021 |
275.920 |
-0.13% |
275.920 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC31
|
-1.01% |
70.040 |
69.329 |
0.001 |
0.00% |
0.001 |