|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
-35.09% |
27.159 |
17.630 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC31
|
-34.29% |
50.030 |
32.877 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC32
|
-34.26% |
51.460 |
33.831 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
-34.26% |
51.460 |
33.832 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
-33.82% |
97.202 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint8_t_
|
-33.51% |
28596.846 |
19013.665 |
2.598 |
0.02% |
2.598 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
-33.34% |
95.059 |
63.370 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC16
|
-33.33% |
27.875 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
-33.33% |
49.316 |
32.877 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC63
|
-33.33% |
95.059 |
63.374 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
-33.33% |
27.874 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC15
|
-33.33% |
26.444 |
17.630 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC64
|
-33.33% |
96.486 |
64.328 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1BigLoopWithReductionTC3
|
-32.05% |
27.874 |
18.940 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC2
|
-31.06% |
7.862 |
5.420 |
0.009 |
-0.67% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
-31.04% |
7.862 |
5.422 |
0.007 |
0.02% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
-30.43% |
16.438 |
11.436 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8
|
-26.09% |
16.438 |
12.150 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
-25.00% |
11.435 |
8.576 |
0.000 |
-0.01% |
0.000 |
|
SingleSource/Benchmarks/BenchmarkGame/puzzle
Profile
|
-23.79% |
1.121 |
0.854 |
0.001 |
-0.03% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
-23.07% |
9.291 |
7.147 |
0.001 |
-0.04% |
0.001 |
|
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-ary3
Profile
|
-22.56% |
5.986 |
4.636 |
0.030 |
0.23% |
0.030 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
-22.22% |
192.260 |
149.538 |
0.047 |
-0.13% |
0.047 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int64_t_
|
-21.86% |
864212.871 |
675266.152 |
759.549 |
-0.16% |
759.549 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128
|
-21.81% |
192.998 |
150.896 |
1.937 |
-1.58% |
1.937 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/999
|
-21.74% |
2169.143 |
1697.514 |
1.227 |
-0.04% |
1.227 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/999
|
-21.30% |
1100.676 |
866.225 |
1.221 |
-0.09% |
1.221 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_int64_t_
|
-20.92% |
1290463.100 |
1020523.324 |
636.956 |
-0.10% |
636.956 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256
|
-20.47% |
576.071 |
458.136 |
1.140 |
-0.16% |
1.140 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC4
|
-20.00% |
10.721 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
-19.68% |
305.186 |
245.136 |
2.000 |
-0.01% |
2.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/51
|
-19.56% |
131.506 |
105.779 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/51
|
-19.27% |
77.904 |
62.894 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128
|
-18.89% |
186.036 |
150.900 |
4.857 |
-0.03% |
4.857 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
-18.82% |
193.677 |
157.236 |
0.167 |
-0.01% |
0.167 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
-18.42% |
54.318 |
44.312 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/28
|
-18.26% |
82.192 |
67.184 |
0.014 |
-0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
-18.21% |
192.264 |
157.247 |
0.010 |
0.00% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint64_t_
|
-16.65% |
14302.344 |
11921.163 |
0.495 |
-0.01% |
0.495 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
-16.61% |
14302.160 |
11926.632 |
0.187 |
-0.01% |
0.187 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/16
|
-16.46% |
56.463 |
47.170 |
0.002 |
-0.01% |
0.002 |
|
SingleSource/Benchmarks/CoyoteBench/huffbench
Profile
|
-16.43% |
38.297 |
32.003 |
0.039 |
-0.10% |
0.039 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/171
|
-16.35% |
0.494 |
0.413 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/Shootout/Shootout-sieve
Profile
|
-15.36% |
8.409 |
7.118 |
0.049 |
-0.88% |
0.049 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/10
|
-14.76% |
43.599 |
37.165 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3
|
-14.28% |
10.006 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
-13.80% |
41.454 |
35.734 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/5001
|
-13.78% |
14.304 |
12.333 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC64
|
-13.47% |
74.336 |
64.326 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_INIT3_RAW/171
|
-13.07% |
0.618 |
0.537 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/171
|
-12.74% |
4.217 |
3.680 |
0.010 |
-0.02% |
0.010 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/171
|
-12.66% |
4.213 |
3.680 |
0.002 |
-0.07% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int64_t_
|
-12.05% |
780932.886 |
686868.369 |
249.248 |
0.01% |
249.248 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC1
|
-11.11% |
6.433 |
5.718 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
-11.00% |
170.108 |
151.390 |
3.866 |
-3.47% |
3.866 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
-10.99% |
138.649 |
123.410 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
-10.75% |
285.879 |
255.161 |
0.010 |
0.00% |
0.010 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/5001
|
-10.66% |
208.663 |
186.418 |
4.254 |
0.14% |
4.254 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
-10.61% |
140.080 |
125.221 |
0.025 |
0.00% |
0.025 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
-10.35% |
283.032 |
253.730 |
0.002 |
0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
-10.20% |
69.328 |
62.254 |
0.175 |
-0.02% |
0.175 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
-10.00% |
71.471 |
64.327 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
-9.55% |
37.165 |
33.614 |
0.260 |
-1.40% |
0.260 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_int32_t_
|
-9.30% |
439082.132 |
398256.454 |
712.362 |
-0.20% |
712.362 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16
|
-9.23% |
46.456 |
42.167 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
-9.09% |
7.862 |
7.147 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_uint8_t_
|
-8.92% |
437579.375 |
398566.306 |
762.853 |
0.14% |
762.853 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
-8.87% |
233.723 |
212.985 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
-8.84% |
35.020 |
31.924 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_INIT3_LAMBDA/5001
|
-8.67% |
21.587 |
19.716 |
0.298 |
-0.05% |
0.298 |
|
SingleSource/Benchmarks/Shootout/Shootout-matrix
Profile
|
-8.51% |
4.822 |
4.411 |
0.011 |
-0.16% |
0.011 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/1000
|
-8.30% |
2868221.311 |
2630180.451 |
50.250 |
-0.01% |
50.250 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/1000
|
-8.30% |
2868209.016 |
2630172.932 |
20.128 |
-0.01% |
20.128 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/1000
|
-8.30% |
2868094.262 |
2630101.504 |
56.266 |
-0.00% |
56.266 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/1000
|
-8.30% |
2868192.623 |
2630233.962 |
5690.004 |
-0.00% |
5690.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/1000
|
-8.28% |
2868315.574 |
2630763.158 |
46.077 |
0.00% |
46.077 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
-8.27% |
95.057 |
87.196 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
-8.23% |
2884.040 |
2646.666 |
2.378 |
-0.01% |
2.378 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/999
|
-8.18% |
2883.109 |
2647.337 |
0.048 |
0.00% |
0.048 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/10
|
-8.16% |
35.020 |
32.161 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_autovec_int32_t_
|
-8.16% |
588697.890 |
540643.239 |
4857.306 |
-0.75% |
4857.306 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
-8.11% |
1461.588 |
1342.984 |
5.394 |
-0.01% |
5.394 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/999
|
-8.02% |
1460.856 |
1343.682 |
0.020 |
-0.00% |
0.020 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
-8.00% |
759.740 |
698.970 |
0.016 |
-0.01% |
0.016 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
-7.99% |
17.867 |
16.439 |
0.241 |
-0.08% |
0.241 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, First>
|
-7.99% |
10461.999 |
9625.995 |
4.342 |
0.01% |
4.342 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
-7.94% |
152.946 |
140.804 |
1.211 |
0.01% |
1.211 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_uint8_t_
|
-7.90% |
578181.360 |
532533.026 |
1849.347 |
-0.39% |
1849.347 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
-7.87% |
63.610 |
58.606 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC1
|
-7.77% |
5.100 |
4.704 |
0.013 |
0.18% |
0.013 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/256
|
-7.72% |
759.005 |
700.409 |
0.024 |
-0.00% |
0.024 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
-7.69% |
9.292 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/32
|
-7.59% |
94396.089 |
87228.689 |
0.986 |
-0.01% |
0.986 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/32
|
-7.58% |
94374.815 |
87222.679 |
4.241 |
-0.00% |
4.241 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/32
|
-7.58% |
94377.697 |
87225.421 |
3.450 |
-0.01% |
3.450 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/32
|
-7.57% |
94375.624 |
87228.785 |
2.885 |
0.00% |
2.885 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/32
|
-7.57% |
94372.893 |
87229.159 |
1.720 |
-0.01% |
1.720 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
-7.39% |
39527.419 |
36604.455 |
0.725 |
0.00% |
0.725 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
-7.31% |
400.963 |
371.659 |
0.006 |
-0.00% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/256
|
-7.14% |
400.244 |
371.650 |
0.007 |
0.00% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
-7.14% |
20.012 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_uint8_t_
|
-6.96% |
579107.203 |
538818.040 |
2326.463 |
-0.15% |
2326.463 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
-6.90% |
165.809 |
154.372 |
0.006 |
-0.01% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/51
|
-6.87% |
166.527 |
155.088 |
0.006 |
-0.01% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
-6.67% |
10.721 |
10.006 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/5001
|
-6.56% |
120.874 |
112.950 |
0.190 |
0.08% |
0.190 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
-6.43% |
100.057 |
93.628 |
0.001 |
0.00% |
0.001 |
|
MultiSource/Benchmarks/SciMark2-C/scimark2
Profile
|
-6.27% |
55.540 |
52.060 |
0.012 |
-0.03% |
0.012 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/44217
|
-6.16% |
5409.523 |
5076.478 |
24.772 |
0.27% |
24.772 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC1
|
-6.07% |
13.212 |
12.411 |
0.033 |
0.01% |
0.033 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
-6.07% |
23.586 |
22.155 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
-6.06% |
23.586 |
22.156 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/51
|
-6.06% |
94.343 |
88.625 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
-6.06% |
23.586 |
22.156 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
-6.06% |
23.586 |
22.156 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
-6.06% |
23.585 |
22.156 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
-6.06% |
23.585 |
22.156 |
0.000 |
0.00% |
0.000 |
|
SingleSource/Benchmarks/BenchmarkGame/n-body
Profile
|
-5.93% |
2.631 |
2.475 |
0.000 |
0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8
|
-5.88% |
12.150 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC4
|
-5.88% |
12.150 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/44217
|
-5.88% |
5409.046 |
5091.152 |
8.915 |
-0.06% |
8.915 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/44217
|
-5.87% |
3720.713 |
3502.164 |
32.443 |
-1.17% |
32.443 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Mid>
|
-5.87% |
16581.292 |
15607.773 |
0.590 |
-0.01% |
0.590 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
-5.86% |
16580.641 |
15608.263 |
0.048 |
-0.00% |
0.048 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/28
|
-5.69% |
62.897 |
59.319 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/28
|
-5.68% |
100.777 |
95.055 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/44217
|
-5.50% |
3729.554 |
3524.523 |
15.512 |
-0.81% |
15.512 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC1
|
-5.44% |
4.998 |
4.726 |
0.033 |
0.36% |
0.033 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16
|
-5.43% |
65.755 |
62.181 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/128
|
-5.40% |
2920.754 |
2763.087 |
0.551 |
-0.01% |
0.551 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/16
|
-5.37% |
66.467 |
62.895 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/5001
|
-5.22% |
119.615 |
113.376 |
0.110 |
-0.04% |
0.110 |
|
MultiSource/Benchmarks/Olden/tsp/tsp
Profile
|
-5.13% |
3.267 |
3.100 |
0.007 |
0.08% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
-5.04% |
588087.748 |
558420.128 |
5182.052 |
-0.37% |
5182.052 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/44217
|
-5.01% |
1599.433 |
1519.229 |
10.836 |
-0.57% |
10.836 |
|
SingleSource/Benchmarks/McGill/queens
Profile
|
-4.95% |
3.395 |
3.227 |
0.000 |
-0.03% |
0.000 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/64
|
-4.81% |
725.054 |
690.154 |
0.130 |
-0.02% |
0.130 |
|
MultiSource/Applications/obsequi/Obsequi
Profile
|
-4.79% |
4.425 |
4.213 |
0.005 |
0.06% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/16
|
-4.69% |
45.742 |
43.597 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/1024/1024
|
-4.68% |
55583.077 |
52980.308 |
132.243 |
-0.93% |
132.243 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/32
|
-4.62% |
178.959 |
170.690 |
0.109 |
0.00% |
0.109 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/256
|
-4.58% |
11941.712 |
11395.279 |
14.674 |
-0.05% |
14.674 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/256
|
-4.16% |
413.128 |
395.940 |
0.009 |
-0.01% |
0.009 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, First>
|
-4.16% |
8791.184 |
8425.593 |
0.247 |
-0.00% |
0.247 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
-4.02% |
157.945 |
151.590 |
1.246 |
0.04% |
1.246 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/44217
|
-3.92% |
1607.572 |
1544.587 |
4.743 |
-0.18% |
4.743 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
-3.85% |
37.166 |
35.735 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC7
|
-3.85% |
18.583 |
17.869 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Mid>
|
-3.84% |
25374.551 |
24399.665 |
0.307 |
-0.04% |
0.307 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_uint8_t_
|
-3.80% |
280529.577 |
269875.724 |
373.968 |
-0.15% |
373.968 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
-3.77% |
37.879 |
36.450 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/Misc/mandel
Profile
|
-3.56% |
1.328 |
1.281 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/28
|
-3.51% |
40.739 |
39.309 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC1
|
-3.48% |
4.973 |
4.800 |
0.054 |
-1.37% |
0.054 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
-3.45% |
20.726 |
20.012 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC1
|
-3.42% |
13.204 |
12.753 |
0.011 |
-0.08% |
0.011 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/10
|
-3.39% |
42.169 |
40.738 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC1
|
-3.38% |
5.057 |
4.886 |
0.027 |
-0.79% |
0.027 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
-3.33% |
43918.308 |
42454.540 |
1.250 |
-0.00% |
1.250 |
|
MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk
Profile
|
-3.23% |
38.883 |
37.625 |
0.442 |
-0.84% |
0.442 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/10
|
-3.23% |
22.157 |
21.441 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/51
|
-3.23% |
22.156 |
21.441 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/999
|
-3.23% |
22.156 |
21.441 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/16
|
-3.23% |
22.157 |
21.442 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/256
|
-3.23% |
22.157 |
21.442 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/28
|
-3.23% |
22.156 |
21.441 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
-3.22% |
30249.892 |
29274.750 |
4.433 |
-0.00% |
4.433 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
-3.12% |
31225.598 |
30251.156 |
0.681 |
0.00% |
0.681 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
-3.11% |
5863.207 |
5680.712 |
4.093 |
-0.01% |
4.093 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC1
|
-3.09% |
5.037 |
4.881 |
0.046 |
-0.01% |
0.046 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_1D_RAW/5001
|
-3.08% |
14.471 |
14.026 |
0.172 |
-0.42% |
0.172 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
-3.03% |
24161.081 |
23429.700 |
0.203 |
-0.01% |
0.203 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
-3.02% |
13805.688 |
13388.089 |
0.299 |
-0.01% |
0.299 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
-3.00% |
14.472 |
14.038 |
0.075 |
-0.79% |
0.075 |
|
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_PRESSURE_CALC_LAMBDA/171
|
-3.00% |
1.359 |
1.319 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
-2.95% |
48.602 |
47.170 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/10
|
-2.89% |
49.313 |
47.886 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/999
|
-2.86% |
765.161 |
743.301 |
0.013 |
-0.00% |
0.013 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
-2.84% |
12815.462 |
12451.162 |
0.467 |
-0.00% |
0.467 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
-2.78% |
21080.134 |
20494.379 |
2.682 |
-0.01% |
2.682 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/512/512
|
-2.76% |
14065.500 |
13677.510 |
147.403 |
-0.86% |
147.403 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
-2.63% |
18530.628 |
18042.922 |
0.335 |
-0.00% |
0.335 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/16
|
-2.60% |
55.034 |
53.603 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC7
|
-2.53% |
61.481 |
59.925 |
0.010 |
-0.05% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
-2.53% |
155.455 |
151.523 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/16
|
-2.50% |
43.729 |
42.636 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
-2.48% |
16732.154 |
16316.372 |
0.244 |
0.00% |
0.244 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
-2.44% |
29.303 |
28.589 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
-2.41% |
157.486 |
153.692 |
0.017 |
-0.01% |
0.017 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, None>
|
-2.38% |
15377.032 |
15010.768 |
0.443 |
-0.00% |
0.443 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_uint8_t_
|
-2.38% |
279762.787 |
273111.155 |
438.425 |
-0.04% |
438.425 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/16
|
-2.27% |
31.447 |
30.733 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/16
|
-2.22% |
32.162 |
31.448 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/BitBench/drop3/drop3
Profile
|
-2.18% |
0.520 |
0.508 |
0.002 |
-0.63% |
0.002 |
|
MultiSource/Benchmarks/Trimaran/enc-3des/enc-3des
Profile
|
-2.17% |
3.142 |
3.074 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/10
|
-2.09% |
34.308 |
33.590 |
0.001 |
-0.01% |
0.001 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
Profile
|
-2.07% |
2.470 |
2.419 |
0.003 |
-0.24% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC15
|
-2.00% |
35.736 |
35.022 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC7
|
-1.99% |
62.334 |
61.090 |
0.034 |
-0.02% |
0.034 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC1
|
-1.98% |
13.169 |
12.908 |
0.003 |
-0.03% |
0.003 |
|
MultiSource/Benchmarks/ASC_Sequoia/IRSmk/IRSmk
Profile
|
-1.97% |
16.267 |
15.946 |
0.122 |
1.02% |
0.122 |
|
SingleSource/Benchmarks/Polybench/stencils/seidel-2d/seidel-2d
Profile
|
-1.95% |
164.246 |
161.039 |
0.137 |
0.02% |
0.137 |
|
SingleSource/Benchmarks/CoyoteBench/lpbench
Profile
|
-1.92% |
7.762 |
7.613 |
0.001 |
0.12% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC16
|
-1.88% |
37.880 |
37.166 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10
|
-1.88% |
37.879 |
37.166 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC16
|
-1.88% |
37.880 |
37.167 |
0.000 |
0.00% |
0.000 |
|
SingleSource/Benchmarks/Misc/perlin
Profile
|
-1.88% |
7.273 |
7.136 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/28
|
-1.78% |
40.024 |
39.309 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/28
|
-1.77% |
80.764 |
79.335 |
0.000 |
0.00% |
0.000 |
|
SingleSource/Benchmarks/Misc/flops-8
Profile
|
-1.73% |
1.768 |
1.737 |
0.000 |
0.00% |
0.000 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/CoMD/CoMD
Profile
|
-1.70% |
5.051 |
4.965 |
0.001 |
-0.11% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/16
|
-1.67% |
42.883 |
42.168 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/16
|
-1.59% |
45.027 |
44.313 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/51
|
-1.53% |
93.627 |
92.199 |
0.001 |
0.00% |
0.001 |
|
MultiSource/Benchmarks/Bullet/bullet
Profile
|
-1.44% |
14.713 |
14.501 |
0.031 |
-0.13% |
0.031 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int64_t_
|
-1.39% |
931225.333 |
918313.725 |
131.101 |
-0.05% |
131.101 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/44217
|
-1.34% |
129.256 |
127.530 |
0.202 |
0.02% |
0.202 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/999
|
-1.30% |
1477.285 |
1458.037 |
0.025 |
-0.00% |
0.025 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/5001
|
-1.30% |
177.976 |
175.661 |
0.251 |
-0.06% |
0.251 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/51
|
-1.28% |
55.747 |
55.033 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_GEN_LIN_RECUR_RAW/44217
|
-1.22% |
512.433 |
506.200 |
0.510 |
0.01% |
0.510 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/28
|
-1.19% |
60.037 |
59.321 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/5001
|
-1.19% |
177.794 |
175.682 |
0.497 |
0.06% |
0.497 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint8_t_To_uint64_t_
|
-1.19% |
21722.088 |
21464.559 |
6.971 |
-0.16% |
6.971 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int32_t_
|
-1.18% |
548634.871 |
542151.351 |
1547.730 |
0.01% |
1547.730 |
|
MultiSource/Benchmarks/VersaBench/8b10b/8b10b
Profile
|
-1.14% |
7.423 |
7.338 |
0.011 |
-0.08% |
0.011 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_uint8_t_
|
-1.14% |
336833.173 |
332990.458 |
155.940 |
0.07% |
155.940 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
-1.11% |
64.323 |
63.609 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
-1.10% |
130.079 |
128.650 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
-1.09% |
305.679 |
302.343 |
0.009 |
0.00% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC127
|
-1.05% |
153.122 |
151.520 |
0.005 |
-0.00% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC31
|
-1.02% |
70.044 |
69.327 |
0.003 |
-0.00% |
0.003 |