|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
-35.09% |
27.161 |
17.630 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC31
|
-34.29% |
50.031 |
32.876 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC32
|
-34.26% |
51.462 |
33.829 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
-34.26% |
51.462 |
33.831 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
-33.83% |
97.206 |
64.322 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint8_t_
|
-33.52% |
28596.878 |
19012.084 |
9.461 |
0.01% |
9.461 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
-33.34% |
49.317 |
32.876 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
-33.34% |
95.066 |
63.373 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
-33.34% |
27.875 |
18.582 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC64
|
-33.33% |
96.488 |
64.324 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC16
|
-33.33% |
27.875 |
18.583 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC15
|
-33.33% |
26.445 |
17.630 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC63
|
-33.33% |
95.057 |
63.373 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1BigLoopWithReductionTC3
|
-32.06% |
27.875 |
18.940 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
-31.16% |
7.862 |
5.413 |
0.023 |
-0.11% |
0.023 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC2
|
-30.76% |
7.862 |
5.444 |
0.023 |
-0.23% |
0.023 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
-30.44% |
16.439 |
11.436 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
-28.58% |
15.010 |
10.721 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC7
|
-28.57% |
15.009 |
10.721 |
0.000 |
0.00% |
0.000 |
|
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-ary3
|
-26.50% |
6.115 |
4.494 |
0.070 |
-2.08% |
0.070 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8
|
-26.09% |
16.440 |
12.150 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
-25.00% |
11.436 |
8.577 |
0.000 |
-0.00% |
0.000 |
|
SingleSource/Benchmarks/BenchmarkGame/puzzle
|
-24.81% |
1.119 |
0.842 |
0.006 |
-1.22% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
-23.07% |
9.292 |
7.149 |
0.003 |
-0.02% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
-22.16% |
192.268 |
149.669 |
0.212 |
-0.04% |
0.212 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128
|
-21.74% |
192.986 |
151.034 |
2.100 |
-1.49% |
2.100 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int64_t_
|
-21.71% |
863142.151 |
675714.700 |
489.211 |
-0.10% |
489.211 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/999
|
-21.71% |
2169.139 |
1698.195 |
2.116 |
-0.00% |
2.116 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/999
|
-21.50% |
1100.694 |
864.085 |
1.349 |
0.00% |
1.349 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_int64_t_
|
-21.18% |
1292540.590 |
1018743.440 |
675.153 |
-0.28% |
675.153 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256
|
-20.35% |
576.068 |
458.844 |
2.354 |
-0.01% |
2.354 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC4
|
-20.01% |
10.722 |
8.576 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
-19.67% |
305.193 |
245.157 |
1.346 |
-0.01% |
1.346 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/51
|
-19.57% |
131.513 |
105.780 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/51
|
-19.27% |
77.906 |
62.896 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
-19.08% |
193.693 |
156.732 |
0.514 |
-0.33% |
0.514 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
-18.42% |
54.319 |
44.314 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/28
|
-18.26% |
82.194 |
67.185 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
-18.22% |
192.273 |
157.242 |
0.195 |
-0.00% |
0.195 |
|
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_PRESSURE_CALC_LAMBDA/5001
|
-17.27% |
48.879 |
40.437 |
1.913 |
-2.03% |
1.913 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
-16.67% |
14313.264 |
11927.289 |
0.129 |
-0.01% |
0.129 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint64_t_
|
-16.65% |
14302.875 |
11921.227 |
8.802 |
-0.01% |
8.802 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/16
|
-16.46% |
56.466 |
47.173 |
0.001 |
-0.01% |
0.001 |
|
SingleSource/Benchmarks/CoyoteBench/huffbench
|
-16.39% |
38.274 |
32.002 |
0.042 |
-0.10% |
0.042 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/171
|
-16.35% |
0.494 |
0.413 |
0.005 |
0.00% |
0.005 |
|
SingleSource/Benchmarks/Shootout/Shootout-sieve
|
-15.39% |
8.428 |
7.130 |
0.050 |
-0.70% |
0.050 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
-15.32% |
166.284 |
140.802 |
0.577 |
0.00% |
0.577 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/10
|
-14.75% |
43.598 |
37.165 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3
|
-14.29% |
10.007 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
-13.79% |
41.454 |
35.737 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/5001
|
-13.78% |
14.304 |
12.333 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC64
|
-13.47% |
74.336 |
64.324 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_INIT3_RAW/171
|
-13.12% |
0.619 |
0.537 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/171
|
-12.66% |
4.214 |
3.680 |
0.002 |
-0.06% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/171
|
-12.65% |
4.214 |
3.681 |
0.009 |
0.01% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int64_t_
|
-12.14% |
781919.643 |
687033.366 |
199.428 |
0.03% |
199.428 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC1
|
-11.12% |
6.433 |
5.718 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
-11.00% |
138.661 |
123.407 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
-10.97% |
169.396 |
150.815 |
3.756 |
-3.84% |
3.756 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
-10.75% |
285.899 |
255.161 |
0.004 |
0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
-10.60% |
140.092 |
125.240 |
0.000 |
0.02% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
-10.35% |
283.027 |
253.726 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
-10.21% |
69.328 |
62.252 |
0.189 |
-0.02% |
0.189 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128
|
-10.01% |
153.399 |
138.047 |
2.902 |
-0.02% |
2.902 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
-10.01% |
71.475 |
64.323 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/5001
|
-9.38% |
205.903 |
186.584 |
6.130 |
0.02% |
6.130 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_int32_t_
|
-9.28% |
439283.019 |
398509.714 |
1220.427 |
-0.13% |
1220.427 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16
|
-9.23% |
46.458 |
42.168 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
-9.15% |
234.434 |
212.989 |
0.006 |
-0.00% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
-9.10% |
7.862 |
7.147 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_autovec_int32_t_
|
-8.95% |
596571.429 |
543149.649 |
1809.040 |
-0.29% |
1809.040 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
-8.85% |
35.023 |
31.923 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/5001
|
-8.74% |
204.643 |
186.760 |
2.081 |
0.32% |
2.081 |
|
SingleSource/Benchmarks/Shootout/Shootout-matrix
|
-8.52% |
4.822 |
4.411 |
0.000 |
-0.17% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_with_cond_arith_autovec_uint8_t_
|
-8.46% |
437982.478 |
400928.981 |
2127.653 |
-0.61% |
2127.653 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
-8.37% |
37.167 |
34.057 |
0.041 |
-0.11% |
0.041 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/1000
|
-8.30% |
2868139.344 |
2630109.023 |
129.370 |
-0.00% |
129.370 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/1000
|
-8.30% |
2868209.016 |
2630259.398 |
37.885 |
-0.00% |
37.885 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/1000
|
-8.29% |
2868094.262 |
2630278.195 |
3111.784 |
-0.00% |
3111.784 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/1000
|
-8.29% |
2868090.164 |
2630278.195 |
72.270 |
-0.00% |
72.270 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/1000
|
-8.28% |
2868233.607 |
2630729.323 |
174.028 |
-0.00% |
174.028 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
-8.27% |
95.058 |
87.197 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
-8.22% |
2883.949 |
2646.792 |
0.010 |
-0.01% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/999
|
-8.18% |
2883.037 |
2647.320 |
4.152 |
0.00% |
4.152 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/10
|
-8.16% |
35.022 |
32.164 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
-8.11% |
1461.573 |
1342.978 |
1.678 |
-0.01% |
1.678 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/999
|
-8.02% |
1460.879 |
1343.648 |
0.043 |
0.00% |
0.043 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, First>
|
-8.00% |
10462.491 |
9625.189 |
7.513 |
0.00% |
7.513 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
-7.99% |
759.719 |
698.989 |
0.017 |
-0.01% |
0.017 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
-7.87% |
63.610 |
58.606 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/256
|
-7.72% |
759.009 |
700.414 |
0.992 |
0.00% |
0.992 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
-7.70% |
9.292 |
8.576 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointIncreasing/32
|
-7.57% |
94378.320 |
87229.811 |
2.411 |
-0.00% |
2.411 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDEqualsA/32
|
-7.57% |
94376.298 |
87230.031 |
0.249 |
-0.00% |
0.249 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersAllDisjointDecreasing/32
|
-7.57% |
94375.185 |
87231.181 |
1.579 |
-0.00% |
1.579 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDBeforeA/32
|
-7.57% |
94374.815 |
87231.488 |
1.985 |
0.00% |
1.985 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchVecWithRuntimeChecks4PointersDAfterA/32
|
-7.57% |
94372.438 |
87231.555 |
2.025 |
-0.00% |
2.025 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
-7.40% |
39530.216 |
36605.208 |
31.364 |
0.00% |
31.364 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_uint8_t_
|
-7.37% |
575406.146 |
533011.592 |
1844.803 |
-0.30% |
1844.803 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
-7.31% |
400.961 |
371.661 |
0.007 |
-0.00% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
-7.24% |
17.869 |
16.576 |
0.174 |
0.62% |
0.174 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
-7.15% |
20.013 |
18.582 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/256
|
-7.15% |
400.249 |
371.646 |
0.006 |
-0.00% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
-6.90% |
165.818 |
154.383 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/51
|
-6.87% |
166.529 |
155.096 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
-6.68% |
10.721 |
10.006 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_DIFF_PREDICT_LAMBDA/44217
|
-6.57% |
5417.836 |
5062.147 |
19.214 |
-0.63% |
19.214 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_uint8_t_
|
-6.49% |
579428.214 |
541828.482 |
1550.638 |
-0.23% |
1550.638 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_DIFF_PREDICT_RAW/44217
|
-6.46% |
5413.615 |
5063.912 |
11.867 |
0.02% |
11.867 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
-6.43% |
100.062 |
93.632 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/SciMark2-C/scimark2
|
-6.23% |
55.529 |
52.067 |
0.012 |
-0.01% |
0.012 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
-6.06% |
23.586 |
22.155 |
0.022 |
-0.00% |
0.022 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
-6.06% |
23.586 |
22.156 |
0.018 |
-0.00% |
0.018 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
-6.06% |
23.586 |
22.156 |
0.026 |
0.00% |
0.026 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
-6.06% |
23.586 |
22.156 |
0.026 |
-0.00% |
0.026 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/51
|
-6.06% |
94.341 |
88.623 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
-6.06% |
23.586 |
22.156 |
0.026 |
-0.00% |
0.026 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
-6.06% |
23.585 |
22.157 |
0.022 |
0.00% |
0.022 |
|
SingleSource/Benchmarks/BenchmarkGame/n-body
|
-5.98% |
2.630 |
2.473 |
0.001 |
0.03% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/5001
|
-5.94% |
119.594 |
112.494 |
0.471 |
0.12% |
0.471 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8
|
-5.89% |
12.151 |
11.435 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC4
|
-5.89% |
12.150 |
11.435 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
-5.87% |
16581.296 |
15607.689 |
10.863 |
-0.01% |
10.863 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Mid>
|
-5.87% |
16580.901 |
15608.344 |
11.124 |
-0.01% |
11.124 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
-5.74% |
599819.195 |
565400.000 |
2504.100 |
-0.20% |
2504.100 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/28
|
-5.68% |
62.896 |
59.323 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/28
|
-5.67% |
100.776 |
95.058 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/5001
|
-5.53% |
119.870 |
113.239 |
0.089 |
0.12% |
0.089 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_INT_PREDICT_RAW/44217
|
-5.47% |
3742.097 |
3537.538 |
11.589 |
-0.17% |
11.589 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16
|
-5.44% |
65.754 |
62.180 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/128
|
-5.43% |
2920.862 |
2762.190 |
7.766 |
-0.04% |
7.766 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/16
|
-5.38% |
66.470 |
62.896 |
0.675 |
-0.01% |
0.675 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_INT_PREDICT_LAMBDA/44217
|
-5.26% |
3734.713 |
3538.359 |
22.671 |
-0.42% |
22.671 |
|
MultiSource/Benchmarks/ASC_Sequoia/AMGmk/AMGmk
|
-5.09% |
38.677 |
36.707 |
0.619 |
-0.82% |
0.619 |
|
MultiSource/Benchmarks/Olden/tsp/tsp
|
-5.06% |
3.265 |
3.100 |
0.007 |
0.08% |
0.007 |
|
SingleSource/Benchmarks/McGill/queens
|
-4.93% |
3.395 |
3.227 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/64
|
-4.84% |
725.131 |
690.004 |
1.022 |
-0.04% |
1.022 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/16
|
-4.69% |
45.742 |
43.598 |
0.338 |
-0.00% |
0.338 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/32
|
-4.61% |
178.953 |
170.702 |
0.134 |
0.00% |
0.134 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/44217
|
-4.41% |
1602.142 |
1531.454 |
13.857 |
0.23% |
13.857 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/256
|
-4.38% |
11924.237 |
11401.557 |
12.028 |
0.01% |
12.028 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, First>
|
-4.15% |
8791.168 |
8426.007 |
7.882 |
0.00% |
7.882 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/44217
|
-4.09% |
1606.757 |
1541.015 |
2.899 |
-0.41% |
2.899 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
-3.93% |
157.926 |
151.711 |
1.247 |
0.03% |
1.247 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC7
|
-3.85% |
18.583 |
17.867 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Mid>
|
-3.85% |
25375.630 |
24399.052 |
24.741 |
-0.05% |
24.741 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
-3.85% |
37.166 |
35.737 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
-3.78% |
37.881 |
36.450 |
0.001 |
0.00% |
0.001 |
|
SingleSource/Benchmarks/Misc/mandel
|
-3.53% |
1.328 |
1.281 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/28
|
-3.51% |
40.738 |
39.310 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
-3.46% |
20.728 |
20.011 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/256
|
-3.43% |
410.011 |
395.968 |
0.007 |
-0.00% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/10
|
-3.39% |
42.170 |
40.740 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC1
|
-3.34% |
13.203 |
12.762 |
0.007 |
-0.01% |
0.007 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
-3.33% |
43919.935 |
42456.756 |
32.057 |
0.00% |
32.057 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/16
|
-3.23% |
22.157 |
21.442 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/28
|
-3.23% |
22.157 |
21.442 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/999
|
-3.23% |
22.156 |
21.441 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/256
|
-3.22% |
22.156 |
21.442 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/10
|
-3.22% |
22.156 |
21.442 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/51
|
-3.22% |
22.155 |
21.442 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
-3.22% |
30251.243 |
29276.872 |
28.378 |
0.00% |
28.378 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/1024/1024
|
-3.20% |
54959.846 |
53201.692 |
59.889 |
-0.51% |
59.889 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
-3.13% |
31227.561 |
30250.745 |
21.903 |
0.00% |
21.903 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
-3.11% |
5863.099 |
5680.669 |
4.013 |
-0.01% |
4.013 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
-3.03% |
13806.006 |
13387.822 |
9.095 |
-0.01% |
9.095 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
-2.99% |
24161.770 |
23438.779 |
10.679 |
0.03% |
10.679 |
|
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_PRESSURE_CALC_LAMBDA/171
|
-2.99% |
1.359 |
1.319 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/2048/2048
|
-2.94% |
342237.000 |
332160.500 |
3448.666 |
1.32% |
3448.666 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
-2.94% |
48.601 |
47.173 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/10
|
-2.90% |
49.315 |
47.885 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
-2.84% |
12816.819 |
12452.265 |
8.096 |
-0.00% |
8.096 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_uint8_t_
|
-2.84% |
280995.964 |
273011.337 |
484.786 |
0.19% |
484.786 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
-2.78% |
21080.681 |
20494.979 |
19.614 |
-0.00% |
19.614 |
|
MultiSource/Benchmarks/llubenchmark/llu
|
-2.66% |
23.625 |
22.997 |
0.241 |
-0.57% |
0.241 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
-2.63% |
18531.494 |
18043.382 |
14.300 |
-0.00% |
14.300 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/16
|
-2.60% |
55.033 |
53.603 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/999
|
-2.53% |
762.621 |
743.308 |
0.005 |
0.00% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
-2.53% |
155.454 |
151.520 |
0.091 |
-0.01% |
0.091 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC7
|
-2.49% |
61.484 |
59.951 |
0.008 |
-0.00% |
0.008 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
-2.49% |
16732.466 |
16316.575 |
12.699 |
-0.00% |
12.699 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
-2.44% |
29.303 |
28.589 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, None>
|
-2.38% |
15377.919 |
15011.774 |
7.901 |
-0.00% |
7.901 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
-2.29% |
14.481 |
14.150 |
0.022 |
-0.00% |
0.022 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/16
|
-2.27% |
31.448 |
30.734 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/16
|
-2.25% |
43.618 |
42.636 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint8_t>/16
|
-2.22% |
32.161 |
31.447 |
0.004 |
-0.00% |
0.004 |
|
MultiSource/Benchmarks/Trimaran/enc-3des/enc-3des
|
-2.20% |
3.142 |
3.073 |
0.000 |
-0.03% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_2D_LAMBDA/5001
|
-2.19% |
178.388 |
174.475 |
1.059 |
-0.58% |
1.059 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC7
|
-2.13% |
62.337 |
61.007 |
0.057 |
-0.16% |
0.057 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/10
|
-2.09% |
34.307 |
33.592 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
|
-2.03% |
2.468 |
2.418 |
0.003 |
-0.28% |
0.003 |
|
SingleSource/Benchmarks/CoyoteBench/lpbench
|
-2.02% |
7.749 |
7.592 |
0.010 |
-0.04% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC15
|
-2.01% |
35.739 |
35.020 |
0.001 |
-0.01% |
0.001 |
|
SingleSource/Benchmarks/Polybench/stencils/seidel-2d/seidel-2d
|
-1.97% |
164.261 |
161.024 |
0.138 |
0.01% |
0.138 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC1
|
-1.92% |
13.161 |
12.909 |
0.006 |
-0.02% |
0.006 |
|
SingleSource/Benchmarks/Misc/perlin
|
-1.90% |
7.274 |
7.136 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC16
|
-1.89% |
37.882 |
37.165 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC16
|
-1.89% |
37.881 |
37.164 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10
|
-1.88% |
37.879 |
37.166 |
0.001 |
0.00% |
0.001 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/Pathfinder/PathFinder
|
-1.83% |
7.170 |
7.039 |
0.010 |
-2.46% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/28
|
-1.78% |
40.024 |
39.311 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/28
|
-1.77% |
80.765 |
79.335 |
0.002 |
0.00% |
0.002 |
|
SingleSource/Benchmarks/Misc/flops-8
|
-1.70% |
1.768 |
1.738 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/512/512
|
-1.70% |
13987.760 |
13750.039 |
87.992 |
-0.33% |
87.992 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint32_t>/127
|
-1.68% |
170.109 |
167.251 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/16
|
-1.66% |
42.882 |
42.170 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/16
|
-1.59% |
45.027 |
44.311 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIND_FIRST_MIN_RAW/44217
|
-1.56% |
129.021 |
127.007 |
0.440 |
-0.25% |
0.440 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_uint8_t_
|
-1.55% |
280000.404 |
275669.964 |
756.913 |
-0.00% |
756.913 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int64_t_
|
-1.54% |
930502.674 |
916176.316 |
1637.383 |
-0.13% |
1637.383 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/51
|
-1.52% |
93.627 |
92.201 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC127
|
-1.52% |
153.861 |
151.525 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_GEN_LIN_RECUR_RAW/44217
|
-1.30% |
512.646 |
505.972 |
0.126 |
-0.01% |
0.126 |
|
MultiSource/Applications/siod/siod
|
-1.30% |
5.478 |
5.407 |
0.007 |
-0.14% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/51
|
-1.28% |
55.748 |
55.035 |
0.001 |
-0.01% |
0.001 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/CoMD/CoMD
|
-1.25% |
5.035 |
4.972 |
0.011 |
0.03% |
0.011 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256
|
-1.24% |
460.999 |
455.290 |
0.001 |
-0.00% |
0.001 |
|
MultiSource/Benchmarks/Bullet/bullet
|
-1.19% |
14.674 |
14.498 |
0.035 |
-0.15% |
0.035 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint64_t>/28
|
-1.19% |
60.037 |
59.322 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
-1.12% |
305.760 |
302.336 |
0.009 |
-0.00% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint8_t_To_uint64_t_
|
-1.11% |
21730.067 |
21488.338 |
2.941 |
-0.05% |
2.941 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
-1.10% |
64.323 |
63.613 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
-1.10% |
130.081 |
128.654 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_2D_RAW/5001
|
-1.04% |
177.636 |
175.795 |
0.112 |
-0.02% |
0.112 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC31
|
-1.02% |
70.044 |
69.326 |
0.003 |
-0.00% |
0.003 |