| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31 | 52.16% | 32.881 | 50.031 | 0.001 | 52.17% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63 | 51.12% | 63.377 | 95.774 | 0.003 | 51.13% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32 | 50.00% | 33.832 | 50.748 | 0.002 | 50.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16 | 50.00% | 18.583 | 27.874 | 0.001 | 49.99% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15 | 49.99% | 17.631 | 26.445 | 0.000 | 50.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64 | 49.98% | 64.331 | 96.482 | 0.006 | 49.99% | 0.006 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7 | 46.64% | 10.722 | 15.724 | 0.000 | 46.66% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2 | 44.65% | 5.435 | 7.862 | 0.000 | 44.05% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8 | 43.74% | 11.437 | 16.439 | 0.000 | 43.75% | 0.000 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/171 | 32.06% | 0.375 | 0.495 | 0.006 | 32.05% | 0.006 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/999 | 28.04% | 1694.794 | 2169.950 | 4.828 | 28.04% | 4.828 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/999 | 27.53% | 864.174 | 1102.042 | 0.037 | 27.53% | 0.037 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256 | 26.68% | 455.316 | 576.774 | 0.023 | 26.68% | 0.023 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4 | 24.99% | 8.577 | 10.721 | 0.000 | 25.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51 | 24.70% | 63.617 | 79.333 | 0.001 | 24.71% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/256 | 24.69% | 245.891 | 306.613 | 0.009 | 24.70% | 0.009 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, EqZero, First> | 23.08% | 11898.197 | 14644.174 | 0.598 | 23.07% | 0.598 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/51 | 22.81% | 106.499 | 130.793 | 0.001 | 22.81% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128 | 22.04% | 158.128 | 192.975 | 4.112 | 22.24% | 4.112 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/28 | 21.86% | 45.747 | 55.747 | 0.002 | 21.86% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127 | 21.80% | 157.264 | 191.549 | 0.014 | 21.78% | 0.014 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/28 | 19.98% | 67.905 | 81.475 | 0.003 | 19.99% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint8_t_ | 19.97% | 11928.735 | 14311.280 | 1.535 | 19.98% | 1.535 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_ | 19.95% | 11928.215 | 14307.840 | 2.077 | 19.95% | 2.077 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint8_t_ | 19.93% | 11930.824 | 14308.741 | 2.626 | 19.94% | 2.626 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC2 | 19.93% | 7.151 | 8.576 | 0.000 | 19.93% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2 | 19.92% | 7.152 | 8.577 | 0.000 | 19.95% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint8_t_ | 19.87% | 11930.806 | 14301.706 | 4.688 | 19.88% | 4.688 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint8_t_To_uint32_t_ | 19.84% | 11933.958 | 14301.978 | 0.224 | 19.98% | 0.224 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint32_t_ | 19.84% | 11934.120 | 14301.835 | 5.972 | 19.85% | 5.972 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC2 | 19.81% | 7.159 | 8.577 | 0.000 | 19.91% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint32_t_ | 19.78% | 11940.022 | 14301.453 | 0.308 | 19.74% | 0.308 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/5001 | 16.85% | 12.242 | 14.305 | 0.000 | 16.96% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/65 | 16.43% | 82.916 | 96.541 | 0.627 | 16.44% | 0.627 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/16 | 16.40% | 47.892 | 55.747 | 0.002 | 16.41% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/16 | 15.27% | 37.203 | 42.883 | 0.001 | 15.38% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC64 | 14.28% | 64.331 | 73.520 | 0.774 | 14.29% | 0.774 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/16 | 13.63% | 31.450 | 35.735 | 0.001 | 13.63% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int64_t_ | 13.39% | 690906.219 | 783393.498 | 1312.453 | 13.48% | 1312.453 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/10 | 13.19% | 37.886 | 42.884 | 0.001 | 13.20% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC1 | 12.50% | 5.718 | 6.433 | 0.000 | 12.50% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128 | 12.04% | 255.173 | 285.893 | 0.293 | 12.04% | 0.293 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64 | 11.80% | 125.296 | 140.086 | 0.003 | 11.84% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63 | 11.77% | 123.420 | 137.948 | 0.001 | 11.78% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127 | 11.54% | 253.747 | 283.034 | 2.063 | 11.55% | 2.063 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/65 | 11.38% | 144.391 | 160.817 | 0.003 | 11.38% | 0.003 |
| MultiSource/Benchmarks/Trimaran/enc-pc1/enc-pc1 | 11.26% | 0.571 | 0.636 | 0.000 | 11.41% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32 | 11.11% | 64.328 | 71.473 | 0.001 | 11.11% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/10 | 10.86% | 32.880 | 36.450 | 0.001 | 10.86% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31 | 10.80% | 62.572 | 69.329 | 0.001 | 10.58% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/28 | 10.70% | 40.028 | 44.313 | 0.002 | 10.71% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/127 | 10.57% | 270.216 | 298.778 | 8.289 | 10.58% | 8.289 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16 | 10.16% | 42.171 | 46.457 | 0.002 | 10.17% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10 | 9.99% | 28.590 | 31.448 | 0.001 | 10.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3 | 9.99% | 7.148 | 7.862 | 0.000 | 10.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15 | 9.69% | 31.929 | 35.022 | 0.001 | 9.70% | 0.001 |
| MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_PRESSURE_CALC_RAW/171 | 9.57% | 1.238 | 1.357 | 0.000 | 9.58% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC4 | 9.09% | 7.862 | 8.577 | 0.000 | 9.08% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC4 | 9.08% | 7.863 | 8.577 | 0.000 | 9.09% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, LessThanZero, First> | 9.00% | 2292.980 | 2499.450 | 0.013 | 9.01% | 0.013 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51 | 9.00% | 87.207 | 95.055 | 0.003 | 9.01% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999 | 8.82% | 1343.096 | 1461.573 | 0.058 | 8.83% | 0.058 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/171 | 8.77% | 0.456 | 0.496 | 0.000 | 8.77% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16 | 8.75% | 34.177 | 37.166 | 0.001 | 8.87% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999 | 8.70% | 2652.657 | 2883.326 | 0.029 | 8.93% | 0.029 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<15, GreaterThanZero, First> | 8.68% | 4497.401 | 4887.583 | 6.088 | 8.68% | 6.088 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256 | 8.58% | 699.048 | 759.010 | 0.022 | 8.58% | 0.022 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28 | 8.53% | 58.609 | 63.609 | 0.003 | 8.54% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3 | 8.33% | 8.577 | 9.291 | 0.000 | 8.33% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256 | 7.87% | 371.687 | 400.956 | 0.009 | 7.88% | 0.009 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51 | 7.86% | 154.395 | 166.533 | 0.003 | 7.87% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/51 | 7.69% | 55.751 | 60.036 | 0.002 | 7.68% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8 | 7.68% | 18.585 | 20.013 | 0.000 | 7.69% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28 | 7.62% | 93.635 | 100.771 | 0.003 | 7.63% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int64_t_ | 7.54% | 802225.917 | 862734.568 | 1266.558 | 7.78% | 1266.558 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127 | 7.25% | 142.001 | 152.298 | 0.066 | 1.66% | 0.066 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3 | 7.14% | 10.006 | 10.721 | 0.000 | 7.14% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC7 | 7.14% | 10.007 | 10.721 | 0.000 | 7.14% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC7 | 7.14% | 10.007 | 10.721 | 0.000 | 7.14% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16 | 6.89% | 62.183 | 66.467 | 0.002 | 6.90% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC32 | 6.76% | 153.312 | 163.676 | 0.002 | 6.74% | 0.002 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/44217 | 6.42% | 123.966 | 131.927 | 0.551 | 6.07% | 0.551 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC4 | 6.24% | 11.437 | 12.151 | 0.000 | 6.25% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/10 | 6.24% | 34.310 | 36.451 | 0.001 | 6.24% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC31 | 5.97% | 149.732 | 158.669 | 0.003 | 5.97% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7 | 5.89% | 16.874 | 17.868 | 0.000 | 5.32% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int32_t_ | 5.71% | 564049.513 | 596262.887 | 1806.670 | 6.80% | 1806.670 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63 | 5.65% | 302.388 | 319.468 | 0.013 | 5.66% | 0.013 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51 | 5.63% | 128.662 | 135.907 | 0.136 | 5.64% | 0.136 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_ | 5.63% | 574445.733 | 606792.174 | 1226.071 | 5.42% | 1226.071 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC64 | 5.40% | 307.199 | 323.777 | 0.002 | 5.40% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC127 | 5.37% | 615.235 | 648.265 | 0.020 | 5.62% | 0.020 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC128 | 5.36% | 618.643 | 651.829 | 0.042 | 5.37% | 0.042 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127 | 5.22% | 160.099 | 168.458 | 1.072 | 4.29% | 1.072 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC8 | 5.05% | 41.502 | 43.599 | 0.001 | 5.06% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128 | 5.01% | 153.737 | 161.446 | 0.453 | 2.66% | 0.453 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127 | 4.94% | 149.848 | 157.244 | 0.002 | 5.08% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10 | 4.54% | 47.176 | 49.316 | 0.002 | 4.54% | 0.002 |
| MultiSource/Benchmarks/DOE-ProxyApps-C++/HPCCG/HPCCG | 4.40% | 5.973 | 6.235 | 0.050 | -0.26% | 0.050 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, First> | 4.24% | 8433.618 | 8791.578 | 11.527 | 4.33% | 11.527 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC4 | 4.08% | 23.349 | 24.300 | 0.001 | 4.08% | 0.001 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last> | 4.01% | 36608.430 | 38077.739 | 30.262 | 4.01% | 30.262 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC15 | 3.94% | 75.639 | 78.623 | 0.001 | 3.95% | 0.001 |
| MultiSource/Applications/siod/siod | 3.92% | 5.462 | 5.676 | 0.009 | 4.19% | 0.009 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10 | 3.92% | 36.451 | 37.880 | 0.001 | 3.92% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC16 | 3.70% | 80.640 | 83.625 | 0.001 | 3.70% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8 | 3.57% | 20.014 | 20.728 | 0.000 | 3.57% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/28 | 3.56% | 60.043 | 62.182 | 0.001 | 3.57% | 0.001 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None> | 3.50% | 42459.635 | 43945.211 | 21.387 | 3.50% | 21.387 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128 | 3.44% | 155.459 | 160.805 | 0.005 | 5.07% | 0.005 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, Mid> | 3.36% | 17569.424 | 18160.148 | 14.618 | 3.36% | 14.618 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Last> | 3.36% | 29278.514 | 30262.696 | 24.558 | 3.36% | 24.558 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/999 | 3.33% | 21.443 | 22.156 | 0.000 | 3.33% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/16 | 3.33% | 42.887 | 44.314 | 0.001 | 3.33% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/51 | 3.33% | 21.444 | 22.157 | 0.000 | 3.33% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/16 | 3.32% | 21.444 | 22.157 | 0.001 | 3.33% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/10 | 3.32% | 21.444 | 22.157 | 0.001 | 3.33% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/256 | 3.32% | 21.444 | 22.156 | 0.001 | 3.33% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/28 | 3.32% | 21.445 | 22.157 | 0.000 | 3.33% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last> | 3.31% | 29285.457 | 30254.127 | 28.627 | 3.33% | 28.627 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127 | 3.29% | 151.536 | 156.529 | 0.021 | 3.29% | 0.021 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None> | 3.28% | 30252.777 | 31243.614 | 16.866 | 3.27% | 16.866 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, LessThanZero, Mid> | 3.25% | 22697.792 | 23436.289 | 1.316 | 3.26% | 1.316 |
| MultiSource/Benchmarks/PAQ8p/paq8p | 3.25% | 131.471 | 135.747 | 0.041 | 3.36% | 0.041 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid> | 3.24% | 13389.358 | 13823.738 | 1.548 | 3.25% | 1.548 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28 | 3.22% | 22.157 | 22.871 | 0.000 | 3.22% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51 | 3.22% | 22.156 | 22.870 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/16 | 3.22% | 22.157 | 22.871 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16 | 3.22% | 22.157 | 22.871 | 0.001 | 3.23% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/10 | 3.22% | 22.157 | 22.870 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/999 | 3.22% | 22.158 | 22.872 | 0.001 | 3.23% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10 | 3.22% | 22.158 | 22.871 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/51 | 3.21% | 22.158 | 22.871 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/28 | 3.21% | 22.158 | 22.871 | 0.000 | 3.22% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999 | 3.21% | 22.159 | 22.871 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/256 | 3.21% | 22.159 | 22.870 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256 | 3.21% | 22.159 | 22.870 | 0.001 | 3.22% | 0.001 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, Mid> | 3.15% | 13390.252 | 13812.332 | 7.229 | 3.15% | 7.229 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last> | 3.11% | 23432.052 | 24161.328 | 15.780 | 3.11% | 15.780 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid> | 3.11% | 15610.119 | 16095.075 | 10.620 | 3.11% | 10.620 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256 | 3.07% | 248.022 | 255.643 | 1.319 | 3.12% | 1.319 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First> | 3.06% | 5689.577 | 5863.571 | 0.136 | 3.19% | 0.136 |
| MicroBenchmarks/Builtins/Int128/Builtins.test:BM_DivideIntrinsic128UniformDivisor<__uint128_t> | 3.03% | 37.366 | 38.500 | 0.033 | 3.10% | 0.033 |
| MultiSource/Benchmarks/Fhourstones/fhourstones | 2.97% | 2.315 | 2.384 | 0.014 | 2.28% | 0.014 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, Last> | 2.96% | 19913.073 | 20502.460 | 11.116 | 2.97% | 11.116 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, Mid> | 2.93% | 12451.620 | 12816.349 | 7.943 | 2.94% | 7.943 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid> | 2.92% | 12452.733 | 12816.108 | 7.811 | 2.92% | 7.811 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<32, EqZero, Mid> | 2.90% | 3120.096 | 3210.647 | 0.904 | 2.91% | 0.904 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None> | 2.86% | 20496.443 | 21081.642 | 13.830 | 2.86% | 13.830 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, Last> | 2.81% | 20505.375 | 21081.255 | 18.551 | 2.81% | 18.551 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, None> | 2.78% | 21082.844 | 21668.431 | 15.231 | 2.78% | 15.231 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Last> | 2.77% | 17558.267 | 18044.302 | 7.564 | 2.77% | 7.564 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None> | 2.73% | 18044.671 | 18536.665 | 11.110 | 2.73% | 11.110 |
| MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_DEL_DOT_VEC_2D_LAMBDA/2 | 2.68% | 2.345 | 2.408 | 0.019 | 2.68% | 0.019 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, None> | 2.56% | 16315.224 | 16732.734 | 10.629 | 2.56% | 10.629 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last> | 2.55% | 16317.879 | 16734.668 | 6.645 | 2.56% | 6.645 |
| MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1 | 2.45% | 2.435 | 2.494 | 0.004 | 2.78% | 0.004 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, LessThanZero, None> | 2.37% | 15379.732 | 15744.518 | 7.401 | 2.38% | 7.401 |
| MultiSource/Benchmarks/nbench/nbench | 2.37% | 3.780 | 3.870 | 0.003 | 1.54% | 0.003 |
| External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r | 2.26% | 101.116 | 103.399 | 0.120 | 1.61% | 0.120 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/10 | 2.21% | 32.165 | 32.877 | 0.001 | 2.22% | 0.001 |
| MultiSource/Applications/SPASS/SPASS | 2.14% | 25.528 | 26.073 | 0.065 | 2.33% | 0.065 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256 | 2.02% | 460.311 | 469.596 | 2.660 | 2.50% | 2.660 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16 | 1.99% | 35.740 | 36.452 | 0.001 | 2.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10 | 1.99% | 35.740 | 36.451 | 0.002 | 2.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/10 | 1.92% | 36.466 | 37.166 | 0.001 | 1.96% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10 | 1.91% | 37.170 | 37.880 | 0.001 | 1.91% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC127 | 1.90% | 151.534 | 154.410 | 0.964 | 1.90% | 0.964 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC15 | 1.90% | 141.211 | 143.890 | 0.181 | 1.82% | 0.181 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128 | 1.80% | 150.949 | 153.670 | 0.003 | 0.61% | 0.003 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001 | 1.70% | 14.283 | 14.526 | 0.011 | 2.57% | 0.011 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_expf_novec_float_ | 1.66% | 455.753 | 463.330 | 0.071 | 1.61% | 0.071 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_asinf_autovec_float_ | 1.65% | 385.458 | 391.824 | 0.264 | 1.66% | 0.264 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC64 | 1.64% | 307.209 | 312.246 | 0.254 | 1.64% | 0.254 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int64_t_ | 1.61% | 918763.469 | 933550.067 | 5009.141 | 1.23% | 5009.141 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28 | 1.60% | 44.319 | 45.029 | 0.001 | 1.61% | 0.001 |
| MultiSource/Benchmarks/Rodinia/backprop/backprop | 1.55% | 2.444 | 2.482 | 0.010 | -0.35% | 0.010 |
| MultiSource/Applications/obsequi/Obsequi | 1.47% | 4.211 | 4.273 | 0.007 | 1.14% | 0.007 |
| External/SPEC/CINT2017rate/525.x264_r/525.x264_r | 1.35% | 122.593 | 124.242 | 0.038 | 1.58% | 0.038 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, None> | 1.34% | 5187.961 | 5257.455 | 2.374 | 1.35% | 2.374 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256 | 1.33% | 213.011 | 215.848 | 0.004 | 1.34% | 0.004 |
| SingleSource/Benchmarks/Linpack/linpack-pc | 1.32% | 8.230 | 8.339 | 0.027 | 1.33% | 0.027 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, Mid> | 1.15% | 4952.398 | 5009.226 | 1.371 | 1.15% | 1.371 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, GreaterThanZero, None> | 1.15% | 3023.055 | 3057.698 | 0.366 | 1.15% | 0.366 |