|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
52.16% |
32.881 |
50.030 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
51.12% |
63.377 |
95.774 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
49.99% |
18.583 |
27.874 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
49.99% |
33.832 |
50.746 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
49.99% |
17.631 |
26.445 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
49.99% |
64.331 |
96.488 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
46.64% |
10.722 |
15.724 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
44.65% |
5.435 |
7.862 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
43.74% |
11.437 |
16.439 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/171
|
32.97% |
0.375 |
0.498 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/999
|
28.03% |
1694.794 |
2169.819 |
0.044 |
0.00% |
0.044 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/999
|
27.53% |
864.174 |
1102.055 |
0.022 |
0.00% |
0.022 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256
|
26.68% |
455.316 |
576.790 |
0.005 |
0.00% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
24.99% |
8.577 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
24.70% |
63.617 |
79.331 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/256
|
24.69% |
245.891 |
306.601 |
0.009 |
0.00% |
0.009 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, EqZero, First>
|
23.08% |
11898.197 |
14644.455 |
1.773 |
0.00% |
1.773 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/51
|
22.81% |
106.499 |
130.793 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
22.04% |
158.128 |
192.972 |
0.297 |
0.00% |
0.297 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/28
|
21.87% |
45.747 |
55.749 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
21.80% |
157.264 |
191.544 |
0.014 |
0.00% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/28
|
19.99% |
67.905 |
81.478 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint8_t_
|
19.95% |
11930.824 |
14311.572 |
1.429 |
0.00% |
1.429 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint8_t_
|
19.95% |
11930.806 |
14311.011 |
1.780 |
0.00% |
1.780 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC2
|
19.93% |
7.151 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
19.92% |
7.152 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
19.90% |
11928.215 |
14301.980 |
4.856 |
0.00% |
4.856 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint8_t_
|
19.89% |
11928.735 |
14301.825 |
5.651 |
0.00% |
5.651 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint32_t_
|
19.84% |
11934.120 |
14302.243 |
4.408 |
0.00% |
4.408 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint8_t_To_uint32_t_
|
19.84% |
11933.958 |
14301.766 |
0.096 |
0.00% |
0.096 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC2
|
19.81% |
7.159 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint32_t_
|
19.78% |
11940.022 |
14301.551 |
0.355 |
0.00% |
0.355 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/5001
|
16.85% |
12.242 |
14.305 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/16
|
16.40% |
47.892 |
55.746 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/65
|
15.43% |
82.916 |
95.712 |
1.474 |
0.00% |
1.474 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/16
|
15.27% |
37.203 |
42.884 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC64
|
14.03% |
64.331 |
73.357 |
0.874 |
0.00% |
0.874 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/16
|
13.63% |
31.450 |
35.736 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int64_t_
|
13.40% |
690906.219 |
783473.684 |
509.520 |
0.00% |
509.520 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/10
|
13.19% |
37.886 |
42.882 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC1
|
12.50% |
5.718 |
6.433 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
12.04% |
255.173 |
285.893 |
4.743 |
0.00% |
4.743 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
11.81% |
125.296 |
140.088 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
11.77% |
123.420 |
137.945 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
11.55% |
253.747 |
283.046 |
0.031 |
0.00% |
0.031 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/65
|
11.38% |
144.391 |
160.817 |
0.001 |
0.00% |
0.001 |
|
MultiSource/Benchmarks/Trimaran/enc-pc1/enc-pc1
|
11.28% |
0.571 |
0.636 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
11.11% |
64.328 |
71.473 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/10
|
10.86% |
32.880 |
36.451 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
10.80% |
62.572 |
69.328 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/28
|
10.70% |
40.028 |
44.312 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/127
|
10.56% |
270.216 |
298.762 |
9.212 |
0.00% |
9.212 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
10.00% |
28.590 |
31.448 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
10.00% |
7.148 |
7.862 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
9.69% |
31.929 |
35.022 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_PRESSURE_CALC_RAW/171
|
9.58% |
1.238 |
1.357 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC4
|
9.09% |
7.862 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC4
|
9.08% |
7.863 |
8.577 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
9.00% |
87.207 |
95.059 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, LessThanZero, First>
|
9.00% |
2292.980 |
2499.327 |
0.082 |
0.00% |
0.082 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
8.83% |
1343.096 |
1461.657 |
4.709 |
0.00% |
4.709 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/171
|
8.77% |
0.456 |
0.496 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
8.74% |
34.177 |
37.165 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
8.69% |
2652.657 |
2883.247 |
2.760 |
0.00% |
2.760 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<15, GreaterThanZero, First>
|
8.67% |
4497.401 |
4887.115 |
0.389 |
0.00% |
0.389 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
8.58% |
699.048 |
759.025 |
0.007 |
0.00% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
8.53% |
58.609 |
63.609 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
8.33% |
8.577 |
9.291 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
7.87% |
371.687 |
400.951 |
0.010 |
0.00% |
0.010 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
7.86% |
154.395 |
166.533 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC1
|
7.86% |
4.786 |
5.162 |
0.056 |
0.00% |
0.056 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int64_t_
|
7.74% |
802225.917 |
864289.246 |
754.707 |
0.00% |
754.707 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/51
|
7.69% |
55.751 |
60.038 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
7.68% |
18.585 |
20.012 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
7.63% |
93.635 |
100.776 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
7.25% |
142.001 |
152.303 |
0.056 |
0.00% |
0.056 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
7.14% |
10.006 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC7
|
7.14% |
10.007 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC7
|
7.13% |
10.007 |
10.721 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC32
|
6.76% |
153.312 |
163.671 |
0.003 |
0.00% |
0.003 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/44217
|
6.69% |
123.966 |
132.265 |
0.480 |
0.00% |
0.480 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC4
|
6.24% |
11.437 |
12.150 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/10
|
6.24% |
34.310 |
36.451 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC31
|
5.97% |
149.732 |
158.668 |
0.005 |
0.00% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
5.89% |
16.874 |
17.868 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
5.65% |
302.388 |
319.478 |
0.007 |
0.00% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
5.62% |
128.662 |
135.890 |
0.133 |
0.00% |
0.133 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC64
|
5.40% |
307.199 |
323.777 |
0.004 |
0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC127
|
5.37% |
615.235 |
648.272 |
0.021 |
0.00% |
0.021 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC128
|
5.37% |
618.643 |
651.861 |
0.028 |
0.00% |
0.028 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
5.20% |
153.737 |
161.729 |
0.274 |
0.00% |
0.274 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC8
|
5.05% |
41.502 |
43.597 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
4.95% |
160.099 |
168.022 |
0.711 |
0.00% |
0.711 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
4.94% |
149.848 |
157.247 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int32_t_
|
4.68% |
564049.513 |
590443.692 |
4873.838 |
0.00% |
4873.838 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC7
|
4.66% |
36.936 |
38.659 |
0.648 |
0.00% |
0.648 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C++/HPCCG/HPCCG
|
4.66% |
5.973 |
6.251 |
0.033 |
0.00% |
0.033 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
4.53% |
47.176 |
49.314 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
4.37% |
574445.733 |
599526.091 |
4290.456 |
0.00% |
4290.456 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, First>
|
4.24% |
8433.618 |
8790.897 |
2.971 |
0.00% |
2.971 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC4
|
4.08% |
23.349 |
24.301 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
4.03% |
248.022 |
258.010 |
0.009 |
0.00% |
0.009 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
3.99% |
36608.430 |
38067.540 |
3.375 |
0.00% |
3.375 |
|
MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt
|
3.98% |
7.831 |
8.143 |
0.119 |
0.00% |
0.119 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC15
|
3.94% |
75.639 |
78.619 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
3.92% |
36.451 |
37.880 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC16
|
3.70% |
80.640 |
83.623 |
0.002 |
0.00% |
0.002 |
|
MultiSource/Applications/siod/siod
|
3.64% |
5.462 |
5.661 |
0.060 |
0.00% |
0.060 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
3.57% |
20.014 |
20.727 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/28
|
3.56% |
60.043 |
62.179 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128
|
3.55% |
147.019 |
152.241 |
0.868 |
0.00% |
0.868 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
3.44% |
155.459 |
160.805 |
0.006 |
0.00% |
0.006 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
3.44% |
42459.635 |
43919.182 |
0.942 |
0.00% |
0.942 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/999
|
3.33% |
21.443 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/16
|
3.33% |
42.887 |
44.314 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, Mid>
|
3.32% |
17569.424 |
18153.391 |
0.226 |
0.00% |
0.226 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Last>
|
3.32% |
29278.514 |
30251.124 |
0.512 |
0.00% |
0.512 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/10
|
3.32% |
21.444 |
22.157 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/16
|
3.32% |
21.444 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/51
|
3.32% |
21.444 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/256
|
3.32% |
21.444 |
22.156 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/28
|
3.32% |
21.445 |
22.157 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
3.30% |
29285.457 |
30251.901 |
0.524 |
0.00% |
0.524 |
|
MultiSource/Benchmarks/PAQ8p/paq8p
|
3.29% |
131.471 |
135.792 |
0.062 |
0.00% |
0.062 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
3.23% |
22.156 |
22.871 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
3.23% |
22.157 |
22.871 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/16
|
3.22% |
22.157 |
22.871 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/10
|
3.22% |
22.157 |
22.871 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
3.22% |
22.158 |
22.871 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/51
|
3.22% |
22.158 |
22.872 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/256
|
3.22% |
22.159 |
22.872 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
3.22% |
30252.777 |
31226.133 |
4.905 |
0.00% |
4.905 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/999
|
3.22% |
22.158 |
22.871 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/28
|
3.22% |
22.158 |
22.871 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
3.22% |
22.159 |
22.871 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
3.22% |
22.159 |
22.872 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, LessThanZero, Mid>
|
3.21% |
22697.792 |
23426.998 |
0.789 |
0.00% |
0.789 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, EqZero, First>
|
3.18% |
5682.439 |
5862.988 |
0.052 |
0.00% |
0.052 |
|
MultiSource/Benchmarks/Fhourstones/fhourstones
|
3.15% |
2.315 |
2.388 |
0.010 |
0.00% |
0.010 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, Mid>
|
3.11% |
13390.252 |
13807.022 |
0.234 |
0.00% |
0.234 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
3.10% |
13389.358 |
13804.947 |
0.613 |
0.00% |
0.613 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
3.10% |
23432.052 |
24159.085 |
0.732 |
0.00% |
0.732 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
3.09% |
15610.119 |
16093.177 |
0.250 |
0.00% |
0.250 |
|
MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_MULADDSUB_LAMBDA/44217
|
3.09% |
327.274 |
337.387 |
1.657 |
0.00% |
1.657 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
3.06% |
5689.577 |
5863.605 |
0.106 |
0.00% |
0.106 |
|
MicroBenchmarks/Builtins/Int128/Builtins.test:BM_DivideIntrinsic128UniformDivisor<__uint128_t>
|
2.96% |
37.366 |
38.472 |
0.136 |
0.00% |
0.136 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, Last>
|
2.93% |
19913.073 |
20495.842 |
0.247 |
0.00% |
0.247 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, Mid>
|
2.92% |
12451.620 |
12815.101 |
0.460 |
0.00% |
0.460 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
2.91% |
12452.733 |
12815.442 |
4.373 |
0.00% |
4.373 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<32, EqZero, Mid>
|
2.88% |
3120.096 |
3210.043 |
0.269 |
0.00% |
0.269 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256
|
2.86% |
460.311 |
473.467 |
1.119 |
0.00% |
1.119 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
2.84% |
20496.443 |
21079.481 |
0.839 |
0.00% |
0.839 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, Last>
|
2.80% |
20505.375 |
21079.958 |
0.856 |
0.00% |
0.856 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, None>
|
2.77% |
21082.844 |
21665.831 |
0.335 |
0.00% |
0.335 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Last>
|
2.76% |
17558.267 |
18042.921 |
0.443 |
0.00% |
0.443 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC1
|
2.76% |
12.416 |
12.758 |
0.007 |
0.00% |
0.007 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
2.69% |
18044.671 |
18530.151 |
3.262 |
0.00% |
3.262 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC1
|
2.66% |
4.988 |
5.121 |
0.014 |
0.00% |
0.014 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
|
2.64% |
2.435 |
2.499 |
0.002 |
0.00% |
0.002 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, None>
|
2.56% |
16315.224 |
16732.443 |
0.053 |
0.00% |
0.053 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
2.55% |
16317.879 |
16733.972 |
0.207 |
0.00% |
0.207 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, LessThanZero, None>
|
2.36% |
15379.732 |
15742.752 |
0.648 |
0.00% |
0.648 |
|
MultiSource/Applications/SPASS/SPASS
|
2.35% |
25.528 |
26.127 |
0.105 |
0.00% |
0.105 |
|
MultiSource/Benchmarks/nbench/nbench
|
2.29% |
3.780 |
3.867 |
0.009 |
0.00% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/10
|
2.21% |
32.165 |
32.877 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
1.99% |
35.740 |
36.452 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
1.99% |
35.740 |
36.450 |
0.001 |
0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/10
|
1.92% |
36.466 |
37.166 |
0.001 |
0.00% |
0.001 |
|
External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r
|
1.91% |
101.116 |
103.051 |
0.199 |
0.00% |
0.199 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10
|
1.91% |
37.170 |
37.880 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int64_t_
|
1.90% |
918763.469 |
936233.645 |
1594.463 |
0.00% |
1594.463 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_asinf_autovec_float_
|
1.76% |
385.458 |
392.230 |
0.118 |
0.00% |
0.118 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
1.70% |
14.283 |
14.526 |
0.006 |
0.00% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_expf_novec_float_
|
1.69% |
455.753 |
463.433 |
0.130 |
0.00% |
0.130 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
1.60% |
44.319 |
45.028 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC64
|
1.55% |
307.209 |
311.956 |
0.174 |
0.00% |
0.174 |
|
MultiSource/Applications/obsequi/Obsequi
|
1.43% |
4.211 |
4.271 |
0.010 |
0.00% |
0.010 |
|
External/SPEC/CINT2017rate/525.x264_r/525.x264_r
|
1.36% |
122.593 |
124.256 |
0.012 |
0.00% |
0.012 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
1.33% |
213.011 |
215.844 |
0.004 |
0.00% |
0.004 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, None>
|
1.20% |
5187.961 |
5250.015 |
2.095 |
0.00% |
2.095 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/999
|
1.15% |
867.023 |
876.997 |
0.338 |
0.00% |
0.338 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC128
|
1.10% |
287.894 |
291.053 |
0.596 |
0.00% |
0.596 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, Mid>
|
1.02% |
4952.398 |
5003.146 |
1.878 |
0.00% |
1.878 |