|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
52.17% |
32.878 |
50.031 |
0.001 |
52.16% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
51.12% |
63.375 |
95.774 |
0.003 |
51.12% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
50.00% |
33.832 |
50.748 |
0.002 |
50.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
50.00% |
18.583 |
27.874 |
0.001 |
50.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
50.00% |
17.631 |
26.445 |
0.000 |
49.99% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
49.98% |
64.329 |
96.482 |
0.006 |
49.98% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
46.66% |
10.721 |
15.724 |
0.000 |
46.64% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
44.50% |
5.441 |
7.862 |
0.000 |
44.65% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
43.75% |
11.436 |
16.439 |
0.000 |
43.74% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/171
|
32.06% |
0.375 |
0.495 |
0.006 |
32.06% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/999
|
28.04% |
1694.680 |
2169.950 |
4.828 |
28.04% |
4.828 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/999
|
27.53% |
864.152 |
1102.042 |
0.037 |
27.53% |
0.037 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256
|
26.68% |
455.302 |
576.774 |
0.023 |
26.68% |
0.023 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
24.99% |
8.577 |
10.721 |
0.000 |
24.99% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
24.71% |
63.614 |
79.333 |
0.001 |
24.70% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/256
|
24.70% |
245.878 |
306.613 |
0.009 |
24.69% |
0.009 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, EqZero, First>
|
23.05% |
11900.564 |
14644.174 |
0.598 |
23.08% |
0.598 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/51
|
22.81% |
106.500 |
130.793 |
0.001 |
22.81% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
22.51% |
157.514 |
192.975 |
4.112 |
22.04% |
4.112 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/28
|
21.87% |
45.743 |
55.747 |
0.002 |
21.86% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
20.18% |
159.386 |
191.549 |
0.014 |
21.80% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/28
|
19.99% |
67.904 |
81.475 |
0.003 |
19.98% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint8_t_
|
19.98% |
11927.904 |
14311.280 |
1.535 |
19.97% |
1.535 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint8_t_To_uint32_t_
|
19.98% |
11920.545 |
14301.978 |
0.224 |
19.84% |
0.224 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
19.96% |
11927.651 |
14307.840 |
2.077 |
19.95% |
2.077 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint8_t_
|
19.94% |
11929.718 |
14308.741 |
2.626 |
19.93% |
2.626 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC2
|
19.93% |
7.151 |
8.577 |
0.000 |
19.81% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC2
|
19.89% |
7.154 |
8.576 |
0.000 |
19.93% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint8_t_
|
19.88% |
11930.121 |
14301.706 |
4.688 |
19.87% |
4.688 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
19.86% |
7.155 |
8.577 |
0.000 |
19.92% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint32_t_
|
19.83% |
11935.088 |
14301.835 |
5.972 |
19.84% |
5.972 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint32_t_
|
19.76% |
11942.082 |
14301.453 |
0.308 |
19.78% |
0.308 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/5001
|
17.41% |
12.184 |
14.305 |
0.000 |
16.85% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/65
|
16.44% |
82.911 |
96.541 |
0.627 |
16.43% |
0.627 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/16
|
16.41% |
47.890 |
55.747 |
0.002 |
16.40% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/16
|
15.38% |
37.167 |
42.883 |
0.001 |
15.27% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC64
|
14.29% |
64.329 |
73.520 |
0.774 |
14.28% |
0.774 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int64_t_
|
13.29% |
691487.636 |
783393.498 |
1312.453 |
13.39% |
1312.453 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/10
|
13.20% |
37.883 |
42.884 |
0.001 |
13.19% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC1
|
12.50% |
5.718 |
6.433 |
0.000 |
12.50% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
12.04% |
255.162 |
285.893 |
0.293 |
12.04% |
0.293 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
11.82% |
125.282 |
140.086 |
0.003 |
11.80% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/16
|
11.80% |
31.965 |
35.735 |
0.001 |
13.63% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
11.78% |
123.415 |
137.948 |
0.001 |
11.77% |
0.001 |
|
MultiSource/Benchmarks/Trimaran/enc-pc1/enc-pc1
Profile
|
11.65% |
0.569 |
0.636 |
0.000 |
11.26% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
11.55% |
253.731 |
283.034 |
2.063 |
11.54% |
2.063 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/65
|
11.38% |
144.382 |
160.817 |
0.003 |
11.38% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
11.11% |
64.328 |
71.473 |
0.001 |
11.11% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/10
|
10.86% |
32.879 |
36.450 |
0.001 |
10.86% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
10.64% |
62.659 |
69.329 |
0.001 |
10.80% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/127
|
10.59% |
270.170 |
298.778 |
8.289 |
10.57% |
8.289 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16
|
10.17% |
42.170 |
46.457 |
0.002 |
10.16% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
10.00% |
7.147 |
7.862 |
0.000 |
9.99% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
9.70% |
31.926 |
35.022 |
0.001 |
9.69% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_PRESSURE_CALC_RAW/171
|
9.58% |
1.238 |
1.357 |
0.000 |
9.57% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/28
|
9.34% |
40.526 |
44.313 |
0.002 |
10.70% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC4
|
9.09% |
7.862 |
8.577 |
0.000 |
9.09% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC4
|
9.08% |
7.862 |
8.577 |
0.000 |
9.08% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
9.01% |
87.199 |
95.055 |
0.003 |
9.00% |
0.003 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/171
|
8.78% |
0.456 |
0.496 |
0.000 |
8.77% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
8.72% |
34.186 |
37.166 |
0.001 |
8.75% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
8.70% |
2652.544 |
2883.326 |
0.029 |
8.70% |
0.029 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<15, GreaterThanZero, First>
|
8.66% |
4498.215 |
4887.583 |
6.088 |
8.68% |
6.088 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, LessThanZero, First>
|
8.65% |
2300.551 |
2499.450 |
0.013 |
9.00% |
0.013 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
8.58% |
699.015 |
759.010 |
0.022 |
8.58% |
0.022 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
8.53% |
58.610 |
63.609 |
0.003 |
8.53% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
8.33% |
8.577 |
9.291 |
0.000 |
8.33% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
8.31% |
1349.404 |
1461.573 |
0.058 |
8.82% |
0.058 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
8.16% |
140.802 |
152.298 |
0.066 |
7.25% |
0.066 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
7.88% |
371.682 |
400.956 |
0.009 |
7.87% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
7.87% |
154.385 |
166.533 |
0.003 |
7.86% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
7.69% |
18.583 |
20.013 |
0.000 |
7.68% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
7.62% |
93.633 |
100.771 |
0.003 |
7.62% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int64_t_
|
7.41% |
803220.436 |
862734.568 |
1266.558 |
7.54% |
1266.558 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
7.14% |
10.006 |
10.721 |
0.000 |
7.14% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC7
|
7.14% |
10.006 |
10.721 |
0.000 |
7.14% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC7
|
7.14% |
10.006 |
10.721 |
0.000 |
7.14% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16
|
6.89% |
62.183 |
66.467 |
0.002 |
6.89% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/51
|
6.69% |
56.269 |
60.036 |
0.002 |
7.69% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC32
|
6.33% |
153.936 |
163.676 |
0.002 |
6.76% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/44217
|
5.99% |
124.475 |
131.927 |
0.551 |
6.42% |
0.551 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC31
|
5.97% |
149.731 |
158.669 |
0.003 |
5.97% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int32_t_
|
5.74% |
563881.356 |
596262.887 |
1806.670 |
5.71% |
1806.670 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
5.66% |
302.356 |
319.468 |
0.013 |
5.65% |
0.013 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
5.64% |
128.656 |
135.907 |
0.136 |
5.63% |
0.136 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
5.62% |
16.917 |
17.868 |
0.000 |
5.89% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC64
|
5.40% |
307.187 |
323.777 |
0.002 |
5.40% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC127
|
5.37% |
615.236 |
648.265 |
0.020 |
5.37% |
0.020 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC128
|
5.37% |
618.638 |
651.829 |
0.042 |
5.36% |
0.042 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
5.26% |
29.875 |
31.448 |
0.001 |
9.99% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
5.08% |
160.313 |
168.458 |
1.072 |
5.22% |
1.072 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
4.93% |
149.850 |
157.244 |
0.002 |
4.94% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC8
|
4.87% |
41.575 |
43.599 |
0.001 |
5.05% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/10
|
4.70% |
34.814 |
36.451 |
0.001 |
6.24% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
4.54% |
47.173 |
49.316 |
0.002 |
4.54% |
0.002 |
|
MultiSource/Applications/siod/siod
Profile
|
4.54% |
5.430 |
5.676 |
0.009 |
3.92% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
4.40% |
581206.151 |
606792.174 |
1226.071 |
5.63% |
1226.071 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, First>
|
4.25% |
8433.231 |
8791.578 |
11.527 |
4.24% |
11.527 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC4
|
4.08% |
23.349 |
24.300 |
0.001 |
4.08% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC15
|
3.95% |
75.638 |
78.623 |
0.001 |
3.94% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
3.93% |
36638.940 |
38077.739 |
30.262 |
4.01% |
30.262 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
3.92% |
36.452 |
37.880 |
0.001 |
3.92% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC16
|
3.70% |
80.639 |
83.625 |
0.001 |
3.70% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
3.68% |
155.100 |
160.805 |
0.005 |
3.44% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
3.59% |
246.789 |
255.643 |
1.319 |
3.07% |
1.319 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
3.57% |
20.013 |
20.728 |
0.000 |
3.57% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
3.50% |
42460.302 |
43945.211 |
21.387 |
3.50% |
21.387 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/16
|
3.33% |
21.443 |
22.157 |
0.001 |
3.32% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/51
|
3.33% |
21.444 |
22.157 |
0.000 |
3.33% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/28
|
3.33% |
21.443 |
22.157 |
0.000 |
3.32% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/999
|
3.33% |
21.443 |
22.156 |
0.000 |
3.33% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/10
|
3.33% |
21.443 |
22.157 |
0.001 |
3.32% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/256
|
3.32% |
21.443 |
22.156 |
0.001 |
3.32% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
3.31% |
156.273 |
161.446 |
0.453 |
5.01% |
0.453 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
3.30% |
151.526 |
156.529 |
0.021 |
3.29% |
0.021 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
3.27% |
30253.349 |
31243.614 |
16.866 |
3.28% |
16.866 |
|
MultiSource/Benchmarks/PAQ8p/paq8p
Profile
|
3.27% |
131.448 |
135.747 |
0.041 |
3.25% |
0.041 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, Mid>
|
3.25% |
17588.392 |
18160.148 |
14.618 |
3.36% |
14.618 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
3.25% |
13389.017 |
13823.738 |
1.548 |
3.24% |
1.548 |
|
MicroBenchmarks/Builtins/Int128/Builtins.test:BM_DivideIntrinsic128UniformDivisor<__uint128_t>
|
3.24% |
37.291 |
38.500 |
0.033 |
3.03% |
0.033 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
3.23% |
29308.727 |
30254.127 |
28.627 |
3.31% |
28.627 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
3.22% |
22.157 |
22.871 |
0.000 |
3.22% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/16
|
3.22% |
22.157 |
22.871 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/999
|
3.22% |
22.158 |
22.872 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/51
|
3.22% |
22.157 |
22.871 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/10
|
3.22% |
22.157 |
22.870 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/256
|
3.22% |
22.157 |
22.870 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
3.22% |
22.157 |
22.871 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/28
|
3.22% |
22.157 |
22.871 |
0.000 |
3.21% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
3.22% |
22.157 |
22.870 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
3.22% |
22.157 |
22.871 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
3.22% |
22.157 |
22.871 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
3.22% |
22.157 |
22.870 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
3.17% |
5683.582 |
5863.571 |
0.136 |
3.06% |
0.136 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, LessThanZero, Mid>
|
3.16% |
22719.270 |
23436.289 |
1.316 |
3.25% |
1.316 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Last>
|
3.14% |
29340.224 |
30262.696 |
24.558 |
3.36% |
24.558 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
3.08% |
23438.875 |
24161.328 |
15.780 |
3.11% |
15.780 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
2.98% |
15629.456 |
16095.075 |
10.620 |
3.11% |
10.620 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, Mid>
|
2.95% |
13416.898 |
13812.332 |
7.229 |
3.15% |
7.229 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<32, EqZero, Mid>
|
2.91% |
3119.857 |
3210.647 |
0.904 |
2.90% |
0.904 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, Last>
|
2.90% |
19924.536 |
20502.460 |
11.116 |
2.96% |
11.116 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
2.86% |
20496.384 |
21081.642 |
13.830 |
2.86% |
13.830 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, Mid>
|
2.81% |
12466.585 |
12816.349 |
7.943 |
2.93% |
7.943 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, None>
|
2.79% |
21080.889 |
21668.431 |
15.231 |
2.78% |
15.231 |
|
MultiSource/Applications/SPASS/SPASS
Profile
|
2.74% |
25.378 |
26.073 |
0.065 |
2.14% |
0.065 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, Last>
|
2.73% |
20520.789 |
21081.255 |
18.551 |
2.81% |
18.551 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
2.73% |
18044.106 |
18536.665 |
11.110 |
2.73% |
11.110 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
2.73% |
12475.958 |
12816.108 |
7.811 |
2.92% |
7.811 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Last>
|
2.68% |
17573.148 |
18044.302 |
7.564 |
2.77% |
7.564 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/28
|
2.64% |
60.583 |
62.182 |
0.001 |
3.56% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, None>
|
2.56% |
16314.912 |
16732.734 |
10.629 |
2.56% |
10.629 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
2.53% |
16321.679 |
16734.668 |
6.645 |
2.55% |
6.645 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
Profile
|
2.51% |
2.433 |
2.494 |
0.004 |
2.45% |
0.004 |
|
MultiSource/Benchmarks/Fhourstones/fhourstones
Profile
|
2.47% |
2.326 |
2.384 |
0.014 |
2.97% |
0.014 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, LessThanZero, None>
|
2.38% |
15378.751 |
15744.518 |
7.401 |
2.37% |
7.401 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
2.32% |
14.197 |
14.526 |
0.011 |
1.70% |
0.011 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/10
|
2.22% |
32.164 |
32.877 |
0.001 |
2.21% |
0.001 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C/Pathfinder/PathFinder
Profile
|
2.20% |
7.062 |
7.218 |
0.018 |
0.01% |
0.018 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_ICCG_RAW/44217
|
2.12% |
290.281 |
296.436 |
1.506 |
0.54% |
1.506 |
|
External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r
Profile
|
2.05% |
101.318 |
103.399 |
0.120 |
2.26% |
0.120 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
2.00% |
35.738 |
36.452 |
0.001 |
1.99% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
1.99% |
35.738 |
36.451 |
0.002 |
1.99% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/10
|
1.96% |
36.452 |
37.166 |
0.001 |
1.92% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/16
|
1.94% |
43.470 |
44.314 |
0.001 |
3.33% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10
|
1.92% |
37.167 |
37.880 |
0.001 |
1.91% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC127
|
1.90% |
151.530 |
154.410 |
0.964 |
1.90% |
0.964 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128
|
1.83% |
150.909 |
153.670 |
0.003 |
1.80% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC15
|
1.79% |
141.355 |
143.890 |
0.181 |
1.90% |
0.181 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_expf_novec_float_
|
1.77% |
455.262 |
463.330 |
0.071 |
1.66% |
0.071 |
|
MultiSource/Benchmarks/mafft/pairlocalalign
Profile
|
1.74% |
47.083 |
47.903 |
0.009 |
0.74% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC64
|
1.65% |
307.181 |
312.246 |
0.254 |
1.64% |
0.254 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_asinf_autovec_float_
|
1.64% |
385.487 |
391.824 |
0.264 |
1.65% |
0.264 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
1.61% |
44.315 |
45.029 |
0.001 |
1.60% |
0.001 |
|
External/SPEC/CINT2017rate/525.x264_r/525.x264_r
Profile
|
1.52% |
122.386 |
124.242 |
0.038 |
1.35% |
0.038 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/256
|
1.51% |
462.618 |
469.596 |
2.660 |
2.02% |
2.660 |
|
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_2D_RAW/5001
|
1.39% |
2222.796 |
2253.697 |
9.085 |
0.69% |
9.085 |
|
SingleSource/Benchmarks/Linpack/linpack-pc
Profile
|
1.36% |
8.226 |
8.339 |
0.027 |
1.32% |
0.027 |
|
MultiSource/Benchmarks/nbench/nbench
Profile
|
1.36% |
3.818 |
3.870 |
0.003 |
2.37% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
1.34% |
212.998 |
215.848 |
0.004 |
1.33% |
0.004 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, None>
|
1.33% |
5188.480 |
5257.455 |
2.374 |
1.34% |
2.374 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, GreaterThanZero, None>
|
1.20% |
3021.344 |
3057.698 |
0.366 |
1.15% |
0.366 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, Mid>
|
1.11% |
4954.280 |
5009.226 |
1.371 |
1.15% |
1.371 |
|
MultiSource/Applications/JM/lencod/lencod
Profile
|
1.08% |
14.464 |
14.621 |
0.019 |
0.77% |
0.019 |
|
External/SPEC/CINT2017rate/531.deepsjeng_r/531.deepsjeng_r
Profile
|
1.06% |
137.122 |
138.574 |
0.098 |
0.83% |
0.098 |