|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
52.17% |
32.878 |
50.031 |
0.001 |
52.16% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
51.12% |
63.376 |
95.774 |
0.003 |
51.12% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
50.00% |
17.630 |
26.445 |
0.000 |
49.99% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
50.00% |
33.833 |
50.748 |
0.002 |
50.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
49.99% |
18.585 |
27.874 |
0.001 |
50.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
49.98% |
64.329 |
96.482 |
0.006 |
49.98% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
46.66% |
10.721 |
15.724 |
0.000 |
46.64% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
44.60% |
5.437 |
7.862 |
0.000 |
44.65% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
43.74% |
11.436 |
16.439 |
0.000 |
43.74% |
0.000 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/171
|
32.07% |
0.375 |
0.495 |
0.006 |
32.06% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/999
|
28.04% |
1694.687 |
2169.950 |
4.828 |
28.04% |
4.828 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/999
|
27.53% |
864.127 |
1102.042 |
0.037 |
27.53% |
0.037 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256
|
26.68% |
455.288 |
576.774 |
0.023 |
26.68% |
0.023 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
25.00% |
8.577 |
10.721 |
0.000 |
24.99% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
24.71% |
63.613 |
79.333 |
0.001 |
24.70% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/256
|
24.71% |
245.870 |
306.613 |
0.009 |
24.69% |
0.009 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, EqZero, First>
|
23.07% |
11899.193 |
14644.174 |
0.598 |
23.08% |
0.598 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/51
|
22.81% |
106.498 |
130.793 |
0.001 |
22.81% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
22.42% |
157.627 |
192.975 |
4.112 |
22.04% |
4.112 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/28
|
21.87% |
45.744 |
55.747 |
0.002 |
21.86% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
21.79% |
157.279 |
191.549 |
0.014 |
21.80% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/28
|
19.99% |
67.901 |
81.475 |
0.003 |
19.98% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint8_t_
|
19.99% |
11927.403 |
14311.280 |
1.535 |
19.97% |
1.535 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint8_t_To_uint32_t_
|
19.96% |
11927.276 |
14307.840 |
2.077 |
19.95% |
2.077 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint16_t_To_uint8_t_
|
19.95% |
11929.396 |
14308.741 |
2.626 |
19.93% |
2.626 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
19.91% |
7.153 |
8.577 |
0.000 |
19.92% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC2
|
19.89% |
7.154 |
8.577 |
0.000 |
19.81% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC2
|
19.89% |
7.153 |
8.576 |
0.000 |
19.93% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint8_t_
|
19.88% |
11929.633 |
14301.706 |
4.688 |
19.87% |
4.688 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint8_t_To_uint32_t_
|
19.86% |
11932.733 |
14301.978 |
0.224 |
19.84% |
0.224 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW16From_uint16_t_To_uint32_t_
|
19.84% |
11934.538 |
14301.835 |
5.972 |
19.84% |
5.972 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint32_t_
|
19.79% |
11938.374 |
14301.453 |
0.308 |
19.78% |
0.308 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/5001
|
17.40% |
12.185 |
14.305 |
0.000 |
16.85% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/65
|
16.44% |
82.912 |
96.541 |
0.627 |
16.43% |
0.627 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/16
|
16.41% |
47.890 |
55.747 |
0.002 |
16.40% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/16
|
15.38% |
37.167 |
42.883 |
0.001 |
15.27% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC64
|
14.29% |
64.327 |
73.520 |
0.774 |
14.28% |
0.774 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int64_t_
|
13.77% |
688601.378 |
783393.498 |
1312.453 |
13.39% |
1312.453 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/16
|
13.63% |
31.449 |
35.735 |
0.001 |
13.63% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/10
|
13.21% |
37.882 |
42.884 |
0.001 |
13.19% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC1
|
12.49% |
5.718 |
6.433 |
0.000 |
12.50% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC128
|
12.04% |
255.166 |
285.893 |
0.293 |
12.04% |
0.293 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC64
|
11.81% |
125.285 |
140.086 |
0.003 |
11.80% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC63
|
11.77% |
123.417 |
137.948 |
0.001 |
11.77% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC127
|
11.55% |
253.736 |
283.034 |
2.063 |
11.54% |
2.063 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/65
|
11.38% |
144.381 |
160.817 |
0.003 |
11.38% |
0.003 |
|
MultiSource/Benchmarks/Trimaran/enc-pc1/enc-pc1
Profile
|
11.18% |
0.572 |
0.636 |
0.000 |
11.26% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32
|
11.11% |
64.329 |
71.473 |
0.001 |
11.11% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/10
|
10.86% |
32.879 |
36.450 |
0.001 |
10.86% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/28
|
10.71% |
40.027 |
44.313 |
0.002 |
10.70% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/127
|
10.59% |
270.175 |
298.778 |
8.289 |
10.57% |
8.289 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
10.58% |
62.697 |
69.329 |
0.001 |
10.80% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/16
|
10.17% |
42.169 |
46.457 |
0.002 |
10.16% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/10
|
10.00% |
28.589 |
31.448 |
0.001 |
9.99% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
9.99% |
7.148 |
7.862 |
0.000 |
9.99% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15
|
9.70% |
31.926 |
35.022 |
0.001 |
9.69% |
0.001 |
|
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_PRESSURE_CALC_RAW/171
|
9.58% |
1.238 |
1.357 |
0.000 |
9.57% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC4
|
9.09% |
7.862 |
8.577 |
0.000 |
9.09% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC4
|
9.08% |
7.862 |
8.577 |
0.000 |
9.08% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/51
|
9.01% |
87.198 |
95.055 |
0.003 |
9.00% |
0.003 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, LessThanZero, First>
|
9.01% |
2292.901 |
2499.450 |
0.013 |
9.00% |
0.013 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/999
|
8.94% |
2646.752 |
2883.326 |
0.029 |
8.70% |
0.029 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/999
|
8.83% |
1342.967 |
1461.573 |
0.058 |
8.82% |
0.058 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/171
|
8.78% |
0.456 |
0.496 |
0.000 |
8.77% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
8.70% |
34.192 |
37.166 |
0.001 |
8.75% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<15, GreaterThanZero, First>
|
8.68% |
4497.054 |
4887.583 |
6.088 |
8.68% |
6.088 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/256
|
8.59% |
698.991 |
759.010 |
0.022 |
8.58% |
0.022 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/28
|
8.53% |
58.608 |
63.609 |
0.003 |
8.53% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
8.33% |
8.577 |
9.291 |
0.000 |
8.33% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
7.90% |
141.152 |
152.298 |
0.066 |
7.25% |
0.066 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/256
|
7.88% |
371.663 |
400.956 |
0.009 |
7.87% |
0.009 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/51
|
7.87% |
154.382 |
166.533 |
0.003 |
7.86% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC8
|
7.69% |
18.583 |
20.013 |
0.000 |
7.68% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/51
|
7.69% |
55.750 |
60.036 |
0.002 |
7.69% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28
|
7.63% |
93.630 |
100.771 |
0.003 |
7.62% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int64_t_
|
7.37% |
803530.495 |
862734.568 |
1266.558 |
7.54% |
1266.558 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC7
|
7.14% |
10.006 |
10.721 |
0.000 |
7.14% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC7
|
7.14% |
10.007 |
10.721 |
0.000 |
7.14% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
7.14% |
10.006 |
10.721 |
0.000 |
7.14% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16
|
6.89% |
62.183 |
66.467 |
0.002 |
6.89% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC32
|
6.78% |
153.284 |
163.676 |
0.002 |
6.76% |
0.002 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIND_FIRST_MIN_LAMBDA/44217
|
6.39% |
124.001 |
131.927 |
0.551 |
6.42% |
0.551 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC4
|
6.25% |
11.436 |
12.151 |
0.000 |
6.24% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/10
|
6.25% |
34.308 |
36.451 |
0.001 |
6.24% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC31
|
5.97% |
149.727 |
158.669 |
0.003 |
5.97% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC63
|
5.66% |
302.366 |
319.468 |
0.013 |
5.65% |
0.013 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7
|
5.65% |
16.912 |
17.868 |
0.000 |
5.89% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint64_t>/51
|
5.64% |
128.651 |
135.907 |
0.136 |
5.63% |
0.136 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC127
|
5.57% |
614.035 |
648.265 |
0.020 |
5.37% |
0.020 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_int32_t_
|
5.46% |
575353.279 |
606792.174 |
1226.071 |
5.63% |
1226.071 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC64
|
5.39% |
307.205 |
323.777 |
0.002 |
5.40% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC128
|
5.32% |
618.914 |
651.829 |
0.042 |
5.36% |
0.042 |
|
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
5.22% |
160.103 |
168.458 |
1.072 |
5.22% |
1.072 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
5.20% |
152.853 |
160.805 |
0.005 |
3.44% |
0.005 |
|
MultiSource/Benchmarks/TSVC/Expansion-flt/Expansion-flt
Profile
|
5.19% |
9.144 |
9.619 |
0.042 |
-0.93% |
0.042 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC8
|
5.09% |
41.489 |
43.599 |
0.001 |
5.05% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
4.95% |
149.829 |
157.244 |
0.002 |
4.94% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_int32_t_
|
4.87% |
568563.255 |
596262.887 |
1806.670 |
5.71% |
1806.670 |
|
MultiSource/Benchmarks/DOE-ProxyApps-C++/HPCCG/HPCCG
Profile
|
4.82% |
5.949 |
6.235 |
0.050 |
4.40% |
0.050 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10
|
4.54% |
47.172 |
49.316 |
0.002 |
4.54% |
0.002 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, First>
|
4.25% |
8433.038 |
8791.578 |
11.527 |
4.24% |
11.527 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC4
|
4.08% |
23.348 |
24.300 |
0.001 |
4.08% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, Last>
|
4.02% |
36604.717 |
38077.739 |
30.262 |
4.01% |
30.262 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC15
|
3.95% |
75.635 |
78.623 |
0.001 |
3.94% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint64_t>/10
|
3.92% |
36.452 |
37.880 |
0.001 |
3.92% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC16
|
3.70% |
80.640 |
83.625 |
0.001 |
3.70% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/28
|
3.57% |
60.038 |
62.182 |
0.001 |
3.56% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
3.57% |
20.013 |
20.728 |
0.000 |
3.57% |
0.000 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None>
|
3.51% |
42457.060 |
43945.211 |
21.387 |
3.50% |
21.387 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, Last>
|
3.37% |
29275.336 |
30262.696 |
24.558 |
3.36% |
24.558 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, Mid>
|
3.37% |
17568.947 |
18160.148 |
14.618 |
3.36% |
14.618 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
3.34% |
29276.002 |
30254.127 |
28.627 |
3.31% |
28.627 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/51
|
3.34% |
21.442 |
22.157 |
0.000 |
3.33% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint32_t>/16
|
3.33% |
42.884 |
44.314 |
0.001 |
3.33% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/16
|
3.33% |
21.442 |
22.157 |
0.001 |
3.32% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/28
|
3.33% |
21.442 |
22.157 |
0.000 |
3.32% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/10
|
3.33% |
21.443 |
22.157 |
0.001 |
3.32% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/999
|
3.33% |
21.442 |
22.156 |
0.000 |
3.33% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint32_t>/256
|
3.33% |
21.442 |
22.156 |
0.001 |
3.32% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, None>
|
3.28% |
30252.074 |
31243.614 |
16.866 |
3.28% |
16.866 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
3.27% |
151.571 |
156.529 |
0.021 |
3.29% |
0.021 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, LessThanZero, Mid>
|
3.27% |
22695.221 |
23436.289 |
1.316 |
3.25% |
1.316 |
|
MultiSource/Benchmarks/PAQ8p/paq8p
Profile
|
3.26% |
131.466 |
135.747 |
0.041 |
3.25% |
0.041 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, GreaterThanZero, Mid>
|
3.25% |
13388.257 |
13823.738 |
1.548 |
3.24% |
1.548 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/256
|
3.23% |
247.634 |
255.643 |
1.319 |
3.07% |
1.319 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/999
|
3.23% |
22.157 |
22.872 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/256
|
3.22% |
22.156 |
22.870 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/28
|
3.22% |
22.157 |
22.871 |
0.000 |
3.21% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/10
|
3.22% |
22.156 |
22.870 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/16
|
3.22% |
22.157 |
22.871 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint16_t>/51
|
3.22% |
22.157 |
22.871 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/Builtins/Int128/Builtins.test:BM_DivideIntrinsic128UniformDivisor<__uint128_t>
|
3.21% |
37.302 |
38.500 |
0.033 |
3.03% |
0.033 |
|
MultiSource/Applications/siod/siod
Profile
|
3.18% |
5.502 |
5.676 |
0.009 |
3.92% |
0.009 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, Mid>
|
3.16% |
13389.866 |
13812.332 |
7.229 |
3.15% |
7.229 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, Last>
|
3.12% |
23430.393 |
24161.328 |
15.780 |
3.11% |
15.780 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, EqZero, Mid>
|
3.12% |
15608.353 |
16095.075 |
10.620 |
3.11% |
10.620 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
3.10% |
22.183 |
22.871 |
0.000 |
3.22% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
3.10% |
22.184 |
22.871 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
3.10% |
22.184 |
22.871 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
3.09% |
22.184 |
22.870 |
0.001 |
3.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
3.09% |
22.185 |
22.871 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
3.09% |
22.184 |
22.870 |
0.001 |
3.22% |
0.001 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, First>
|
3.07% |
5689.158 |
5863.571 |
0.136 |
3.06% |
0.136 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, Last>
|
2.97% |
19911.279 |
20502.460 |
11.116 |
2.96% |
11.116 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, GreaterThanZero, Mid>
|
2.94% |
12450.409 |
12816.349 |
7.943 |
2.93% |
7.943 |
|
MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1
Profile
|
2.93% |
2.423 |
2.494 |
0.004 |
2.45% |
0.004 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, EqZero, Mid>
|
2.93% |
12451.622 |
12816.108 |
7.811 |
2.92% |
7.811 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<32, EqZero, Mid>
|
2.91% |
3119.798 |
3210.647 |
0.904 |
2.90% |
0.904 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, None>
|
2.86% |
20495.345 |
21081.642 |
13.830 |
2.86% |
13.830 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, Last>
|
2.86% |
20494.993 |
21081.255 |
18.551 |
2.81% |
18.551 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, Last>
|
2.78% |
17556.210 |
18044.302 |
7.564 |
2.77% |
7.564 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, None>
|
2.76% |
21086.615 |
21668.431 |
15.231 |
2.78% |
15.231 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, GreaterThanZero, None>
|
2.73% |
18043.821 |
18536.665 |
11.110 |
2.73% |
11.110 |
|
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/5001
|
2.67% |
14.148 |
14.526 |
0.011 |
1.70% |
0.011 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, None>
|
2.57% |
16313.942 |
16732.734 |
10.629 |
2.56% |
10.629 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last>
|
2.56% |
16316.496 |
16734.668 |
6.645 |
2.55% |
6.645 |
|
MultiSource/Benchmarks/nbench/nbench
Profile
|
2.51% |
3.775 |
3.870 |
0.003 |
2.37% |
0.003 |
|
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_DEL_DOT_VEC_2D_LAMBDA/2
|
2.50% |
2.349 |
2.408 |
0.019 |
2.68% |
0.019 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, LessThanZero, None>
|
2.38% |
15378.735 |
15744.518 |
7.401 |
2.37% |
7.401 |
|
MultiSource/Benchmarks/Fhourstones/fhourstones
Profile
|
2.25% |
2.331 |
2.384 |
0.014 |
2.97% |
0.014 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/10
|
2.22% |
32.163 |
32.877 |
0.001 |
2.21% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/16
|
2.00% |
35.737 |
36.452 |
0.001 |
1.99% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10
|
2.00% |
35.736 |
36.451 |
0.002 |
1.99% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
1.96% |
158.343 |
161.446 |
0.453 |
5.01% |
0.453 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/10
|
1.96% |
36.452 |
37.166 |
0.001 |
1.92% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC15
|
1.92% |
141.181 |
143.890 |
0.181 |
1.90% |
0.181 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10
|
1.92% |
37.167 |
37.880 |
0.001 |
1.91% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC127
|
1.90% |
151.528 |
154.410 |
0.964 |
1.90% |
0.964 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128
|
1.81% |
150.936 |
153.670 |
0.003 |
1.80% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_expf_novec_float_
|
1.78% |
455.226 |
463.330 |
0.071 |
1.66% |
0.071 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_asinf_autovec_float_
|
1.65% |
385.468 |
391.824 |
0.264 |
1.65% |
0.264 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC64
|
1.65% |
307.186 |
312.246 |
0.254 |
1.64% |
0.254 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint16_t>/28
|
1.62% |
44.313 |
45.029 |
0.001 |
1.60% |
0.001 |
|
MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt
Profile
|
1.61% |
7.771 |
7.896 |
0.045 |
0.14% |
0.045 |
|
MultiSource/Benchmarks/TSVC/Recurrences-dbl/Recurrences-dbl
Profile
|
1.53% |
5.636 |
5.722 |
0.013 |
-0.34% |
0.013 |
|
External/SPEC/CINT2017rate/525.x264_r/525.x264_r
Profile
|
1.50% |
122.401 |
124.242 |
0.038 |
1.35% |
0.038 |
|
MultiSource/Applications/obsequi/Obsequi
Profile
|
1.35% |
4.216 |
4.273 |
0.007 |
1.47% |
0.007 |
|
External/SPEC/CINT2017rate/500.perlbench_r/500.perlbench_r
Profile
|
1.34% |
102.029 |
103.399 |
0.120 |
2.26% |
0.120 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256
|
1.34% |
213.002 |
215.848 |
0.004 |
1.33% |
0.004 |
|
SingleSource/Benchmarks/Linpack/linpack-pc
Profile
|
1.32% |
8.230 |
8.339 |
0.027 |
1.32% |
0.027 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, None>
|
1.31% |
5189.621 |
5257.455 |
2.374 |
1.34% |
2.374 |
|
External/SPEC/CFP2017rate/511.povray_r/511.povray_r
Profile
|
1.25% |
16.967 |
17.180 |
0.033 |
0.69% |
0.033 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, GreaterThanZero, None>
|
1.24% |
3020.179 |
3057.698 |
0.366 |
1.15% |
0.366 |
|
MultiSource/Applications/SPASS/SPASS
Profile
|
1.21% |
25.762 |
26.073 |
0.065 |
2.14% |
0.065 |
|
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, Mid>
|
1.15% |
4952.512 |
5009.226 |
1.371 |
1.15% |
1.371 |