| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint8_t>/65 | -63.25% | 106.976 | 39.311 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint8_t>/127 | -63.05% | 199.895 | 73.858 | 0.001 | 0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/127 | -43.75% | 224.906 | 126.505 | 0.002 | -0.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/65 | -42.49% | 120.555 | 69.327 | 0.002 | 0.00% | 0.002 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, LessThanZero, None> | -33.29% | 8789.089 | 5863.513 | 0.043 | 0.00% | 0.043 |
| MicroBenchmarks/SLPVectorization/SLPVectorizationBenchmarks.test:benchmark_xor_runtime_checks_pass<16, int> | -21.77% | 19.487 | 15.245 | 0.002 | -0.02% | 0.002 |
| MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIRST_DIFF_RAW/171 | -21.65% | 0.373 | 0.292 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIRST_DIFF_LAMBDA/171 | -21.65% | 0.373 | 0.292 | 0.001 | -0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/10 | -21.08% | 13.580 | 10.717 | 0.002 | -0.04% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_novec_uint8_t_ | -18.48% | 266476.607 | 217241.155 | 3.441 | -0.00% | 3.441 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_only_autovec_uint8_t_ | -18.45% | 266398.575 | 217235.878 | 5.836 | -0.00% | 5.836 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/10 | -18.39% | 20.727 | 16.916 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint8_t>/65 | -18.36% | 79.675 | 65.043 | 0.001 | -0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint32_t_To_uint8_t_ | -16.98% | 21452.470 | 17809.409 | 5.103 | 0.00% | 5.103 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW16From_uint32_t_To_uint8_t_ | -16.98% | 21452.133 | 17810.182 | 5.179 | -0.00% | 5.179 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint32_t_To_uint8_t_ | -16.97% | 21452.484 | 17811.996 | 3.885 | 0.02% | 3.885 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/16 | -16.03% | 17.869 | 15.005 | 0.002 | 0.02% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint8_t>/10 | -14.97% | 35.023 | 29.780 | 0.001 | -0.00% | 0.001 |
| MultiSource/Benchmarks/VersaBench/8b10b/8b10b | -14.44% | 11.418 | 9.769 | 0.024 | 0.01% | 0.024 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC7 | -14.34% | 74.581 | 63.883 | 0.004 | -0.00% | 0.004 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC7 | -14.34% | 74.572 | 63.881 | 0.003 | -0.01% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC7 | -14.30% | 74.543 | 63.883 | 0.000 | -0.01% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/28 | -14.30% | 5.004 | 4.288 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/51 | -14.29% | 5.003 | 4.288 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/256 | -14.29% | 5.003 | 4.288 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/10 | -14.29% | 5.003 | 4.288 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/999 | -14.29% | 5.003 | 4.288 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/16 | -14.29% | 5.003 | 4.288 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/10 | -13.34% | 21.443 | 18.583 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/16 | -13.00% | 29.304 | 25.493 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint8_t>/16 | -12.79% | 52.180 | 45.505 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/16 | -11.91% | 30.019 | 26.444 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/SLPVectorization/SLPVectorizationBenchmarks.test:benchmark_add_xor_no_runtime_checks_needed<4, int> | -11.11% | 6.433 | 5.718 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, EqZero, First> | -11.06% | 2934.092 | 2609.455 | 17.627 | -0.00% | 17.627 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint8_t>/28 | -11.03% | 86.491 | 76.952 | 0.164 | 0.00% | 0.164 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/28 | -10.89% | 26.446 | 23.565 | 0.004 | -0.04% | 0.004 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/28 | -10.61% | 47.174 | 42.169 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint8_t>/51 | -9.86% | 152.243 | 137.229 | 0.001 | 0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/51 | -9.70% | 78.624 | 70.996 | 0.001 | -0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3 | -9.09% | 7.862 | 7.147 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3 | -9.09% | 7.862 | 7.147 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC4 | -8.34% | 8.577 | 7.862 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, EqZero, First> | -8.32% | 10041.817 | 9206.302 | 0.064 | 0.00% | 0.064 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<6, LessThanZero, First> | -8.31% | 11704.657 | 10731.829 | 0.103 | 0.01% | 0.103 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint8_t>/999 | -8.25% | 2867.515 | 2630.818 | 0.067 | -0.00% | 0.067 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint8_t>/256 | -8.21% | 744.063 | 682.963 | 0.012 | 0.01% | 0.012 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/28 | -8.20% | 46.457 | 42.646 | 0.001 | -0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/999 | -8.16% | 1445.209 | 1327.230 | 0.013 | -0.00% | 0.013 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint8_t>/256 | -7.80% | 384.527 | 354.517 | 0.012 | -0.01% | 0.012 |
| SingleSource/Benchmarks/Linpack/linpack-pc | -7.71% | 10.102 | 9.323 | 0.002 | 0.01% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/51 | -7.35% | 42.169 | 39.072 | 0.000 | -0.00% | 0.000 |
| MultiSource/Benchmarks/Ptrdist/yacr2/yacr2 | -7.30% | 1.313 | 1.217 | 0.000 | -0.02% | 0.000 |
| MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIRST_DIFF_RAW/5001 | -6.99% | 10.731 | 9.981 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIRST_DIFF_LAMBDA/5001 | -6.98% | 10.730 | 9.981 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_autovec_int32_t_ | -6.67% | 561647.153 | 524177.745 | 8695.408 | 0.05% | 8695.408 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/256 | -6.67% | 3.574 | 3.335 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/16 | -6.67% | 3.574 | 3.335 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/10 | -6.67% | 3.574 | 3.335 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/28 | -6.67% | 3.574 | 3.335 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/51 | -6.67% | 3.574 | 3.335 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC7 | -6.67% | 10.721 | 10.006 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/999 | -6.66% | 3.574 | 3.335 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC7 | -6.66% | 10.720 | 10.006 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8 | -6.26% | 11.436 | 10.721 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8 | -6.25% | 11.435 | 10.721 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_uint8_t_ | -5.72% | 418651.107 | 394696.167 | 1474.465 | -0.22% | 1474.465 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC64 | -5.71% | 148.567 | 140.084 | 0.003 | -0.00% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_novec_int32_t_ | -5.70% | 561686.699 | 529697.398 | 6133.366 | -0.11% | 6133.366 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_uint8_t_ | -5.69% | 418628.144 | 394826.185 | 1505.361 | -0.21% | 1505.361 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC15 | -5.56% | 136.533 | 128.935 | 0.008 | -0.01% | 0.008 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC15 | -5.55% | 136.524 | 128.945 | 0.002 | -0.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC15 | -5.35% | 136.246 | 128.952 | 0.010 | -0.00% | 0.010 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/10 | -5.31% | 31.450 | 29.781 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/10 | -5.01% | 14.296 | 13.580 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_single_load<uint16_t>/51 | -4.72% | 79.336 | 75.588 | 0.017 | -0.01% | 0.017 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, LessThanZero, First> | -4.35% | 22448.289 | 21471.413 | 0.007 | -0.00% | 0.007 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256 | -4.17% | 5.718 | 5.480 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10 | -4.17% | 5.718 | 5.480 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999 | -4.17% | 5.718 | 5.479 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51 | -4.17% | 5.718 | 5.480 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28 | -4.17% | 5.718 | 5.480 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16 | -4.17% | 5.718 | 5.480 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC8 | -3.98% | 79.062 | 75.917 | 0.081 | 0.03% | 0.081 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, First> | -3.85% | 38067.863 | 36601.516 | 1.286 | 0.00% | 1.286 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/16 | -3.85% | 18.585 | 17.869 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, LessThanZero, Mid> | -3.84% | 25373.681 | 24398.557 | 0.136 | -0.00% | 0.136 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC8 | -3.84% | 78.995 | 75.965 | 0.027 | 0.03% | 0.027 |
| MultiSource/Benchmarks/DOE-ProxyApps-C++/HPCCG/HPCCG | -3.74% | 6.529 | 6.285 | 0.059 | 0.14% | 0.059 |
| MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_2D_RAW/171 | -3.71% | 72.524 | 69.831 | 0.032 | 3.44% | 0.032 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/10 | -3.58% | 20.013 | 19.298 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC4 | -3.43% | 35.208 | 33.999 | 0.002 | -0.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC4 | -3.43% | 35.207 | 33.999 | 0.002 | 0.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC4 | -3.42% | 35.206 | 34.000 | 0.001 | 0.00% | 0.001 |
| MultiSource/Benchmarks/mafft/pairlocalalign | -3.41% | 48.630 | 46.970 | 0.015 | 0.19% | 0.015 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, GreaterThanZero, None> | -3.34% | 43922.951 | 42454.876 | 4.302 | -0.01% | 4.302 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1BigLoopWithReductionTC2 | -3.33% | 14.294 | 13.818 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC2 | -3.33% | 14.294 | 13.818 | 0.000 | 0.00% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, Mid> | -3.22% | 18152.659 | 17567.850 | 0.471 | 0.00% | 0.471 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last> | -3.22% | 30251.470 | 29276.955 | 0.069 | 0.00% | 0.069 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, LessThanZero, Mid> | -3.22% | 22695.575 | 21965.107 | 4.623 | -0.00% | 4.623 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, LessThanZero, None> | -3.13% | 31228.532 | 30250.864 | 0.357 | 0.00% | 0.357 |
| External/SPEC/CFP2017rate/511.povray_r/511.povray_r | -3.12% | 14.556 | 14.101 | 0.020 | -0.20% | 0.020 |
| MultiSource/Benchmarks/MallocBench/espresso/espresso | -3.04% | 0.856 | 0.830 | 0.001 | -0.06% | 0.001 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, LessThanZero, Mid> | -2.94% | 6229.534 | 6046.517 | 0.083 | 0.00% | 0.083 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, EqZero, Mid> | -2.92% | 6228.741 | 6046.673 | 0.553 | -0.00% | 0.553 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, Mid> | -2.92% | 6228.581 | 6046.612 | 0.136 | -0.00% | 0.136 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<32, GreaterThanZero, Mid> | -2.89% | 3117.092 | 3026.857 | 0.015 | -0.00% | 0.015 |
| MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_2D_RAW/44217 | -2.83% | 29443.083 | 28609.458 | 102.958 | -1.02% | 102.958 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, LessThanZero, None> | -2.78% | 21080.824 | 20495.169 | 0.337 | 0.00% | 0.337 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, None> | -2.78% | 21080.908 | 20495.506 | 0.307 | -0.00% | 0.307 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<7, LessThanZero, Last> | -2.57% | 16314.055 | 15895.479 | 1.915 | 0.00% | 1.915 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC16 | -2.52% | 143.401 | 139.784 | 0.077 | 0.00% | 0.077 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC16 | -2.52% | 143.388 | 139.778 | 0.074 | 0.01% | 0.074 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/16 | -2.50% | 28.590 | 27.875 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC16 | -2.49% | 143.348 | 139.778 | 0.079 | -0.00% | 0.079 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, LessThanZero, First> | -2.47% | 2336.374 | 2278.566 | 0.003 | 0.00% | 0.003 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, EqZero, First> | -2.41% | 1157.128 | 1129.266 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<8, LessThanZero, None> | -2.38% | 15377.264 | 15011.581 | 0.053 | -0.00% | 0.053 |
| MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_2D_RAW/5001 | -2.37% | 2221.206 | 2168.500 | 10.559 | -1.51% | 10.559 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<15, LessThanZero, Mid> | -2.37% | 8202.592 | 8008.524 | 0.143 | -0.00% | 0.143 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128 | -2.24% | 158.655 | 155.095 | 0.008 | -0.00% | 0.008 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128 | -2.24% | 158.646 | 155.095 | 0.013 | -0.01% | 0.013 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128 | -2.24% | 158.652 | 155.101 | 0.025 | 0.00% | 0.025 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128 | -2.23% | 158.652 | 155.110 | 0.013 | -0.01% | 0.013 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC128 | -2.23% | 158.647 | 155.109 | 0.022 | -0.00% | 0.022 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128 | -2.23% | 158.652 | 155.116 | 0.013 | 0.00% | 0.013 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128 | -2.22% | 158.640 | 155.111 | 0.010 | -0.01% | 0.010 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/10 | -2.18% | 32.880 | 32.162 | 0.001 | -0.00% | 0.001 |
| MicroBenchmarks/ImageProcessing/Dither/Dither.test:BENCHMARK_FLOYD_DITHER/128 | -2.11% | 291.887 | 285.715 | 1.059 | -0.28% | 1.059 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/16 | -2.05% | 46.457 | 45.505 | 0.000 | -0.00% | 0.000 |
| MicroBenchmarks/ImageProcessing/Dither/Dither.test:BENCHMARK_FLOYD_DITHER/512 | -1.98% | 4884.924 | 4788.027 | 2.964 | -0.28% | 2.964 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/44217 | -1.93% | 139.313 | 136.620 | 1.039 | -2.55% | 1.039 |
| MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_VOL3D_CALC_LAMBDA/0 | -1.91% | 902.295 | 885.061 | 2.636 | -0.84% | 2.636 |
| MicroBenchmarks/ImageProcessing/Dither/Dither.test:BENCHMARK_FLOYD_DITHER/256 | -1.90% | 1191.569 | 1168.927 | 1.667 | -0.03% | 1.667 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<64, GreaterThanZero, Last> | -1.82% | 2477.986 | 2432.946 | 0.041 | -0.00% | 0.041 |
| MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_VOL3D_CALC_RAW/0 | -1.81% | 903.853 | 887.467 | 1.620 | -0.44% | 1.620 |
| MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_2D_LAMBDA/5001 | -1.81% | 2184.647 | 2145.070 | 0.789 | 0.90% | 0.789 |
| MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_IMP_HYDRO_2D_RAW/44217 | -1.77% | 6420.138 | 6306.518 | 29.052 | -3.93% | 29.052 |
| MultiSource/Applications/hexxagon/hexxagon | -1.75% | 6.241 | 6.132 | 0.001 | -0.03% | 0.001 |
| MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_INIT3_LAMBDA/44217 | -1.73% | 232.813 | 228.791 | 0.275 | -2.66% | 0.275 |
| MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_VOL3D_CALC_LAMBDA/2 | -1.63% | 6.882 | 6.770 | 0.002 | -0.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_uint8_t_ | -1.59% | 708505.071 | 697231.537 | 1506.822 | -0.26% | 1506.822 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint32_t>/28 | -1.57% | 45.745 | 45.029 | 0.000 | -0.00% | 0.000 |
| SingleSource/Benchmarks/Misc/ReedSolomon | -1.56% | 9.492 | 9.344 | 0.001 | 0.00% | 0.001 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, LessThanZero, None> | -1.55% | 5255.366 | 5174.051 | 1.188 | 0.01% | 1.188 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC2 | -1.47% | 19.337 | 19.053 | 0.004 | -0.01% | 0.004 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_uint8_t_ | -1.46% | 709532.653 | 699166.834 | 1322.326 | 0.14% | 1322.326 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC2 | -1.45% | 19.340 | 19.059 | 0.001 | -0.01% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint32_t>/16 | -1.43% | 50.032 | 49.316 | 0.001 | -0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC2 | -1.41% | 19.341 | 19.068 | 0.003 | -0.02% | 0.003 |
| SingleSource/Benchmarks/Adobe-C++/stepanov_vector | -1.38% | 5.797 | 5.717 | 0.000 | -0.01% | 0.000 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, GreaterThanZero, Mid> | -1.32% | 2559.835 | 2525.929 | 2.007 | -0.11% | 2.007 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, GreaterThanZero, None> | -1.31% | 3045.714 | 3005.739 | 0.816 | -0.05% | 0.816 |
| MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, EqZero, None> | -1.28% | 3047.028 | 3008.049 | 0.200 | 0.01% | 0.200 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC31 | -1.26% | 276.096 | 272.628 | 0.018 | -0.01% | 0.018 |
| MultiSource/Benchmarks/DOE-ProxyApps-C/Pathfinder/PathFinder | -1.26% | 6.372 | 6.292 | 0.019 | -0.42% | 0.019 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC31 | -1.24% | 276.089 | 272.666 | 0.025 | -0.00% | 0.025 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint64_t>/28 | -1.23% | 77.911 | 76.953 | 0.001 | 0.00% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC31 | -1.23% | 276.078 | 272.692 | 0.008 | 0.01% | 0.008 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC3 | -1.20% | 27.489 | 27.160 | 0.002 | -0.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC3 | -1.20% | 27.489 | 27.160 | 0.002 | 0.00% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopTC3 | -1.19% | 27.488 | 27.160 | 0.002 | 0.00% | 0.002 |
| External/SPEC/CINT2017rate/505.mcf_r/505.mcf_r | -1.19% | 150.782 | 148.990 | 0.320 | -0.22% | 0.320 |
| MultiSource/Benchmarks/Ptrdist/anagram/anagram | -1.11% | 1.869 | 1.848 | 0.001 | 0.03% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/256 | -1.09% | 197.998 | 195.836 | 0.013 | 0.00% | 0.013 |
| MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_ENERGY_CALC_LAMBDA/171 | -1.04% | 5.364 | 5.308 | 0.009 | 0.09% | 0.009 |