|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC7
|
-46.16% |
18.583 |
10.006 |
0.000 |
-6.67% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8
|
-37.51% |
17.154 |
10.720 |
0.000 |
-6.25% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
-37.50% |
17.154 |
10.720 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC2
|
-36.91% |
8.577 |
5.411 |
0.063 |
0.67% |
0.063 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC15
|
-36.84% |
27.159 |
17.153 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC16
|
-36.67% |
28.589 |
18.106 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC16
|
-36.67% |
28.589 |
18.106 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC7
|
-36.36% |
15.724 |
10.006 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC31
|
-35.20% |
50.029 |
32.419 |
0.004 |
-0.02% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC31
|
-35.19% |
50.029 |
32.426 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC15
|
-35.14% |
26.445 |
17.152 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
-35.14% |
26.445 |
17.152 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC15
|
-35.14% |
26.445 |
17.152 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC15
|
-35.14% |
26.445 |
17.153 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC15
|
-35.14% |
26.444 |
17.153 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC15
|
-35.14% |
26.445 |
17.153 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
-35.05% |
27.874 |
18.105 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC16
|
-35.05% |
27.874 |
18.106 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC16
|
-35.04% |
27.874 |
18.106 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC16
|
-35.04% |
27.874 |
18.106 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC16
|
-35.04% |
27.873 |
18.105 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC8
|
-34.79% |
16.439 |
10.720 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC8
|
-34.79% |
16.439 |
10.720 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8
|
-34.79% |
16.439 |
10.720 |
0.000 |
-6.26% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
-34.33% |
95.773 |
62.892 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC63
|
-34.33% |
95.774 |
62.893 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC63
|
-34.33% |
95.774 |
62.893 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC63
|
-34.33% |
95.773 |
62.893 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC63
|
-34.33% |
95.772 |
62.894 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC63
|
-34.33% |
95.771 |
62.894 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC63
|
-34.33% |
95.772 |
62.894 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC32
|
-34.28% |
50.746 |
33.352 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC32
|
-34.28% |
50.746 |
33.352 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC32
|
-34.28% |
50.747 |
33.353 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC32
|
-34.27% |
50.745 |
33.352 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC32
|
-34.27% |
50.746 |
33.353 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC32
|
-34.27% |
50.746 |
33.353 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
-34.27% |
50.746 |
33.354 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC31
|
-34.26% |
49.317 |
32.419 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC31
|
-34.26% |
49.318 |
32.420 |
0.003 |
-0.02% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
-34.26% |
49.316 |
32.419 |
0.004 |
-0.01% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC31
|
-34.26% |
49.318 |
32.421 |
0.004 |
-0.01% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC31
|
-34.26% |
49.316 |
32.421 |
0.007 |
-0.02% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC64
|
-33.83% |
96.490 |
63.846 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC64
|
-33.83% |
96.491 |
63.846 |
0.002 |
-0.01% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC64
|
-33.83% |
96.490 |
63.847 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
-33.83% |
96.489 |
63.848 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC64
|
-33.83% |
96.489 |
63.848 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC64
|
-33.83% |
96.488 |
63.847 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC64
|
-33.83% |
96.487 |
63.848 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC7
|
-33.34% |
15.009 |
10.006 |
0.000 |
-6.67% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
-33.34% |
15.009 |
10.006 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC7
|
-33.34% |
15.009 |
10.006 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC2
|
-32.09% |
7.862 |
5.339 |
0.065 |
-1.37% |
0.065 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC2
|
-31.53% |
7.862 |
5.383 |
0.032 |
0.44% |
0.032 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC4
|
-31.25% |
11.436 |
7.862 |
0.000 |
-8.34% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC4
|
-31.25% |
11.436 |
7.862 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC2
|
-31.02% |
7.862 |
5.423 |
0.035 |
-0.09% |
0.035 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2
|
-30.76% |
7.862 |
5.444 |
0.044 |
-0.02% |
0.044 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC8
|
-30.44% |
16.439 |
11.436 |
0.000 |
6.65% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC8
|
-30.43% |
16.439 |
11.436 |
0.000 |
6.66% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC2
|
-30.39% |
7.862 |
5.473 |
0.034 |
-0.91% |
0.034 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC2
|
-30.10% |
7.862 |
5.496 |
0.005 |
-0.66% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC7
|
-28.58% |
15.009 |
10.720 |
0.000 |
7.12% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC7
|
-28.58% |
15.009 |
10.720 |
0.000 |
7.13% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC3
|
-28.57% |
10.006 |
7.147 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3
|
-28.57% |
10.006 |
7.147 |
0.000 |
-9.09% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/16
|
-26.90% |
40.737 |
29.781 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC4
|
-26.67% |
10.721 |
7.862 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC4
|
-26.67% |
10.721 |
7.862 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC4
|
-26.67% |
10.721 |
7.862 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/16
|
-26.47% |
24.300 |
17.868 |
0.000 |
-3.86% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/28
|
-26.28% |
37.165 |
27.398 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/10
|
-25.64% |
27.874 |
20.727 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/28
|
-25.09% |
66.469 |
49.794 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51
|
-24.70% |
60.749 |
45.744 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/10
|
-24.00% |
17.868 |
13.580 |
0.000 |
-5.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/51
|
-23.86% |
115.782 |
88.151 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC3
|
-23.08% |
9.292 |
7.147 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3
|
-23.08% |
9.292 |
7.147 |
0.000 |
-9.10% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
-23.08% |
9.291 |
7.147 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC3
|
-23.08% |
9.291 |
7.147 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint16_t_To_uint64_t_
|
-21.92% |
28598.300 |
22330.143 |
5.485 |
-0.07% |
5.485 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW16From_uint16_t_To_uint64_t_
|
-21.91% |
28598.586 |
22332.228 |
4.997 |
-0.02% |
4.997 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint16_t_To_uint64_t_
|
-21.91% |
28598.439 |
22332.535 |
0.976 |
-0.03% |
0.976 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/999
|
-21.62% |
2154.858 |
1688.935 |
0.024 |
0.63% |
0.024 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_int32_t_
|
-21.26% |
1133306.321 |
892397.698 |
3101.085 |
4.10% |
3101.085 |
|
MultiSource/Benchmarks/Prolangs-C/gnugo/gnugo
Profile
|
-21.12% |
0.090 |
0.071 |
0.000 |
-20.80% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_int32_t_
|
-20.99% |
1134133.550 |
896102.828 |
1238.445 |
4.47% |
1238.445 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/999
|
-20.25% |
1086.343 |
866.310 |
1.890 |
-0.16% |
1.890 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4
|
-20.01% |
10.721 |
8.576 |
0.000 |
9.08% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC4
|
-20.00% |
10.721 |
8.576 |
0.000 |
9.08% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256
|
-19.84% |
561.748 |
450.284 |
0.006 |
2.43% |
0.006 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128
|
-19.64% |
192.982 |
155.088 |
0.022 |
-2.24% |
0.022 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128
|
-19.63% |
192.980 |
155.090 |
0.018 |
-2.24% |
0.018 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128
|
-19.63% |
192.974 |
155.086 |
0.020 |
-2.25% |
0.020 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128
|
-19.63% |
192.980 |
155.093 |
0.005 |
-2.25% |
0.005 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128
|
-19.63% |
192.979 |
155.099 |
0.013 |
-2.24% |
0.013 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC128
|
-19.63% |
192.972 |
155.098 |
0.023 |
-2.24% |
0.023 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128
|
-19.62% |
192.974 |
155.109 |
0.003 |
-2.23% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC127
|
-18.29% |
191.549 |
156.515 |
0.004 |
-0.00% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC127
|
-18.29% |
191.545 |
156.516 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC127
|
-18.29% |
191.545 |
156.517 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127
|
-18.29% |
191.549 |
156.520 |
0.003 |
-0.01% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127
|
-18.29% |
191.543 |
156.517 |
0.003 |
-0.00% |
0.003 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127
|
-18.28% |
191.544 |
156.521 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
-18.28% |
191.539 |
156.519 |
0.056 |
-0.01% |
0.056 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/28
|
-18.23% |
60.896 |
49.794 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW16From_uint32_t_To_uint8_t_
|
-16.97% |
21449.868 |
17808.843 |
7.474 |
-16.98% |
7.474 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint32_t_To_uint8_t_
|
-16.97% |
21449.579 |
17808.716 |
7.428 |
-16.99% |
7.428 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint32_t_To_uint8_t_
|
-16.96% |
21450.236 |
17812.382 |
5.672 |
-16.97% |
5.672 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC3
|
-15.39% |
9.292 |
7.862 |
0.000 |
9.98% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/256
|
-14.98% |
290.891 |
247.318 |
1.539 |
0.50% |
1.539 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/10
|
-14.28% |
5.003 |
4.288 |
0.000 |
-14.29% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/256
|
-14.28% |
5.003 |
4.288 |
0.000 |
-14.29% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/28
|
-14.28% |
5.003 |
4.288 |
0.000 |
-14.29% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/16
|
-14.28% |
5.003 |
4.288 |
0.000 |
-14.29% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/51
|
-14.28% |
5.003 |
4.288 |
0.000 |
-14.29% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_two_loads<uint8_t>/999
|
-14.28% |
5.003 |
4.288 |
0.000 |
-14.29% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC1
|
-12.51% |
5.718 |
5.003 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopInterchange/LoopInterchange.test:BENCHMARK_LI1
|
-11.61% |
2711.794 |
2397.089 |
86.359 |
-24.17% |
86.359 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC2
|
-9.10% |
7.862 |
7.147 |
0.000 |
3.43% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
-9.09% |
7.862 |
7.147 |
0.000 |
3.44% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/999
|
-8.28% |
2867.447 |
2630.108 |
0.047 |
-0.01% |
0.047 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/256
|
-8.17% |
744.009 |
683.217 |
0.058 |
0.00% |
0.058 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/999
|
-8.07% |
1443.012 |
1326.531 |
0.007 |
-0.01% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/51
|
-7.69% |
148.659 |
137.226 |
0.002 |
-0.00% |
0.002 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/256
|
-7.29% |
382.370 |
354.504 |
0.004 |
0.18% |
0.004 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/28
|
-7.18% |
82.909 |
76.953 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/28
|
-6.67% |
3.574 |
3.335 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/10
|
-6.67% |
3.574 |
3.335 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/51
|
-6.66% |
3.574 |
3.335 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/256
|
-6.66% |
3.574 |
3.335 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/256
|
-6.66% |
3.574 |
3.335 |
0.000 |
-6.67% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/16
|
-6.66% |
3.573 |
3.335 |
0.000 |
-6.67% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/999
|
-6.66% |
3.574 |
3.335 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/16
|
-6.66% |
3.574 |
3.335 |
0.000 |
-0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/51
|
-6.66% |
3.573 |
3.335 |
0.000 |
-6.67% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/999
|
-6.66% |
3.573 |
3.335 |
0.000 |
-6.66% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/28
|
-6.66% |
3.574 |
3.335 |
0.000 |
-6.66% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint16_t>/10
|
-6.66% |
3.573 |
3.335 |
0.000 |
-6.67% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_autovec_int32_t_
|
-6.47% |
560672.800 |
524397.158 |
1888.271 |
-6.63% |
1888.271 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/16
|
-6.37% |
48.601 |
45.504 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/51
|
-6.29% |
75.761 |
70.996 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_novec_int32_t_
|
-6.14% |
560424.000 |
526009.871 |
1807.298 |
-6.35% |
1807.298 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_autovec_int32_t_
|
-6.12% |
362833.853 |
340645.712 |
276.295 |
-0.01% |
276.295 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int32_t_
|
-5.91% |
362457.662 |
341037.956 |
103.447 |
-0.10% |
103.447 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/10
|
-5.30% |
31.448 |
29.780 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/28
|
-4.84% |
44.312 |
42.169 |
0.000 |
0.00% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/10
|
-4.76% |
15.009 |
14.294 |
0.000 |
-0.01% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/999
|
-4.17% |
5.718 |
5.479 |
0.000 |
-4.17% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/28
|
-4.17% |
5.718 |
5.480 |
0.000 |
-4.17% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/10
|
-4.17% |
5.718 |
5.480 |
0.000 |
-4.17% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/16
|
-4.17% |
5.718 |
5.480 |
0.000 |
-4.17% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/256
|
-4.16% |
5.718 |
5.480 |
0.000 |
-4.17% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_three_loads<uint64_t>/51
|
-4.16% |
5.718 |
5.480 |
0.000 |
-4.17% |
0.000 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_novec_uint8_t_
|
-4.02% |
639555.251 |
613828.947 |
36.323 |
30.64% |
36.323 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_autovec_uint8_t_
|
-4.02% |
639480.804 |
613764.912 |
51.215 |
30.65% |
51.215 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/256
|
-3.85% |
203.691 |
195.840 |
0.001 |
-1.09% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/16
|
-3.33% |
21.441 |
20.727 |
0.000 |
0.00% |
0.000 |
|
MultiSource/Applications/aha/aha
Profile
|
-2.99% |
3.759 |
3.646 |
0.002 |
-2.90% |
0.002 |
|
External/SPEC/CFP2017rate/511.povray_r/511.povray_r
Profile
|
-2.99% |
14.530 |
14.095 |
0.007 |
-3.17% |
0.007 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/16
|
-2.63% |
27.160 |
26.445 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC127
|
-2.54% |
289.660 |
282.298 |
0.017 |
-0.01% |
0.017 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/28
|
-2.08% |
34.306 |
33.592 |
0.001 |
-0.01% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC1
|
-2.00% |
14.112 |
13.829 |
0.083 |
14.34% |
0.083 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC16
|
-1.89% |
37.880 |
37.164 |
0.001 |
1.95% |
0.001 |
|
External/SPEC/CINT2017rate/523.xalancbmk_r/523.xalancbmk_r
Profile
|
-1.64% |
155.911 |
153.349 |
0.291 |
-0.94% |
0.291 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint64_t>/256
|
-1.46% |
199.463 |
196.559 |
0.045 |
-0.73% |
0.045 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint32_t_To_uint64_t_
|
-1.33% |
22493.866 |
22194.501 |
16.625 |
-1.33% |
16.625 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW16From_uint32_t_To_uint64_t_
|
-1.32% |
22488.917 |
22192.944 |
10.436 |
-1.32% |
10.436 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint32_t_To_uint64_t_
|
-1.26% |
22475.781 |
22192.058 |
17.113 |
-1.32% |
17.113 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/51
|
-1.23% |
57.892 |
57.179 |
0.001 |
-0.00% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_erf_novec_double_
|
-1.18% |
563.464 |
556.834 |
1.658 |
-0.61% |
1.658 |
|
MultiSource/Benchmarks/Ptrdist/anagram/anagram
Profile
|
-1.10% |
1.882 |
1.861 |
0.001 |
-0.41% |
0.001 |
|
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/999
|
-1.07% |
734.024 |
726.163 |
0.015 |
-0.29% |
0.015 |
|
SingleSource/Benchmarks/Polybench/stencils/adi/adi
Profile
|
-1.03% |
61.242 |
60.612 |
0.027 |
-1.07% |
0.027 |
|
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31
|
-1.02% |
70.043 |
69.327 |
0.002 |
1.03% |
0.002 |
|
MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/32
|
-1.01% |
172.073 |
170.328 |
0.022 |
-0.00% |
0.022 |