| SingleSource/Benchmarks/Misc/revertBits | 18315.09% | 0.002 | 0.427 | 0.000 | 16921.12% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC7 | 73.31% | 10.722 | 18.583 | 0.000 | 73.32% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8 | 59.99% | 10.721 | 17.153 | 0.001 | 59.98% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC15 | 58.32% | 17.155 | 27.160 | 0.000 | 58.32% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC2 | 58.15% | 5.423 | 8.577 | 0.000 | 56.49% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC16 | 57.88% | 18.107 | 28.589 | 0.000 | 57.88% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC16 | 57.88% | 18.108 | 28.589 | 0.001 | 57.88% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC7 | 57.13% | 10.007 | 15.724 | 0.000 | 57.13% | 0.000 |
| SingleSource/Benchmarks/Dhrystone/dry | 55.40% | 0.289 | 0.449 | 0.020 | 55.40% | 0.020 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC31 | 54.26% | 32.431 | 50.030 | 0.002 | 54.27% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC31 | 54.26% | 32.431 | 50.029 | 0.001 | 54.29% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC15 | 54.16% | 17.154 | 26.445 | 0.001 | 54.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC15 | 54.15% | 17.155 | 26.445 | 0.001 | 54.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC15 | 54.15% | 17.155 | 26.445 | 0.000 | 54.15% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC15 | 54.15% | 17.155 | 26.445 | 0.000 | 54.15% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15 | 54.15% | 17.155 | 26.445 | 0.001 | 54.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC15 | 54.15% | 17.156 | 26.445 | 0.000 | 54.16% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC16 | 53.94% | 18.107 | 27.874 | 0.001 | 53.93% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC16 | 53.94% | 18.108 | 27.875 | 0.000 | 53.94% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC16 | 53.94% | 18.108 | 27.875 | 0.000 | 53.93% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16 | 53.93% | 18.108 | 27.874 | 0.001 | 53.93% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC16 | 53.92% | 18.108 | 27.872 | 0.001 | 53.93% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC8 | 53.33% | 10.721 | 16.438 | 0.001 | 53.32% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC8 | 53.32% | 10.722 | 16.439 | 0.000 | 53.31% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC8 | 53.31% | 10.722 | 16.439 | 0.000 | 53.32% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC8 | 53.31% | 10.722 | 16.439 | 0.000 | 53.32% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC63 | 52.27% | 62.897 | 95.770 | 0.003 | 52.25% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63 | 52.26% | 62.898 | 95.770 | 0.003 | 52.25% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC63 | 52.26% | 62.901 | 95.774 | 0.001 | 52.26% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC63 | 52.26% | 62.901 | 95.774 | 0.001 | 52.26% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC63 | 52.26% | 62.902 | 95.773 | 0.002 | 52.25% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC63 | 52.25% | 62.903 | 95.772 | 0.002 | 52.25% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC63 | 52.25% | 62.904 | 95.773 | 0.002 | 52.26% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC32 | 52.14% | 33.354 | 50.746 | 0.001 | 52.13% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC32 | 52.14% | 33.356 | 50.746 | 0.001 | 52.13% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC32 | 52.13% | 33.356 | 50.746 | 0.001 | 52.13% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC32 | 52.13% | 33.357 | 50.747 | 0.001 | 52.14% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC32 | 52.13% | 33.356 | 50.744 | 0.001 | 52.13% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32 | 52.13% | 33.357 | 50.745 | 0.001 | 52.13% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC32 | 52.13% | 33.357 | 50.745 | 0.001 | 52.12% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC31 | 52.09% | 32.427 | 49.317 | 0.001 | 52.08% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31 | 52.08% | 32.427 | 49.315 | 0.001 | 52.07% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC31 | 52.08% | 32.427 | 49.316 | 0.001 | 52.04% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC31 | 52.08% | 32.427 | 49.316 | 0.001 | 52.06% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC31 | 52.08% | 32.428 | 49.316 | 0.001 | 52.05% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC64 | 51.11% | 63.853 | 96.488 | 0.002 | 51.11% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC64 | 51.11% | 63.855 | 96.490 | 0.002 | 51.11% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC64 | 51.11% | 63.855 | 96.490 | 0.002 | 51.11% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC64 | 51.11% | 63.854 | 96.487 | 0.002 | 51.10% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64 | 51.10% | 63.855 | 96.488 | 0.003 | 51.10% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC64 | 51.10% | 63.856 | 96.489 | 0.002 | 51.11% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC64 | 51.10% | 63.855 | 96.486 | 0.003 | 51.11% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC7 | 50.00% | 10.007 | 15.010 | 0.000 | 50.00% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7 | 50.00% | 10.007 | 15.010 | 0.000 | 49.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC7 | 49.99% | 10.007 | 15.009 | 0.000 | 49.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC8 | 49.99% | 11.437 | 17.154 | 0.000 | 49.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC7 | 49.98% | 10.007 | 15.009 | 0.001 | 49.98% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC4 | 45.44% | 7.863 | 11.436 | 0.000 | 45.44% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC2 | 45.04% | 5.421 | 7.862 | 0.000 | 41.04% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC2 | 44.84% | 5.428 | 7.862 | 0.000 | 42.72% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC2 | 44.70% | 5.433 | 7.862 | 0.000 | 40.27% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC2 | 44.53% | 5.440 | 7.862 | 0.000 | 42.35% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC2 | 43.89% | 5.464 | 7.862 | 0.000 | 42.26% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC8 | 43.73% | 11.437 | 16.439 | 0.000 | 43.74% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC2 | 43.31% | 5.486 | 7.862 | 0.000 | 42.11% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC3 | 40.00% | 7.147 | 10.006 | 0.000 | 39.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC7 | 39.99% | 10.722 | 15.010 | 0.000 | 39.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_autovec_uint8_t_ | 36.86% | 467293.057 | 639516.453 | 26.687 | 36.13% | 26.687 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_multi_csa_only_novec_uint8_t_ | 36.85% | 467320.427 | 639531.993 | 15.600 | 36.12% | 15.600 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/16 | 36.78% | 29.784 | 40.740 | 0.000 | 36.78% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC4 | 36.36% | 7.862 | 10.721 | 0.000 | 36.35% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC4 | 36.36% | 7.863 | 10.721 | 0.000 | 36.35% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC4 | 36.35% | 7.863 | 10.721 | 0.000 | 36.36% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC4 | 36.35% | 7.863 | 10.721 | 0.000 | 36.36% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC4 | 36.35% | 7.863 | 10.721 | 0.000 | 36.36% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/28 | 35.63% | 27.401 | 37.166 | 0.001 | 35.64% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/10 | 34.47% | 20.729 | 27.874 | 0.000 | 34.46% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/28 | 33.48% | 49.797 | 66.469 | 0.001 | 33.48% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC4 | 33.32% | 8.578 | 11.436 | 0.000 | 33.32% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/51 | 32.80% | 45.745 | 60.750 | 0.001 | 32.80% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_novec_int32_t_ | 32.19% | 856602.689 | 1132302.100 | 585.897 | 32.09% | 585.897 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_nested_cond_load_autovec_int32_t_ | 31.77% | 859045.343 | 1131944.895 | 2014.775 | 31.97% | 2014.775 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/51 | 31.34% | 88.160 | 115.785 | 0.003 | 31.33% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/16 | 30.76% | 18.584 | 24.300 | 0.000 | 30.75% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC3 | 30.00% | 7.148 | 9.292 | 0.000 | 29.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC3 | 29.99% | 7.148 | 9.292 | 0.000 | 29.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC3 | 29.99% | 7.148 | 9.292 | 0.000 | 29.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3 | 29.98% | 7.148 | 9.292 | 0.000 | 29.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/999 | 28.39% | 1678.367 | 2154.913 | 0.003 | 28.39% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint32_t>/999 | 28.21% | 1679.022 | 2152.736 | 0.052 | 28.21% | 0.052 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint16_t_To_uint64_t_ | 27.98% | 22346.536 | 28598.137 | 6.108 | 27.97% | 6.108 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint16_t_To_uint64_t_ | 27.96% | 22350.207 | 28598.219 | 6.131 | 27.96% | 6.131 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW16From_uint16_t_To_uint64_t_ | 27.95% | 22350.784 | 28598.022 | 6.100 | 27.98% | 6.100 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/999 | 27.81% | 847.744 | 1083.513 | 0.022 | 27.80% | 0.022 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint8_t>/256 | 27.79% | 439.599 | 561.775 | 0.019 | 27.79% | 0.019 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint32_t>/256 | 27.31% | 439.597 | 559.629 | 0.021 | 27.30% | 0.021 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint32_t>/51 | 27.28% | 88.159 | 112.212 | 0.003 | 27.28% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC3 | 27.27% | 7.863 | 10.006 | 0.000 | 27.26% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint32_t>/28 | 26.31% | 49.796 | 62.896 | 0.001 | 26.30% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/256 | 25.92% | 228.770 | 288.057 | 0.002 | 25.92% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/999 | 25.02% | 868.986 | 1086.376 | 0.006 | 24.87% | 0.006 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/51 | 24.99% | 45.748 | 57.178 | 0.001 | 24.98% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/10 | 24.98% | 14.296 | 17.867 | 0.001 | 24.98% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint32_t>/16 | 23.80% | 30.022 | 37.166 | 0.000 | 23.80% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/28 | 22.60% | 27.400 | 33.593 | 0.000 | 22.60% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC127 | 22.38% | 156.528 | 191.553 | 4.236 | 22.37% | 4.236 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC127 | 22.37% | 156.529 | 191.552 | 1.943 | 22.37% | 1.943 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127 | 22.37% | 156.537 | 191.552 | 0.004 | 22.37% | 0.004 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC127 | 22.37% | 156.538 | 191.552 | 0.012 | 22.37% | 0.012 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC127 | 22.37% | 156.540 | 191.552 | 4.204 | 22.37% | 4.204 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC127 | 22.37% | 156.541 | 191.552 | 4.189 | 22.36% | 4.189 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC127 | 22.36% | 156.538 | 191.545 | 0.005 | 22.36% | 0.005 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC128 | 21.63% | 158.665 | 192.984 | 0.003 | 21.62% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC128 | 21.62% | 158.676 | 192.985 | 0.001 | 21.63% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1LoopWithReductionTC128 | 21.62% | 158.673 | 192.980 | 0.020 | 21.62% | 0.020 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC128 | 21.62% | 158.678 | 192.985 | 0.016 | 21.63% | 0.016 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC128 | 21.62% | 158.678 | 192.982 | 0.004 | 21.62% | 0.004 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC128 | 21.61% | 158.683 | 192.979 | 0.003 | 21.62% | 0.003 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC128 | 21.61% | 158.684 | 192.977 | 0.004 | 21.61% | 0.004 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopWithReductionTC3 | 18.17% | 7.863 | 9.292 | 0.000 | 18.17% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_novec_int32_t_ | 17.80% | 551764.479 | 649985.061 | 2773.188 | 19.18% | 2773.188 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_cond_arith_autovec_int32_t_ | 17.12% | 552040.032 | 646571.693 | 4568.200 | 15.37% | 4568.200 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC1 | 16.64% | 12.093 | 14.105 | 0.013 | 16.62% | 0.013 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint8_t>/256 | 16.29% | 250.126 | 290.883 | 0.006 | 16.99% | 0.006 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopTC1 | 16.15% | 12.149 | 14.111 | 0.010 | 14.00% | 0.010 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/16 | 15.99% | 17.869 | 20.728 | 0.000 | 15.99% | 0.000 |
| SingleSource/Benchmarks/Polybench/linear-algebra/blas/gemver/gemver | 14.48% | 0.280 | 0.320 | 0.004 | 19.16% | 0.004 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC1 | 14.28% | 5.003 | 5.718 | 0.000 | 14.28% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC2 | 13.79% | 6.910 | 7.862 | 0.000 | 13.79% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2 | 13.78% | 6.910 | 7.862 | 0.000 | 13.79% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint32_t>/10 | 13.32% | 21.444 | 24.300 | 0.001 | 13.32% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW16From_uint64_t_To_uint8_t_ | 12.39% | 19098.563 | 21464.737 | 3.224 | 12.34% | 3.224 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint64_t_To_uint8_t_ | 12.33% | 19104.634 | 21460.507 | 4.989 | 12.33% | 4.989 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint8_t_ | 12.33% | 19103.986 | 21459.668 | 5.361 | 12.33% | 5.361 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_autovec_uint8_t_ | 12.14% | 335822.627 | 376598.166 | 1397.529 | 11.56% | 1397.529 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_single_csa_cond_load_novec_uint8_t_ | 12.06% | 335712.434 | 376196.131 | 1416.426 | 11.70% | 1416.426 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/999 | 9.02% | 2630.249 | 2867.440 | 9.721 | 9.01% | 9.721 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/256 | 8.88% | 683.367 | 744.031 | 4.060 | 8.88% | 4.060 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/999 | 8.77% | 1326.648 | 1443.023 | 0.022 | 8.77% | 0.022 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/51 | 8.32% | 137.239 | 148.662 | 0.005 | 8.32% | 0.005 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/256 | 8.06% | 353.850 | 382.385 | 0.019 | 8.06% | 0.019 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/28 | 7.73% | 76.958 | 82.908 | 0.002 | 7.73% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC3 | 7.69% | 9.292 | 10.006 | 0.000 | 7.68% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3 | 7.68% | 9.292 | 10.006 | 0.000 | 7.68% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC3 | 7.68% | 9.292 | 10.006 | 0.000 | 7.69% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC3 | 7.68% | 9.292 | 10.006 | 0.000 | 7.69% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC3 | 7.68% | 9.293 | 10.006 | 0.000 | 7.68% | 0.000 |
| SingleSource/Benchmarks/CoyoteBench/huffbench | 7.37% | 45.945 | 49.330 | 0.002 | 7.38% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/10 | 7.14% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/16 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint32_t>/256 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint32_t>/10 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint32_t>/51 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/28 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint32_t>/999 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/51 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint32_t>/16 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/256 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint32_t>/28 | 7.13% | 3.336 | 3.574 | 0.000 | 7.12% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_first_single_load<uint64_t>/999 | 7.13% | 3.336 | 3.574 | 0.000 | 7.13% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/16 | 6.80% | 45.507 | 48.601 | 0.001 | 6.80% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/51 | 6.70% | 71.005 | 75.761 | 0.002 | 6.70% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC4 | 6.66% | 10.721 | 11.436 | 0.000 | 6.66% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC4 | 6.66% | 10.722 | 11.436 | 0.000 | 6.65% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC4 | 6.66% | 10.722 | 11.436 | 0.000 | 6.66% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC4 | 6.66% | 10.722 | 11.436 | 0.000 | 6.66% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC4 | 6.65% | 10.722 | 11.436 | 0.000 | 6.66% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_novec_int32_t_ | 6.08% | 341612.225 | 362368.012 | 366.614 | 5.71% | 366.614 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:BENCHMARK_csa_with_in_loop_arith_autovec_int32_t_ | 6.03% | 342078.679 | 362706.432 | 166.045 | 6.36% | 166.045 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_three_loads<uint16_t>/10 | 5.59% | 29.782 | 31.448 | 0.000 | 5.60% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint32_t>/10 | 5.25% | 13.582 | 14.295 | 0.000 | 5.25% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/28 | 5.08% | 42.172 | 44.313 | 0.001 | 5.07% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/10 | 4.99% | 14.296 | 15.009 | 0.000 | 4.99% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint8_t>/127 | 4.95% | 150.167 | 157.598 | 0.002 | 4.92% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127 | 4.61% | 145.990 | 152.717 | 0.258 | 4.46% | 0.258 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC7 | 4.16% | 17.154 | 17.868 | 0.000 | 4.16% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC7 | 4.16% | 17.155 | 17.869 | 0.000 | 4.16% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC7 | 4.16% | 17.155 | 17.868 | 0.000 | 4.16% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC7 | 4.15% | 17.155 | 17.868 | 0.000 | 4.16% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC16 | 3.91% | 36.454 | 37.881 | 0.001 | 3.91% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8 | 3.70% | 19.299 | 20.012 | 0.000 | 3.69% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC8 | 3.69% | 19.299 | 20.012 | 0.001 | 3.70% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC8 | 3.69% | 19.299 | 20.012 | 0.001 | 3.69% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC8 | 3.69% | 19.300 | 20.012 | 0.000 | 3.69% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_no_early_exit_two_loads<uint16_t>/10 | 3.56% | 20.015 | 20.727 | 0.000 | 3.56% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC2 | 3.45% | 6.909 | 7.148 | 0.000 | 3.44% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC2 | 3.44% | 6.909 | 7.147 | 0.000 | 3.44% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC2 | 3.44% | 6.910 | 7.147 | 0.000 | 3.44% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/16 | 3.43% | 20.729 | 21.441 | 0.001 | 3.43% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint32_t>/256 | 2.87% | 198.016 | 203.698 | 0.023 | 2.88% | 0.023 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_three_loads<uint16_t>/16 | 2.69% | 26.447 | 27.160 | 0.000 | 2.69% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/28 | 2.11% | 33.597 | 34.307 | 0.001 | 2.12% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC15 | 2.08% | 34.308 | 35.022 | 0.001 | 2.08% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC15 | 2.08% | 34.308 | 35.021 | 0.001 | 2.08% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC31 | 2.08% | 68.618 | 70.044 | 0.002 | 2.08% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC15 | 2.08% | 34.309 | 35.021 | 0.001 | 2.08% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16 | 1.95% | 36.455 | 37.167 | 0.001 | 1.96% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC16 | 1.95% | 36.455 | 37.166 | 0.000 | 1.95% | 0.000 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC16 | 1.95% | 36.455 | 37.166 | 0.001 | 1.95% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC16 | 1.95% | 36.457 | 37.166 | 0.001 | 1.95% | 0.001 |
| External/SPEC/CINT2017rate/523.xalancbmk_r/523.xalancbmk_r | 1.56% | 154.449 | 156.861 | 0.793 | -0.62% | 0.793 |
| MultiSource/Applications/siod/siod | 1.33% | 5.969 | 6.049 | 0.015 | 1.21% | 0.015 |
| MultiSource/Applications/sqlite3/sqlite3 | 1.30% | 7.561 | 7.660 | 0.024 | 1.73% | 0.024 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_two_loads<uint64_t>/51 | 1.24% | 57.183 | 57.892 | 0.002 | 1.23% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC8 | 1.16% | 40.979 | 41.454 | 0.001 | 1.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW1BigLoopWithReductionTC8 | 1.16% | 40.978 | 41.453 | 0.001 | 1.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4BigLoopWithReductionTC8 | 1.16% | 40.980 | 41.453 | 0.001 | 1.16% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC8 | 1.16% | 40.980 | 41.453 | 0.001 | 1.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopWithReductionTC8 | 1.15% | 40.981 | 41.454 | 0.001 | 1.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1BigLoopWithReductionTC8 | 1.15% | 40.980 | 41.453 | 0.001 | 1.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopWithReductionTC8 | 1.15% | 40.981 | 41.453 | 0.001 | 1.15% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:autovec_early_exit_taken_mid_single_load<uint16_t>/256 | 1.08% | 197.268 | 199.405 | 0.005 | 1.08% | 0.005 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC31 | 1.03% | 68.621 | 69.329 | 0.001 | 1.03% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC31 | 1.03% | 68.621 | 69.329 | 0.001 | 1.03% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC31 | 1.03% | 68.622 | 69.329 | 0.002 | 1.04% | 0.002 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopTC31 | 1.03% | 68.622 | 69.328 | 0.001 | 1.03% | 0.001 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC64 | 1.02% | 139.386 | 140.800 | 0.003 | 1.02% | 0.003 |
| MicroBenchmarks/ImageProcessing/Interpolation/Interpolation.test:BENCHMARK_BILINEAR_INTERPOLATION/32 | 1.01% | 170.334 | 172.060 | 0.005 | 1.01% | 0.005 |
| MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC32 | 1.01% | 70.762 | 71.474 | 0.001 | 1.00% | 0.001 |