https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78699
--- Comment #3 from Richard Biener <rguenth at gcc dot gnu.org> ---
#12 0x00000000013e3c0a in vectorizable_load (
stmt=<gimple_assign 0x2aaaaec39780>, gsi=0x7fffffffd140,
vec_stmt=0x7fffffffd058, slp_node=0x296b970, slp_node_instance=0x278b4e0)
at /space/rguenther/src/svn/trunk/gcc/tree-vect-stmts.c:7455
7455 stmt, NULL_TREE);
(gdb) l
7450 {
7451 for (i = 0; i < vec_num; i++)
7452 {
7453 if (i > 0)
7454 dataref_ptr = bump_vector_ptr (dataref_ptr, ptr_incr,
gsi,
7455 stmt, NULL_TREE);
7456
7457 /* 2. Create the vector-load in the loop. */
7458 switch (alignment_support_scheme)
7459 {
(gdb) p vec_num
$1 = 7406755
ah, I thought we have fixed all those instances... (ah, no, I fixed cost
calculation!)
(gdb) p stmt_info->gap
$7 = 14716900
So the ultimate issue is that we are kind-of stupid when generating code
for SLP permutations. Maybe it's time to fix that...