https://gcc.gnu.org/bugzilla/show_bug.cgi?id=78699
--- Comment #3 from Richard Biener <rguenth at gcc dot gnu.org> ---
#12 0x00000000013e3c0a in vectorizable_load (
    stmt=<gimple_assign 0x2aaaaec39780>, gsi=0x7fffffffd140,
    vec_stmt=0x7fffffffd058, slp_node=0x296b970, slp_node_instance=0x278b4e0)
    at /space/rguenther/src/svn/trunk/gcc/tree-vect-stmts.c:7455
7455                                                   stmt, NULL_TREE);
(gdb) l
7450                {
7451                  for (i = 0; i < vec_num; i++)
7452                    {
7453                      if (i > 0)
7454                        dataref_ptr = bump_vector_ptr (dataref_ptr, ptr_incr, gsi,
7455                                                       stmt, NULL_TREE);
7456
7457                      /* 2. Create the vector-load in the loop.  */
7458                      switch (alignment_support_scheme)
7459                        {
(gdb) p vec_num
$1 = 7406755

ah, I thought we have fixed all those instances...  (ah, no, I fixed cost
calculation!)

(gdb) p stmt_info->gap
$7 = 14716900

So the ultimate issue is that we are kind-of stupid when generating code
for SLP permutations.  Maybe it's time to fix that...