http://gcc.gnu.org/bugzilla/show_bug.cgi?id=53346
Richard Guenther <rguenth at gcc dot gnu.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |ASSIGNED
AssignedTo|unassigned at gcc dot |rguenth at gcc dot gnu.org
|gnu.org |
--- Comment #5 from Richard Guenther <rguenth at gcc dot gnu.org> 2012-05-18
11:02:53 UTC ---
Yeah, this is sort-of related to what is observed in PR53355. I suppose at
runtime nxtr is comparatively small.
Reduced testcase:
subroutine cptrf2 (nxtr, ixtrt)
integer, dimension (1:nxtr), intent (out) :: ixtrt
ixtrt = 0
end subroutine
we peel the loop to possibly align the stores, and we peel the loop
to possibly take care of a remaining store at the end of the array.
And of course we compute that we need at least 6 scalar iterations
to make executing the vectorized loop profitable.
And apart from all that we should have recognized the loop as memset.
Mine.