https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77468
Andrew Pinski <pinskia at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Component|middle-end |target --- Comment #3 from Andrew Pinski <pinskia at gcc dot gnu.org> --- This sounds like the generic tuning is wrong. Can you try -mcpu=cortex-a53 and/or -mcpu=cortex-a57. One thing I noticed about the generic tuning is the cost model for the vectorizer is just bogus and needs a lot of tuning. I noticed this when I was comparing GCC 5 and GCC 6. Also can you try out GCC 6 to see if an issue there too (I bet it is).