https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90539
--- Comment #2 from Thomas Koenig <tkoenig at gcc dot gnu.org> ---

I am a bit surprised that the library version of packing seems to be faster than the inlined one. Or maybe some argument is now being packed which should not be. Increased code size is sort of expected: copying inline takes more code than calling a library function, which is why this is not done at -Os. Is it possible to get a reduced test case that shows the slowdown?