On Tue, Aug 6, 2024 at 8:50 PM Andi Kleen <a...@firstfloor.org> wrote:
>
> > -      s += 16;
> > +      v16qi data, t;
> > +      /* Unaligned load.  Reading beyond the final newline is safe, since
> > +      files.cc:read_file_guts pads the allocation.  */
>
> You need to change that function to use 32 byte padding as Jakub
> pointed out (I forgot that too)
>
> > +      data = *(const v16qi_u *)s;
> > +      /* Prevent propagation into pshufb and pcmp as memory operand.  */
> > +      __asm__ ("" : "+x" (data));
>
> It would probably make sense to a file a PR on this separately,
> to eventually fix the compiler to not need such workarounds.
> Not sure how much difference it makes however.

This is probably to work around bugs in older compiler versions?  If
not I agree.

Otherwise the patch is OK.

Thanks,
Richard.

> -Andi

Reply via email to