Re: [PATCH v3] x86: Update memcpy/memset inline strategies for -mtune=generic

2025-06-25 Thread Jan Hubicka
> Here is the v3 patch. It no longer uses "rep mov/stos". Lili, can you measure
> its performance impact on Intel and AMD cpus?
>
> The updated generic has
>
> Update memcpy and memset inline strategies for -mtune=generic:
>
> 1. Don't align memory.

This looks OK to me (recent microarchs s

Re: [PATCH v3] x86: Update memcpy/memset inline strategies for -mtune=generic

2025-06-25 Thread Hongtao Liu
On Tue, Jun 17, 2025 at 8:54 PM Cui, Lili wrote:
>
> > -Original Message-
> > From: H.J. Lu
> > Sent: Monday, June 16, 2025 10:08 PM
> > To: Jan Hubicka
> > Cc: Uros Bizjak; Cui, Lili; gcc-patc...@gcc.gnu.org; Liu, Hongtao; mjgu...@gmail.com
> > Subject: [PATCH v3] x86: U

Re: [PATCH v3] x86: Update memcpy/memset inline strategies for -mtune=generic

2025-06-24 Thread Uros Bizjak
On Tue, Jun 24, 2025 at 5:22 AM Hongtao Liu wrote:
> > > > Ideally we should catch repeated constants more generally since
> > > > this appears elsewhere too.
> > > > I am not quite sure where to fit it best. We already have a
> > > > machine specific task that loads 0 into

Re: [PATCH v3] x86: Update memcpy/memset inline strategies for -mtune=generic

2025-06-23 Thread H.J. Lu
On Tue, Jun 24, 2025 at 1:18 PM Jan Hubicka wrote:
>
> > Here is the v3 patch. It no longer uses "rep mov/stos". Lili, can you measure
> > its performance impact on Intel and AMD cpus?
> >
> > The updated generic has
> >
> > Update memcpy and memset inline strategies for -mtune=generic:
>

RE: [PATCH v3] x86: Update memcpy/memset inline strategies for -mtune=generic

2025-06-17 Thread Cui, Lili
> -Original Message-
> From: H.J. Lu
> Sent: Monday, June 16, 2025 10:08 PM
> To: Jan Hubicka
> Cc: Uros Bizjak; Cui, Lili; gcc-patc...@gcc.gnu.org; Liu, Hongtao; mjgu...@gmail.com
> Subject: [PATCH v3] x86: Update memcpy/memset inline strategies for -mtune=generic
>
> On Mo