[gcc r15-3942] Implement CSHIFT and EOSHIFT for unsigned.

2024-09-28 Thread Thomas Kテカnig via Gcc-cvs
https://gcc.gnu.org/g:1c928004cf0bc2131b6199905d11133d23a7cef2 commit r15-3942-g1c928004cf0bc2131b6199905d11133d23a7cef2 Author: Thomas Koenig Date: Sat Sep 28 22:28:59 2024 +0200 Implement CSHIFT and EOSHIFT for unsigned. gcc/fortran/ChangeLog: * check.cc (gfc_ch

[gcc r13-9061] Reduce recursive inlining of always_inline functions

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2532944e3588cf69bce019eaf03de9c63b78568f commit r13-9061-g2532944e3588cf69bce019eaf03de9c63b78568f Author: Jan Hubicka Date: Tue May 14 12:58:56 2024 +0200 Reduce recursive inlining of always_inline functions this patch tames down inliner on (mutiply) self-re

[gcc r14-10717] Zen5 tuning part 1: avoid FMA chains

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:fce2fe0406aa66c5d6f6465984a6af9ccc63370d commit r14-10717-gfce2fe0406aa66c5d6f6465984a6af9ccc63370d Author: Jan Hubicka Date: Tue Sep 3 13:38:33 2024 +0200 Zen5 tuning part 1: avoid FMA chains testing matrix multiplication benchmarks shows that FMA on a criti

[gcc r14-10718] Zen5 tuning part 2: disable gather and scatter

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:3d0a91130eceaf428387ba314cfdfceb99b51709 commit r14-10718-g3d0a91130eceaf428387ba314cfdfceb99b51709 Author: Jan Hubicka Date: Tue Sep 3 15:07:41 2024 +0200 Zen5 tuning part 2: disable gather and scatter We disable gathers for zen4. It seems that gather has i

[gcc r14-10721] Zen5 tuning part 4: update reassocation width

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:b17cb7ed709ea7250eaa4ddc4a713ebbb6b94b37 commit r14-10721-gb17cb7ed709ea7250eaa4ddc4a713ebbb6b94b37 Author: Jan Hubicka Date: Tue Sep 3 18:20:34 2024 +0200 Zen5 tuning part 4: update reassocation width Zen5 has 6 instead of 4 ALUs and the integer multiplicati

[gcc r13-9062] Add AMD znver5 processor enablement with scheduler model

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:499afa8e6899d8e866bbd1e6cc340e5a52557883 commit r13-9062-g499afa8e6899d8e866bbd1e6cc340e5a52557883 Author: Jan Hubicka Date: Mon Mar 18 10:22:44 2024 +0100 Add AMD znver5 processor enablement with scheduler model 2024-02-14 Jan Hubicka Kart

[gcc r13-9063] Fixup unaligned load/store cost for znver5

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:22f4e4a5043fa69c54b1010d04bcd77958646189 commit r13-9063-g22f4e4a5043fa69c54b1010d04bcd77958646189 Author: Richard Biener Date: Tue Jul 16 10:45:27 2024 +0200 Fixup unaligned load/store cost for znver5 Currently unaligned YMM and ZMM load and store costs are

[gcc r14-10719] Zen5 tuning part 3: scheduler tweaks

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2c01292411044adbd67f79355c1e24decd2fd3c0 commit r14-10719-g2c01292411044adbd67f79355c1e24decd2fd3c0 Author: Jan Hubicka Date: Tue Sep 3 16:26:16 2024 +0200 Zen5 tuning part 3: scheduler tweaks this patch adds support for new fussion in znver5 documented in th

[gcc r14-10720] Zen5 tuning part 3: fix typo in previous patch

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2eade72b0e2ac9dd18ef517bc3b868157f1ddf48 commit r14-10720-g2eade72b0e2ac9dd18ef517bc3b868157f1ddf48 Author: Jan Hubicka Date: Tue Sep 3 17:25:05 2024 +0200 Zen5 tuning part 3: fix typo in previous patch gcc/ChangeLog: * config/i386/x86-tune-s

[gcc r12-10732] Add AMD znver5 processor enablement with scheduler model

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:54806268b47775449c7e237f8f03e922d6da26f6 commit r12-10732-g54806268b47775449c7e237f8f03e922d6da26f6 Author: Jan Hubicka Date: Mon Mar 18 10:22:44 2024 +0100 Add AMD znver5 processor enablement with scheduler model 2024-02-14 Jan Hubicka Kar

[gcc r12-10733] Fixup unaligned load/store cost for znver5

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:c77b1c833e84b62928a729556c502e1311782b2d commit r12-10733-gc77b1c833e84b62928a729556c502e1311782b2d Author: Richard Biener Date: Tue Jul 16 10:45:27 2024 +0200 Fixup unaligned load/store cost for znver5 Currently unaligned YMM and ZMM load and store costs are

[gcc r13-9065] Re-add m_ZNVER4 to X86_TUNE_AVOID_256FMA_CHAINS

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:ad9ba1eccec5086b84f1030fb3e87947242ba904 commit r13-9065-gad9ba1eccec5086b84f1030fb3e87947242ba904 Author: Jan Hubicka Date: Sun Sep 29 02:10:14 2024 +0200 Re-add m_ZNVER4 to X86_TUNE_AVOID_256FMA_CHAINS * config/i386/x86-tune.def (X86_TUNE_AVOID_256F

[gcc r13-9064] Zen5 tuning part 1: avoid FMA chains

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:7c0c772e4fb89bf4d9bc09f7d8e41c6bc0b0e093 commit r13-9064-g7c0c772e4fb89bf4d9bc09f7d8e41c6bc0b0e093 Author: Jan Hubicka Date: Tue Sep 3 13:38:33 2024 +0200 Zen5 tuning part 1: avoid FMA chains testing matrix multiplication benchmarks shows that FMA on a critic

[gcc r14-10716] x86: Don't use address override with segment regsiter

2024-09-28 Thread H.J. Lu via Gcc-cvs
https://gcc.gnu.org/g:25cb153f93bb9ff3543ba8e31bbe7be4f6168aa4 commit r14-10716-g25cb153f93bb9ff3543ba8e31bbe7be4f6168aa4 Author: H.J. Lu Date: Wed Sep 25 16:39:04 2024 +0800 x86: Don't use address override with segment regsiter Address override only applies to the (reg32) part in

[gcc r12-10731] x86: Don't use address override with segment regsiter

2024-09-28 Thread H.J. Lu via Gcc-cvs
https://gcc.gnu.org/g:2e66eb7e7eae82bcd6675e79eabbdd6decfa9fe5 commit r12-10731-g2e66eb7e7eae82bcd6675e79eabbdd6decfa9fe5 Author: H.J. Lu Date: Wed Sep 25 16:39:04 2024 +0800 x86: Don't use address override with segment regsiter Address override only applies to the (reg32) part in

[gcc r13-9060] x86: Don't use address override with segment regsiter

2024-09-28 Thread H.J. Lu via Gcc-cvs
https://gcc.gnu.org/g:bf5d8d44f7a8f90a2ebfe3f28689bc3d86e185fb commit r13-9060-gbf5d8d44f7a8f90a2ebfe3f28689bc3d86e185fb Author: H.J. Lu Date: Wed Sep 25 16:39:04 2024 +0800 x86: Don't use address override with segment regsiter Address override only applies to the (reg32) part in