https://gcc.gnu.org/g:6fb88ff622ca2d12229870e9b0b3781bb930d2ad
commit r16-1190-g6fb88ff622ca2d12229870e9b0b3781bb930d2ad
Author: Jan Hubicka
Date: Thu Jun 5 18:10:16 2025 +0200
Fix typo in afdo_calculate_branch_prob
gcc/ChangeLog:
* auto-profile.cc (afdo_calculate
https://gcc.gnu.org/g:9399a3b5f512af6dbbe05be076e726efc91413f1
commit r16-1183-g9399a3b5f512af6dbbe05be076e726efc91413f1
Author: Jan Hubicka
Date: Thu Jun 5 15:24:36 2025 +0200
Fix handling of GUESSED_LOCAL in auto-fdo and preserve more static profile
This patch fixes ICE where GU
https://gcc.gnu.org/g:5080d98a383de244a7b78ae50456fd41881268c2
commit r16-953-g5080d98a383de244a7b78ae50456fd41881268c2
Author: Pranav Gorantla
Date: Thu May 29 15:02:24 2025 +0200
i386: Use Shuffles instead of shifts for Reduction in AMD znver4/5
In AMD znver4, znver5 targets vps
https://gcc.gnu.org/g:8859a06fc8143561dd94ef2342234dbc5ccfd937
commit r16-941-g8859a06fc8143561dd94ef2342234dbc5ccfd937
Author: Jan Hubicka
Date: Wed May 28 23:43:51 2025 +0200
Set znver5 addss cost to 2 again
since uses of addss for other purposes then modelling FP
addition/subt
https://gcc.gnu.org/g:17f7b6250628c31182fd4f71c9ecdeca9568ffd1
commit r16-930-g17f7b6250628c31182fd4f71c9ecdeca9568ffd1
Author: Jan Hubicka
Date: Wed May 28 14:26:11 2025 +0200
Handle auto-fdo 0 more carefully
This patch fixes few other places where auto-fdo 0 should be be treated
https://gcc.gnu.org/g:a4dc4001999a8b504cde468618af12c9b870589b
commit r16-929-ga4dc4001999a8b504cde468618af12c9b870589b
Author: Jan Hubicka
Date: Wed May 28 14:18:39 2025 +0200
Do not recompute profile when entry block has afdo count of 0
With normal profile feedback checking entr
https://gcc.gnu.org/g:8058e1f8857d580698c29590273e72c61de2c121
commit r16-927-g8058e1f8857d580698c29590273e72c61de2c121
Author: Jan Hubicka
Date: Wed May 28 12:23:48 2025 +0200
Fix profile_probability quality of switch
This fixes ages old bug I noticed only now where switch cases,
https://gcc.gnu.org/g:aa1b47aa41af2a6a094035bd446504cf352c4d71
commit r16-926-gaa1b47aa41af2a6a094035bd446504cf352c4d71
Author: Jan Hubicka
Date: Wed May 28 12:15:32 2025 +0200
Do not erase static profile by 0 autofdo profile
This patch makes auto-fdo more careful about keeping in
https://gcc.gnu.org/g:7b76965df10c47616c8f65e1c1134356d635f9ed
commit r16-912-g7b76965df10c47616c8f65e1c1134356d635f9ed
Author: Jan Hubicka
Date: Tue May 27 19:53:50 2025 +0200
Enable afdo testing on AMD Zen3+
contrib/ChangeLog:
* gen_autofdo_event.py: Add support
https://gcc.gnu.org/g:fbe6dea1a1b8cad114aeec07c6f8966300417cd9
commit r16-911-gfbe6dea1a1b8cad114aeec07c6f8966300417cd9
Author: Jan Hubicka
Date: Tue May 27 19:14:21 2025 +0200
Remove dead code in auto-profile.cc
This code to track what locations were used when reading auto-fdo pr
https://gcc.gnu.org/g:e3d3d6d7d2c8ab73ff597f4c82514c3217256567
commit r16-869-ge3d3d6d7d2c8ab73ff597f4c82514c3217256567
Author: Jan Hubicka
Date: Sun May 25 14:33:17 2025 +0200
Make i386 construcotr vectorizer costs more realistics
this patch attempts to make vectorizer costs of v
https://gcc.gnu.org/g:37e61c793c1b22bdcfbf142cd6086da2745be596
commit r16-531-g37e61c793c1b22bdcfbf142cd6086da2745be596
Author: Jan Hubicka
Date: Sun May 11 23:49:11 2025 +0200
i386: Fix move costs in vectorizer cost model.
This patch complements the change to stv and uses COSTS_N
https://gcc.gnu.org/g:993aa0bd28722c7f01fb8310f1c79814aef217ed
commit r16-517-g993aa0bd28722c7f01fb8310f1c79814aef217ed
Author: Jan Hubicka
Date: Sat May 10 22:23:48 2025 +0200
i386: Fix some problems in stv cost model
this patch fixes some of problems with cosint in scalar to vec
https://gcc.gnu.org/g:2c8d632d9ed4e3aeee2156ba17fe631ecbc90dbf
commit r16-447-g2c8d632d9ed4e3aeee2156ba17fe631ecbc90dbf
Author: Jan Hubicka
Date: Wed May 7 15:33:44 2025 +0200
i386: implement costs for float<->int conversions in
ix86_vector_costs::add_stmt_cost
This patch adds pa
https://gcc.gnu.org/g:62ffaef514aad0eae5d3bc548d8e86d21a3095e7
commit r16-429-g62ffaef514aad0eae5d3bc548d8e86d21a3095e7
Author: Jan Hubicka
Date: Wed May 7 00:24:04 2025 +0200
i386: Add costs for integer<->float conversions
Extend ix86_rtx_costs to cost FLOAT, UNSIGNED_FLOAT, FIX,
https://gcc.gnu.org/g:673d446894c063c92cafce9ba41340c82e960a11
commit r16-401-g673d446894c063c92cafce9ba41340c82e960a11
Author: Jan Hubicka
Date: Tue May 6 12:07:15 2025 +0200
Fix i386 bootstrap on non-Windows platforms
* config/i386/i386.cc (ix86_tls_index): Add ifdef.
D
https://gcc.gnu.org/g:064cac730f88dc71c6da578f9ae5b8e092ab6cd4
commit r16-372-g064cac730f88dc71c6da578f9ae5b8e092ab6cd4
Author: Jan Hubicka
Date: Sun May 4 10:52:35 2025 +0200
Improve maybe_hot handling in inliner heuristics
Inliner currently applies different heuristics to hot an
https://gcc.gnu.org/g:20d184e3f84d859e7e9f44a8d91772a02b658872
commit r16-367-g20d184e3f84d859e7e9f44a8d91772a02b658872
Author: Jan Hubicka
Date: Sat May 3 00:26:29 2025 +0200
Improve ix86 VEC_MERGE costs
ix86_rtx_costs VEC_MERGE by special casing AVX512 mask operations and
other
https://gcc.gnu.org/g:c85148d036d17295bb2560e10020c924c83a5d13
commit r16-358-gc85148d036d17295bb2560e10020c924c83a5d13
Author: Jan Hubicka
Date: Fri May 2 15:53:35 2025 +0200
Make ix86 cost of VEC_SELECT equivalent to SUBREG cost 1
This patch fixes regression of imagick with PGO
https://gcc.gnu.org/g:1efd525e828d5c30d0962d05ba81f9d7c8176ca7
commit r16-291-g1efd525e828d5c30d0962d05ba81f9d7c8176ca7
Author: Jan Hubicka
Date: Tue Apr 29 22:43:45 2025 +0200
Fix cs_interesting_for_ipcp_p wrt flag_profile_partial_training.
As noticed by Martin Jambor, I introduc
https://gcc.gnu.org/g:1d635e79b3c2d26f864964b79717132bffbcad20
commit r16-162-g1d635e79b3c2d26f864964b79717132bffbcad20
Author: Jan Hubicka
Date: Sat Apr 26 22:10:19 2025 +0200
Fix i386 vectorizer cost of FP scalar MAX_EXPR and MIN_EXPR
I introduced a bug by last minute cleanups u
https://gcc.gnu.org/g:2056d52d74070f50c5f8a22e4a600fcc3974fd88
commit r16-117-g2056d52d74070f50c5f8a22e4a600fcc3974fd88
Author: Jan Hubicka
Date: Thu Apr 24 18:37:55 2025 +0200
Fix i386 vectorizer cost of COND_EXPR and MIN_MAX with one of parameters 0
or -1
gcc/ChangeLog:
https://gcc.gnu.org/g:cfb04e0de6aa438df9d8b83a3d8c7f93789b5c9f
commit r16-116-gcfb04e0de6aa438df9d8b83a3d8c7f93789b5c9f
Author: Jan Hubicka
Date: Thu Apr 24 18:35:54 2025 +0200
Fix ICE building deepsjeng with -fprofile-use
The problem here is division by zero, since adjusted 0 > p
https://gcc.gnu.org/g:132d01d96ea9d617aaffdd5dfba3284a8958e529
commit r16-101-g132d01d96ea9d617aaffdd5dfba3284a8958e529
Author: Jan Hubicka
Date: Wed Apr 23 18:39:14 2025 +0200
Enable ip-cp cloning over non-hot edges
Currently enabling profile feedback regresses x264 and exchange.
https://gcc.gnu.org/g:9b9d605d68cf27a24e8ed9d4f1ead1f00131cec1
commit r16-100-g9b9d605d68cf27a24e8ed9d4f1ead1f00131cec1
Author: Jan Hubicka
Date: Wed Apr 23 17:04:32 2025 +0200
Cost truth_value exprs in i386 vectorizer costs.
this patch implements costing of truth_value exprs. I.
https://gcc.gnu.org/g:0650ea627399a0ef23db434d4fce6b52b9faf557
commit r16-89-g0650ea627399a0ef23db434d4fce6b52b9faf557
Author: Jan Hubicka
Date: Tue Apr 22 23:47:14 2025 +0200
Fix vectorizer costs of COND_EXPR, MIN_EXPR, MAX_EXPR, ABS_EXPR, ABSU_EXPR
this patch adds special cases
https://gcc.gnu.org/g:0907a810f586b07636cc5b83dba6025eb5240655
commit r16-54-g0907a810f586b07636cc5b83dba6025eb5240655
Author: Jan Hubicka
Date: Mon Apr 21 20:16:50 2025 +0200
Fix cost of vectorized double->float conversion
In previous patch I miscomputed costs of cvtpd2pf instruc
https://gcc.gnu.org/g:f6859fb621179ec9bf5631eb8902619ab8d4467b
commit r16-39-gf6859fb621179ec9bf5631eb8902619ab8d4467b
Author: Jan Hubicka
Date: Sat Apr 19 18:51:27 2025 +0200
Add tables for SSE fp conversion costs
as disucssed, I will proceed adding costs for common SSE operation
https://gcc.gnu.org/g:eabba7be040e81690332070873d59d23e8c93e11
commit r15-9522-geabba7be040e81690332070873d59d23e8c93e11
Author: Jan Hubicka
Date: Wed Apr 16 15:28:32 2025 +0200
Stream ipa_return_value_summary
Add streaming of return summaries from compile time to ltrans
which
https://gcc.gnu.org/g:4a01869b963520d689fe9242cf2ff24984ea41d8
commit r15-9496-g4a01869b963520d689fe9242cf2ff24984ea41d8
Author: Jan Hubicka
Date: Tue Apr 15 19:09:20 2025 +0200
Set znver5 issue rate to 4.
this patch sets issue rate of znver5 to 4. With current model, unless a
r
https://gcc.gnu.org/g:e2011ab13de3e70774f869b356f5f9c750780b34
commit r15-9495-ge2011ab13de3e70774f869b356f5f9c750780b34
Author: Jan Hubicka
Date: Tue Apr 15 19:04:15 2025 +0200
Set ADDSS cost to 3 for znver5
Znver5 has latency of addss 2 in typical case while all earlier versions
https://gcc.gnu.org/g:564e4e0819022925dd160e455ee44baf0fda5805
commit r15-9176-g564e4e0819022925dd160e455ee44baf0fda5805
Author: Jan Hubicka
Date: Thu Apr 3 13:06:07 2025 +0200
Fix costs of x86 move instructions at -Os
This patch fixes problem with size costs declaring all moves t
https://gcc.gnu.org/g:9c5505a35d9d71705464f9254f55407192d31ec3
commit r15-9047-g9c5505a35d9d71705464f9254f55407192d31ec3
Author: Jan Hubicka
Date: Sun Mar 30 23:49:49 2025 +0200
Optimize string constructor
this patch improves code generation on string constructors. We currently
https://gcc.gnu.org/g:57dbbdd8e34b80926e06b352b6c442c555b303ed
commit r15-8041-g57dbbdd8e34b80926e06b352b6c442c555b303ed
Author: Jan Hubicka
Date: Thu Mar 13 20:11:02 2025 +0100
Fix speculation_useful_p
This patch fixes issue with speculation and x264. With profile feedback
w
https://gcc.gnu.org/g:8c4a00f9a48f1b2af10448c9f2058b44b8cb7234
commit r15-7813-g8c4a00f9a48f1b2af10448c9f2058b44b8cb7234
Author: Jan Hubicka
Date: Tue Mar 4 16:22:01 2025 +0100
Break false dependency chain on Zen5
Zen5 on some variants has false dependency on tzcnt, blsi, blsr and
https://gcc.gnu.org/g:c84be624e079cd748df93a3dc0b5168865fefee9
commit r15-7811-gc84be624e079cd748df93a3dc0b5168865fefee9
Author: Jan Hubicka
Date: Mon Mar 3 19:12:20 2025 +0100
Make ix86_macro_fusion_pair_p and ix86_fuse_mov_alu_p match current CPUs
The current implementation of f
https://gcc.gnu.org/g:e909afe8a8a2924dd6ced6bdf7d8e397f14310b5
commit r12-10923-ge909afe8a8a2924dd6ced6bdf7d8e397f14310b5
Author: Jan Hubicka
Date: Tue Sep 3 15:07:41 2024 +0200
Zen5 tuning part 2: disable gather and scatter
We disable gathers for zen4. It seems that gather has i
https://gcc.gnu.org/g:3911b6366ee49dffe2f16578093b49664b3a2d72
commit r12-10901-g3911b6366ee49dffe2f16578093b49664b3a2d72
Author: Jan Hubicka
Date: Wed Sep 4 09:19:08 2024 +0200
Zen5 tuning part 5: update instruction latencies in x86-tune-costs
there is nothing exciting in this pa
https://gcc.gnu.org/g:f10d381dfc983ea32e5f72faadc7eb8126f114f6
commit r13-9309-gf10d381dfc983ea32e5f72faadc7eb8126f114f6
Author: Jan Hubicka
Date: Wed Sep 4 09:19:08 2024 +0200
Zen5 tuning part 5: update instruction latencies in x86-tune-costs
there is nothing exciting in this pat
https://gcc.gnu.org/g:4d7efc031fbd925565b049670bf755aca21bd2e3
commit r12-10888-g4d7efc031fbd925565b049670bf755aca21bd2e3
Author: Jan Hubicka
Date: Tue Sep 3 18:20:34 2024 +0200
Zen5 tuning part 4: update reassocation width
Zen5 has 6 instead of 4 ALUs and the integer multiplicati
https://gcc.gnu.org/g:52507e15aa31bc66e99f2273306f1b45be919bba
commit r13-9285-g52507e15aa31bc66e99f2273306f1b45be919bba
Author: Jan Hubicka
Date: Tue Sep 3 18:20:34 2024 +0200
Zen5 tuning part 4: update reassocation width
Zen5 has 6 instead of 4 ALUs and the integer multiplicatio
https://gcc.gnu.org/g:f0718f1d7815c7845243a182c66f4a454efbfb72
commit r12-10887-gf0718f1d7815c7845243a182c66f4a454efbfb72
Author: Jan Hubicka
Date: Tue Sep 3 16:26:16 2024 +0200
Zen5 tuning part 3: scheduler tweaks
this patch adds support for new fussion in znver5 documented in th
https://gcc.gnu.org/g:7392e9e480afe3143e72a99f7b5ac99b2f49c284
commit r13-9284-g7392e9e480afe3143e72a99f7b5ac99b2f49c284
Author: Jan Hubicka
Date: Tue Sep 3 16:26:16 2024 +0200
Zen5 tuning part 3: scheduler tweaks
this patch adds support for new fussion in znver5 documented in the
https://gcc.gnu.org/g:1729147f2b1a05ad2a4e8e2d91d1d2ba914039d6
commit r15-5694-g1729147f2b1a05ad2a4e8e2d91d1d2ba914039d6
Author: Jan Hubicka
Date: Tue Nov 26 13:52:09 2024 +0100
improve std::deque::_M_reallocate_map
Looking into reason why we still do throw_bad_alloc in clang bina
https://gcc.gnu.org/g:bd59f2eeacd41b91e4e79b32dda83cc60d499e25
commit r15-5365-gbd59f2eeacd41b91e4e79b32dda83cc60d499e25
Author: Jan Hubicka
Date: Sun Nov 17 15:48:29 2024 +0100
Mark asm statements as necessary in ipa-fnsummary
I forgot to mark asm statements as necessary in ipa-f
https://gcc.gnu.org/g:addf02282026cc23d24fc4445b47b408d484d1e7
commit r15-5362-gaddf02282026cc23d24fc4445b47b408d484d1e7
Author: Jan Hubicka
Date: Sun Nov 17 11:54:10 2024 +0100
ipa-modref bits for unsequenced and reproducible
C attributes reproducible and unsequenced implies that
https://gcc.gnu.org/g:cc33f880e553d1aa94d19a349ad755f34c33de9e
commit r15-5359-gcc33f880e553d1aa94d19a349ad755f34c33de9e
Author: Jan Hubicka
Date: Sat Nov 16 23:45:57 2024 +0100
Avoid expicit builtion list in tree-ssa-dce
while working on -fmalloc-dce I noticed that tree-ssa-dce.c
https://gcc.gnu.org/g:101f8c73d575b4746e49b0ea35eb6cc24de1dfdc
commit r15-5358-g101f8c73d575b4746e49b0ea35eb6cc24de1dfdc
Author: Jan Hubicka
Date: Sat Nov 16 23:44:13 2024 +0100
Minor cleanup to cxx_init_decl_processing
gcc/cp/ChangeLog:
* decl.cc (cxx_build_opera
https://gcc.gnu.org/g:cee7d080d5c2a5fb8125878998b742c040ec88b4
commit r15-5336-gcee7d080d5c2a5fb8125878998b742c040ec88b4
Author: Jan Hubicka
Date: Sat Nov 16 14:04:32 2024 +0100
Ignore conditions guarding __builtin_unreachable in inliner metrics
This extends my last year attempt t
https://gcc.gnu.org/g:d5af5657fa4173dd8d3155e44fe6a1e3914cde8f
commit r15-5322-gd5af5657fa4173dd8d3155e44fe6a1e3914cde8f
Author: Jan Hubicka
Date: Fri Nov 15 15:51:14 2024 +0100
Fix type of malloc parameter in trans-expr.cc
gcc/fortran/ChangeLog:
* trans-expr.cc (
https://gcc.gnu.org/g:236104908257d779b06894379f99e007a1c4c897
commit r15-5281-g236104908257d779b06894379f99e007a1c4c897
Author: Jan Hubicka
Date: Thu Nov 14 17:29:14 2024 +0100
Fix common.opt.urls
gcc/ChangeLog:
* common.opt.urls: Fix.
Diff:
---
gcc/common.opt.
https://gcc.gnu.org/g:269b6477744f25ab0a89b364295b86a4bce392f9
commit r15-5256-g269b6477744f25ab0a89b364295b86a4bce392f9
Author: Jan Hubicka
Date: Thu Nov 14 17:08:03 2024 +0100
New testcase for operator new/delete removal.
* g++.dg/tree-ssa/dce-1.C: New test.
Diff:
---
https://gcc.gnu.org/g:7828dc070510f8f5c06765858815fa6e5d2d1ec6
commit r15-5255-g7828dc070510f8f5c06765858815fa6e5d2d1ec6
Author: Jan Hubicka
Date: Thu Nov 14 17:01:12 2024 +0100
Remove allocations which are used only for NULL pointer check and free
Extend tree-ssa-dse to remove me
https://gcc.gnu.org/g:20faea418d580d4db99aa5293c976ab16867aca5
commit r15-5238-g20faea418d580d4db99aa5293c976ab16867aca5
Author: Jan Hubicka
Date: Thu Nov 14 14:19:04 2024 +0100
Add testcases for std::vector optimization
gcc/testsuite/ChangeLog:
PR tree-optimizati
https://gcc.gnu.org/g:2264b68796aa6f1a609987cf7edb9183bf070e7e
commit r15-5176-g2264b68796aa6f1a609987cf7edb9183bf070e7e
Author: Jan Hubicka
Date: Tue Nov 12 15:58:02 2024 +0100
Verify that empty std::vector is optimized away
With __builtin_operator_new we now can optimize away un
https://gcc.gnu.org/g:be6334fffdf2a7df3b7f92ea933b804664dfc383
commit r12-10736-gbe6334fffdf2a7df3b7f92ea933b804664dfc383
Author: Jan Hubicka
Date: Tue Sep 3 13:38:33 2024 +0200
Zen5 tuning part 1: avoid FMA chains
testing matrix multiplication benchmarks shows that FMA on a criti
https://gcc.gnu.org/g:456719b5c0705a6c2065fc261f41d0c2a30f3045
commit r13-9068-g456719b5c0705a6c2065fc261f41d0c2a30f3045
Author: Jan Hubicka
Date: Tue Sep 3 15:07:41 2024 +0200
Zen5 tuning part 2: disable gather and scatter
We disable gathers for zen4. It seems that gather has im
https://gcc.gnu.org/g:ad9ba1eccec5086b84f1030fb3e87947242ba904
commit r13-9065-gad9ba1eccec5086b84f1030fb3e87947242ba904
Author: Jan Hubicka
Date: Sun Sep 29 02:10:14 2024 +0200
Re-add m_ZNVER4 to X86_TUNE_AVOID_256FMA_CHAINS
* config/i386/x86-tune.def (X86_TUNE_AVOID_256F
https://gcc.gnu.org/g:7c0c772e4fb89bf4d9bc09f7d8e41c6bc0b0e093
commit r13-9064-g7c0c772e4fb89bf4d9bc09f7d8e41c6bc0b0e093
Author: Jan Hubicka
Date: Tue Sep 3 13:38:33 2024 +0200
Zen5 tuning part 1: avoid FMA chains
testing matrix multiplication benchmarks shows that FMA on a critic
https://gcc.gnu.org/g:c77b1c833e84b62928a729556c502e1311782b2d
commit r12-10733-gc77b1c833e84b62928a729556c502e1311782b2d
Author: Richard Biener
Date: Tue Jul 16 10:45:27 2024 +0200
Fixup unaligned load/store cost for znver5
Currently unaligned YMM and ZMM load and store costs are
https://gcc.gnu.org/g:54806268b47775449c7e237f8f03e922d6da26f6
commit r12-10732-g54806268b47775449c7e237f8f03e922d6da26f6
Author: Jan Hubicka
Date: Mon Mar 18 10:22:44 2024 +0100
Add AMD znver5 processor enablement with scheduler model
2024-02-14 Jan Hubicka
Kar
https://gcc.gnu.org/g:b17cb7ed709ea7250eaa4ddc4a713ebbb6b94b37
commit r14-10721-gb17cb7ed709ea7250eaa4ddc4a713ebbb6b94b37
Author: Jan Hubicka
Date: Tue Sep 3 18:20:34 2024 +0200
Zen5 tuning part 4: update reassocation width
Zen5 has 6 instead of 4 ALUs and the integer multiplicati
https://gcc.gnu.org/g:2eade72b0e2ac9dd18ef517bc3b868157f1ddf48
commit r14-10720-g2eade72b0e2ac9dd18ef517bc3b868157f1ddf48
Author: Jan Hubicka
Date: Tue Sep 3 17:25:05 2024 +0200
Zen5 tuning part 3: fix typo in previous patch
gcc/ChangeLog:
* config/i386/x86-tune-s
https://gcc.gnu.org/g:2c01292411044adbd67f79355c1e24decd2fd3c0
commit r14-10719-g2c01292411044adbd67f79355c1e24decd2fd3c0
Author: Jan Hubicka
Date: Tue Sep 3 16:26:16 2024 +0200
Zen5 tuning part 3: scheduler tweaks
this patch adds support for new fussion in znver5 documented in th
https://gcc.gnu.org/g:22f4e4a5043fa69c54b1010d04bcd77958646189
commit r13-9063-g22f4e4a5043fa69c54b1010d04bcd77958646189
Author: Richard Biener
Date: Tue Jul 16 10:45:27 2024 +0200
Fixup unaligned load/store cost for znver5
Currently unaligned YMM and ZMM load and store costs are
https://gcc.gnu.org/g:499afa8e6899d8e866bbd1e6cc340e5a52557883
commit r13-9062-g499afa8e6899d8e866bbd1e6cc340e5a52557883
Author: Jan Hubicka
Date: Mon Mar 18 10:22:44 2024 +0100
Add AMD znver5 processor enablement with scheduler model
2024-02-14 Jan Hubicka
Kart
https://gcc.gnu.org/g:3d0a91130eceaf428387ba314cfdfceb99b51709
commit r14-10718-g3d0a91130eceaf428387ba314cfdfceb99b51709
Author: Jan Hubicka
Date: Tue Sep 3 15:07:41 2024 +0200
Zen5 tuning part 2: disable gather and scatter
We disable gathers for zen4. It seems that gather has i
https://gcc.gnu.org/g:2532944e3588cf69bce019eaf03de9c63b78568f
commit r13-9061-g2532944e3588cf69bce019eaf03de9c63b78568f
Author: Jan Hubicka
Date: Tue May 14 12:58:56 2024 +0200
Reduce recursive inlining of always_inline functions
this patch tames down inliner on (mutiply) self-re
https://gcc.gnu.org/g:fce2fe0406aa66c5d6f6465984a6af9ccc63370d
commit r14-10717-gfce2fe0406aa66c5d6f6465984a6af9ccc63370d
Author: Jan Hubicka
Date: Tue Sep 3 13:38:33 2024 +0200
Zen5 tuning part 1: avoid FMA chains
testing matrix multiplication benchmarks shows that FMA on a criti
https://gcc.gnu.org/g:4292297a0f938ffc953422fa246ff00fe345fe3d
commit r15-3441-g4292297a0f938ffc953422fa246ff00fe345fe3d
Author: Jan Hubicka
Date: Wed Sep 4 09:19:08 2024 +0200
Zen5 tuning part 5: update instruction latencies in x86-tune-costs
there is nothing exciting in this pat
https://gcc.gnu.org/g:f0ab3de6ec0e3540f2e57f3f5628005f0a4e3fa5
commit r15-3427-gf0ab3de6ec0e3540f2e57f3f5628005f0a4e3fa5
Author: Jan Hubicka
Date: Tue Sep 3 18:20:34 2024 +0200
Zen5 tuning part 4: update reassocation width
Zen5 has 6 instead of 4 ALUs and the integer multiplicatio
https://gcc.gnu.org/g:910e1769a0653ac32bd8c1d6aabb39c797d5d773
commit r15-3425-g910e1769a0653ac32bd8c1d6aabb39c797d5d773
Author: Jan Hubicka
Date: Tue Sep 3 17:25:05 2024 +0200
Zen5 tuning part 3: fix typo in previous patch
gcc/ChangeLog:
* config/i386/x86-tune-sc
https://gcc.gnu.org/g:e2125a600552bc6e0329e3f1224eea14804db8d3
commit r15-3420-ge2125a600552bc6e0329e3f1224eea14804db8d3
Author: Jan Hubicka
Date: Tue Sep 3 16:26:16 2024 +0200
Zen5 tuning part 3: scheduler tweaks
this patch adds support for new fussion in znver5 documented in the
https://gcc.gnu.org/g:d82edbe92eed53a479736fcbbe6d54d0fb42daa4
commit r15-3417-gd82edbe92eed53a479736fcbbe6d54d0fb42daa4
Author: Jan Hubicka
Date: Tue Sep 3 15:07:41 2024 +0200
Zen5 tuning part 2: disable gather and scatter
We disable gathers for zen4. It seems that gather has im
https://gcc.gnu.org/g:d6360b4083695970789fd65b9c515c11a5ce25b4
commit r15-3414-gd6360b4083695970789fd65b9c515c11a5ce25b4
Author: Jan Hubicka
Date: Tue Sep 3 13:38:33 2024 +0200
Zen5 tuning part 1: avoid FMA chains
testing matrix multiplication benchmarks shows that FMA on a critic
https://gcc.gnu.org/g:c16d4a0ae162abc00d97bb73e598ca00d16cf555
commit c16d4a0ae162abc00d97bb73e598ca00d16cf555
Author: Jan Hubicka
Date: Tue Aug 27 13:50:32 2024 +0200
Fix handling of types
* ipa-devirt.cc (odr_equivalent_or_derived_p): New.
* ipa-utils.h (odr_
https://gcc.gnu.org/g:b3176b620ff29a06c90992ca3d29f3cffd459537
commit r15-2376-gb3176b620ff29a06c90992ca3d29f3cffd459537
Author: Jan Hubicka
Date: Mon Jul 29 10:49:49 2024 +0200
Fix ICE with -fdump-tree-moref
gcc/ChangeLog:
PR ipa/116055
* ipa-modref.c
https://gcc.gnu.org/g:98baaa17561ca299eefc98f469f4326e551604c9
commit r14-10515-g98baaa17561ca299eefc98f469f4326e551604c9
Author: Jan Hubicka
Date: Mon Jul 29 10:48:34 2024 +0200
Fix ICE with -fdump-tree-moref
gcc/ChangeLog:
PR ipa/116055
* ipa-modref.
https://gcc.gnu.org/g:efcbe7b985e24ac002a863afd609c44a67761195
commit r15-2207-gefcbe7b985e24ac002a863afd609c44a67761195
Author: Jan Hubicka
Date: Mon Jul 22 23:01:50 2024 +0200
Fix handling of ICF_NOVOPS in ipa-modref
As shown in somewhat convoluted testcase, ipa-modref is mistre
https://gcc.gnu.org/g:9ddd5f88e60972147dff74b48658e2b12040d468
commit r14-10495-g9ddd5f88e60972147dff74b48658e2b12040d468
Author: Jan Hubicka
Date: Mon Jul 22 19:00:39 2024 +0200
Fix modref's iteraction with store merging
Hi,
this patch fixes wrong code in case store-merging i
https://gcc.gnu.org/g:14074773350ffed7efdebbc553adf0f23b572e87
commit r15-2205-g14074773350ffed7efdebbc553adf0f23b572e87
Author: Jan Hubicka
Date: Mon Jul 22 19:00:39 2024 +0200
Fix modref's iteraction with store merging
Hi,
this patch fixes wrong code in case store-merging in
https://gcc.gnu.org/g:27ef3a0779e551ca116c56c431436c8d2191b253
commit r14-10489-g27ef3a0779e551ca116c56c431436c8d2191b253
Author: Jan Hubicka
Date: Mon Jul 22 18:08:08 2024 +0200
Fix modref_eaf_analysis::analyze_ssa_name handling of values dereferenced
to function call parameters
https://gcc.gnu.org/g:f2e98084792821c3849074867d5b007c49028854
commit r14-10488-gf2e98084792821c3849074867d5b007c49028854
Author: Jan Hubicka
Date: Mon Jul 22 18:05:26 2024 +0200
Fix accounting of offsets in unadjusted_ptr_and_unit_offset
unadjusted_ptr_and_unit_offset accidentall
https://gcc.gnu.org/g:c5397d343ff1365fcebcf3ebabe140608874aac3
commit r14-10487-gc5397d343ff1365fcebcf3ebabe140608874aac3
Author: Jan Hubicka
Date: Mon Jul 22 18:01:57 2024 +0200
Compare loop bounds in ipa-icf
Hi,
this testcase shows another poblem with missing comparators for
https://gcc.gnu.org/g:9a7d668fc58f817027ec7f9fa7e20a6dce08bddb
commit r14-10486-g9a7d668fc58f817027ec7f9fa7e20a6dce08bddb
Author: Jan Hubicka
Date: Tue May 14 12:58:56 2024 +0200
Reduce recursive inlining of always_inline functions
this patch tames down inliner on (mutiply) self-r
https://gcc.gnu.org/g:cf8ffc58aad3127031c229a75cc4b99c8ace25e0
commit r15-2202-gcf8ffc58aad3127031c229a75cc4b99c8ace25e0
Author: Jan Hubicka
Date: Mon Jul 22 18:08:08 2024 +0200
Fix modref_eaf_analysis::analyze_ssa_name handling of values dereferenced
to function call parameters
https://gcc.gnu.org/g:391f46f10b0586c074014de82efe76787739bb0c
commit r15-2201-g391f46f10b0586c074014de82efe76787739bb0c
Author: Jan Hubicka
Date: Mon Jul 22 18:05:26 2024 +0200
Fix accounting of offsets in unadjusted_ptr_and_unit_offset
unadjusted_ptr_and_unit_offset accidentally
https://gcc.gnu.org/g:0d19fbc7b0760ce665fa6a88cd40cfa0311358d7
commit r15-2200-g0d19fbc7b0760ce665fa6a88cd40cfa0311358d7
Author: Jan Hubicka
Date: Mon Jul 22 18:01:57 2024 +0200
Compare loop bounds in ipa-icf
Hi,
this testcase shows another poblem with missing comparators for
https://gcc.gnu.org/g:96d53252aefcbc2fe419c4c3b4bcd3fc03d4d187
commit r15-581-g96d53252aefcbc2fe419c4c3b4bcd3fc03d4d187
Author: Jan Hubicka
Date: Thu May 16 15:33:55 2024 +0200
Fix points_to_local_or_readonly_memory_p wrt TARGET_MEM_REF
TARGET_MEM_REF can be used to offset constan
https://gcc.gnu.org/g:9b7cad5884f21cc5783075be0043777448db3fab
commit r15-512-g9b7cad5884f21cc5783075be0043777448db3fab
Author: Jan Hubicka
Date: Wed May 15 14:14:27 2024 +0200
Avoid pointer compares on TYPE_MAIN_VARIANT in TBAA
while building more testcases for ipa-icf I noticed
https://gcc.gnu.org/g:1ec49897253e093e1ef6261eb104ac0c111bac83
commit r15-482-g1ec49897253e093e1ef6261eb104ac0c111bac83
Author: Jan Hubicka
Date: Tue May 14 12:58:56 2024 +0200
Reduce recursive inlining of always_inline functions
this patch tames down inliner on (mutiply) self-rec
https://gcc.gnu.org/g:6f0a646dd2fc59e9c9cde63718b36085f84a19ba
commit r14-10093-g6f0a646dd2fc59e9c9cde63718b36085f84a19ba
Author: Jan Hubicka
Date: Tue Apr 23 15:51:42 2024 +0200
Remove repeated information in -ftree-loop-distribute-patterns doc
We have:
-ftree-loo
https://gcc.gnu.org/g:0923fe2d4808c16b72c1d1bfe28220dd326d8b76
commit r14-9705-g0923fe2d4808c16b72c1d1bfe28220dd326d8b76
Author: Jan Hubicka
Date: Thu Mar 28 13:24:54 2024 +0100
Hash operands of PHI in ipa-icf
This patch fixes cache colision on function whose body differs only by
https://gcc.gnu.org/g:dfc9d1cc8353bdd7fbc37bc10bb3fd40f49fa4af
commit r14-9516-gdfc9d1cc8353bdd7fbc37bc10bb3fd40f49fa4af
Author: Jan Hubicka
Date: Mon Mar 18 14:24:10 2024 +0100
Add missing config/i386/zn4zn5.md file
gcc/ChangeLog:
* config/i386/zn4zn5.md: Add fil
https://gcc.gnu.org/g:d0aa0af9a9b7dd709a8c7ff6604ed6b7da0fc23a
commit r14-9515-gd0aa0af9a9b7dd709a8c7ff6604ed6b7da0fc23a
Author: Jan Hubicka
Date: Mon Mar 18 10:22:44 2024 +0100
Add AMD znver5 processor enablement with scheduler model
2024-02-14 Jan Hubicka
Kart
95 matches
Mail list logo