[gcc r16-1190] Fix typo in afdo_calculate_branch_prob

2025-06-05 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:6fb88ff622ca2d12229870e9b0b3781bb930d2ad commit r16-1190-g6fb88ff622ca2d12229870e9b0b3781bb930d2ad Author: Jan Hubicka Date: Thu Jun 5 18:10:16 2025 +0200 Fix typo in afdo_calculate_branch_prob gcc/ChangeLog: * auto-profile.cc (afdo_calculate

[gcc r16-1183] Fix handling of GUESSED_LOCAL in auto-fdo and preserve more static profile

2025-06-05 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:9399a3b5f512af6dbbe05be076e726efc91413f1 commit r16-1183-g9399a3b5f512af6dbbe05be076e726efc91413f1 Author: Jan Hubicka Date: Thu Jun 5 15:24:36 2025 +0200 Fix handling of GUESSED_LOCAL in auto-fdo and preserve more static profile This patch fixes ICE where GU

[gcc r16-953] i386: Use Shuffles instead of shifts for Reduction in AMD znver4/5

2025-05-29 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:5080d98a383de244a7b78ae50456fd41881268c2 commit r16-953-g5080d98a383de244a7b78ae50456fd41881268c2 Author: Pranav Gorantla Date: Thu May 29 15:02:24 2025 +0200 i386: Use Shuffles instead of shifts for Reduction in AMD znver4/5 In AMD znver4, znver5 targets vps

[gcc r16-941] Set znver5 addss cost to 2 again

2025-05-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:8859a06fc8143561dd94ef2342234dbc5ccfd937 commit r16-941-g8859a06fc8143561dd94ef2342234dbc5ccfd937 Author: Jan Hubicka Date: Wed May 28 23:43:51 2025 +0200 Set znver5 addss cost to 2 again since uses of addss for other purposes then modelling FP addition/subt

[gcc r16-930] Handle auto-fdo 0 more carefully

2025-05-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:17f7b6250628c31182fd4f71c9ecdeca9568ffd1 commit r16-930-g17f7b6250628c31182fd4f71c9ecdeca9568ffd1 Author: Jan Hubicka Date: Wed May 28 14:26:11 2025 +0200 Handle auto-fdo 0 more carefully This patch fixes few other places where auto-fdo 0 should be be treated

[gcc r16-929] Do not recompute profile when entry block has afdo count of 0

2025-05-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:a4dc4001999a8b504cde468618af12c9b870589b commit r16-929-ga4dc4001999a8b504cde468618af12c9b870589b Author: Jan Hubicka Date: Wed May 28 14:18:39 2025 +0200 Do not recompute profile when entry block has afdo count of 0 With normal profile feedback checking entr

[gcc r16-927] Fix profile_probability quality of switch

2025-05-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:8058e1f8857d580698c29590273e72c61de2c121 commit r16-927-g8058e1f8857d580698c29590273e72c61de2c121 Author: Jan Hubicka Date: Wed May 28 12:23:48 2025 +0200 Fix profile_probability quality of switch This fixes ages old bug I noticed only now where switch cases,

[gcc r16-926] Do not erase static profile by 0 autofdo profile

2025-05-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:aa1b47aa41af2a6a094035bd446504cf352c4d71 commit r16-926-gaa1b47aa41af2a6a094035bd446504cf352c4d71 Author: Jan Hubicka Date: Wed May 28 12:15:32 2025 +0200 Do not erase static profile by 0 autofdo profile This patch makes auto-fdo more careful about keeping in

[gcc r16-912] Enable afdo testing on AMD Zen3+

2025-05-27 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:7b76965df10c47616c8f65e1c1134356d635f9ed commit r16-912-g7b76965df10c47616c8f65e1c1134356d635f9ed Author: Jan Hubicka Date: Tue May 27 19:53:50 2025 +0200 Enable afdo testing on AMD Zen3+ contrib/ChangeLog: * gen_autofdo_event.py: Add support

[gcc r16-911] Remove dead code in auto-profile.cc

2025-05-27 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:fbe6dea1a1b8cad114aeec07c6f8966300417cd9 commit r16-911-gfbe6dea1a1b8cad114aeec07c6f8966300417cd9 Author: Jan Hubicka Date: Tue May 27 19:14:21 2025 +0200 Remove dead code in auto-profile.cc This code to track what locations were used when reading auto-fdo pr

[gcc r16-869] Make i386 construcotr vectorizer costs more realistics

2025-05-25 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:e3d3d6d7d2c8ab73ff597f4c82514c3217256567 commit r16-869-ge3d3d6d7d2c8ab73ff597f4c82514c3217256567 Author: Jan Hubicka Date: Sun May 25 14:33:17 2025 +0200 Make i386 construcotr vectorizer costs more realistics this patch attempts to make vectorizer costs of v

[gcc r16-531] i386: Fix move costs in vectorizer cost model.

2025-05-11 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:37e61c793c1b22bdcfbf142cd6086da2745be596 commit r16-531-g37e61c793c1b22bdcfbf142cd6086da2745be596 Author: Jan Hubicka Date: Sun May 11 23:49:11 2025 +0200 i386: Fix move costs in vectorizer cost model. This patch complements the change to stv and uses COSTS_N

[gcc r16-517] i386: Fix some problems in stv cost model

2025-05-10 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:993aa0bd28722c7f01fb8310f1c79814aef217ed commit r16-517-g993aa0bd28722c7f01fb8310f1c79814aef217ed Author: Jan Hubicka Date: Sat May 10 22:23:48 2025 +0200 i386: Fix some problems in stv cost model this patch fixes some of problems with cosint in scalar to vec

[gcc r16-447] i386: implement costs for float<->int conversions in ix86_vector_costs::add_stmt_cost

2025-05-07 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2c8d632d9ed4e3aeee2156ba17fe631ecbc90dbf commit r16-447-g2c8d632d9ed4e3aeee2156ba17fe631ecbc90dbf Author: Jan Hubicka Date: Wed May 7 15:33:44 2025 +0200 i386: implement costs for float<->int conversions in ix86_vector_costs::add_stmt_cost This patch adds pa

[gcc r16-429] i386: Add costs for integer<->float conversions

2025-05-06 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:62ffaef514aad0eae5d3bc548d8e86d21a3095e7 commit r16-429-g62ffaef514aad0eae5d3bc548d8e86d21a3095e7 Author: Jan Hubicka Date: Wed May 7 00:24:04 2025 +0200 i386: Add costs for integer<->float conversions Extend ix86_rtx_costs to cost FLOAT, UNSIGNED_FLOAT, FIX,

[gcc r16-401] Fix i386 bootstrap on non-Windows platforms

2025-05-06 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:673d446894c063c92cafce9ba41340c82e960a11 commit r16-401-g673d446894c063c92cafce9ba41340c82e960a11 Author: Jan Hubicka Date: Tue May 6 12:07:15 2025 +0200 Fix i386 bootstrap on non-Windows platforms * config/i386/i386.cc (ix86_tls_index): Add ifdef. D

[gcc r16-372] Improve maybe_hot handling in inliner heuristics

2025-05-04 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:064cac730f88dc71c6da578f9ae5b8e092ab6cd4 commit r16-372-g064cac730f88dc71c6da578f9ae5b8e092ab6cd4 Author: Jan Hubicka Date: Sun May 4 10:52:35 2025 +0200 Improve maybe_hot handling in inliner heuristics Inliner currently applies different heuristics to hot an

[gcc r16-367] Improve ix86 VEC_MERGE costs

2025-05-03 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:20d184e3f84d859e7e9f44a8d91772a02b658872 commit r16-367-g20d184e3f84d859e7e9f44a8d91772a02b658872 Author: Jan Hubicka Date: Sat May 3 00:26:29 2025 +0200 Improve ix86 VEC_MERGE costs ix86_rtx_costs VEC_MERGE by special casing AVX512 mask operations and other

[gcc r16-358] Make ix86 cost of VEC_SELECT equivalent to SUBREG cost 1

2025-05-02 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:c85148d036d17295bb2560e10020c924c83a5d13 commit r16-358-gc85148d036d17295bb2560e10020c924c83a5d13 Author: Jan Hubicka Date: Fri May 2 15:53:35 2025 +0200 Make ix86 cost of VEC_SELECT equivalent to SUBREG cost 1 This patch fixes regression of imagick with PGO

[gcc r16-291] Fix cs_interesting_for_ipcp_p wrt flag_profile_partial_training.

2025-04-29 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:1efd525e828d5c30d0962d05ba81f9d7c8176ca7 commit r16-291-g1efd525e828d5c30d0962d05ba81f9d7c8176ca7 Author: Jan Hubicka Date: Tue Apr 29 22:43:45 2025 +0200 Fix cs_interesting_for_ipcp_p wrt flag_profile_partial_training. As noticed by Martin Jambor, I introduc

[gcc r16-162] Fix i386 vectorizer cost of FP scalar MAX_EXPR and MIN_EXPR

2025-04-26 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:1d635e79b3c2d26f864964b79717132bffbcad20 commit r16-162-g1d635e79b3c2d26f864964b79717132bffbcad20 Author: Jan Hubicka Date: Sat Apr 26 22:10:19 2025 +0200 Fix i386 vectorizer cost of FP scalar MAX_EXPR and MIN_EXPR I introduced a bug by last minute cleanups u

[gcc r16-117] Fix i386 vectorizer cost of COND_EXPR and MIN_MAX with one of parameters 0 or -1

2025-04-24 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2056d52d74070f50c5f8a22e4a600fcc3974fd88 commit r16-117-g2056d52d74070f50c5f8a22e4a600fcc3974fd88 Author: Jan Hubicka Date: Thu Apr 24 18:37:55 2025 +0200 Fix i386 vectorizer cost of COND_EXPR and MIN_MAX with one of parameters 0 or -1 gcc/ChangeLog:

[gcc r16-116] Fix ICE building deepsjeng with -fprofile-use

2025-04-24 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:cfb04e0de6aa438df9d8b83a3d8c7f93789b5c9f commit r16-116-gcfb04e0de6aa438df9d8b83a3d8c7f93789b5c9f Author: Jan Hubicka Date: Thu Apr 24 18:35:54 2025 +0200 Fix ICE building deepsjeng with -fprofile-use The problem here is division by zero, since adjusted 0 > p

[gcc r16-101] Enable ip-cp cloning over non-hot edges

2025-04-23 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:132d01d96ea9d617aaffdd5dfba3284a8958e529 commit r16-101-g132d01d96ea9d617aaffdd5dfba3284a8958e529 Author: Jan Hubicka Date: Wed Apr 23 18:39:14 2025 +0200 Enable ip-cp cloning over non-hot edges Currently enabling profile feedback regresses x264 and exchange.

[gcc r16-100] Cost truth_value exprs in i386 vectorizer costs.

2025-04-23 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:9b9d605d68cf27a24e8ed9d4f1ead1f00131cec1 commit r16-100-g9b9d605d68cf27a24e8ed9d4f1ead1f00131cec1 Author: Jan Hubicka Date: Wed Apr 23 17:04:32 2025 +0200 Cost truth_value exprs in i386 vectorizer costs. this patch implements costing of truth_value exprs. I.

[gcc r16-89] Fix vectorizer costs of COND_EXPR, MIN_EXPR, MAX_EXPR, ABS_EXPR, ABSU_EXPR

2025-04-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:0650ea627399a0ef23db434d4fce6b52b9faf557 commit r16-89-g0650ea627399a0ef23db434d4fce6b52b9faf557 Author: Jan Hubicka Date: Tue Apr 22 23:47:14 2025 +0200 Fix vectorizer costs of COND_EXPR, MIN_EXPR, MAX_EXPR, ABS_EXPR, ABSU_EXPR this patch adds special cases

[gcc r16-54] Fix cost of vectorized double->float conversion

2025-04-21 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:0907a810f586b07636cc5b83dba6025eb5240655 commit r16-54-g0907a810f586b07636cc5b83dba6025eb5240655 Author: Jan Hubicka Date: Mon Apr 21 20:16:50 2025 +0200 Fix cost of vectorized double->float conversion In previous patch I miscomputed costs of cvtpd2pf instruc

[gcc r16-39] Add tables for SSE fp conversion costs

2025-04-19 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:f6859fb621179ec9bf5631eb8902619ab8d4467b commit r16-39-gf6859fb621179ec9bf5631eb8902619ab8d4467b Author: Jan Hubicka Date: Sat Apr 19 18:51:27 2025 +0200 Add tables for SSE fp conversion costs as disucssed, I will proceed adding costs for common SSE operation

[gcc r15-9522] Stream ipa_return_value_summary

2025-04-16 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:eabba7be040e81690332070873d59d23e8c93e11 commit r15-9522-geabba7be040e81690332070873d59d23e8c93e11 Author: Jan Hubicka Date: Wed Apr 16 15:28:32 2025 +0200 Stream ipa_return_value_summary Add streaming of return summaries from compile time to ltrans which

[gcc r15-9496] Set znver5 issue rate to 4.

2025-04-15 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:4a01869b963520d689fe9242cf2ff24984ea41d8 commit r15-9496-g4a01869b963520d689fe9242cf2ff24984ea41d8 Author: Jan Hubicka Date: Tue Apr 15 19:09:20 2025 +0200 Set znver5 issue rate to 4. this patch sets issue rate of znver5 to 4. With current model, unless a r

[gcc r15-9495] Set ADDSS cost to 3 for znver5

2025-04-15 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:e2011ab13de3e70774f869b356f5f9c750780b34 commit r15-9495-ge2011ab13de3e70774f869b356f5f9c750780b34 Author: Jan Hubicka Date: Tue Apr 15 19:04:15 2025 +0200 Set ADDSS cost to 3 for znver5 Znver5 has latency of addss 2 in typical case while all earlier versions

[gcc r15-9176] Fix costs of x86 move instructions at -Os

2025-04-03 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:564e4e0819022925dd160e455ee44baf0fda5805 commit r15-9176-g564e4e0819022925dd160e455ee44baf0fda5805 Author: Jan Hubicka Date: Thu Apr 3 13:06:07 2025 +0200 Fix costs of x86 move instructions at -Os This patch fixes problem with size costs declaring all moves t

[gcc r15-9047] Optimize string constructor

2025-03-30 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:9c5505a35d9d71705464f9254f55407192d31ec3 commit r15-9047-g9c5505a35d9d71705464f9254f55407192d31ec3 Author: Jan Hubicka Date: Sun Mar 30 23:49:49 2025 +0200 Optimize string constructor this patch improves code generation on string constructors. We currently

[gcc r15-8041] Fix speculation_useful_p

2025-03-13 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:57dbbdd8e34b80926e06b352b6c442c555b303ed commit r15-8041-g57dbbdd8e34b80926e06b352b6c442c555b303ed Author: Jan Hubicka Date: Thu Mar 13 20:11:02 2025 +0100 Fix speculation_useful_p This patch fixes issue with speculation and x264. With profile feedback w

[gcc r15-7813] Break false dependency chain on Zen5

2025-03-04 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:8c4a00f9a48f1b2af10448c9f2058b44b8cb7234 commit r15-7813-g8c4a00f9a48f1b2af10448c9f2058b44b8cb7234 Author: Jan Hubicka Date: Tue Mar 4 16:22:01 2025 +0100 Break false dependency chain on Zen5 Zen5 on some variants has false dependency on tzcnt, blsi, blsr and

[gcc r15-7811] Make ix86_macro_fusion_pair_p and ix86_fuse_mov_alu_p match current CPUs

2025-03-04 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:c84be624e079cd748df93a3dc0b5168865fefee9 commit r15-7811-gc84be624e079cd748df93a3dc0b5168865fefee9 Author: Jan Hubicka Date: Mon Mar 3 19:12:20 2025 +0100 Make ix86_macro_fusion_pair_p and ix86_fuse_mov_alu_p match current CPUs The current implementation of f

[gcc r12-10923] Zen5 tuning part 2: disable gather and scatter

2025-01-21 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:e909afe8a8a2924dd6ced6bdf7d8e397f14310b5 commit r12-10923-ge909afe8a8a2924dd6ced6bdf7d8e397f14310b5 Author: Jan Hubicka Date: Tue Sep 3 15:07:41 2024 +0200 Zen5 tuning part 2: disable gather and scatter We disable gathers for zen4. It seems that gather has i

[gcc r12-10901] Zen5 tuning part 5: update instruction latencies in x86-tune-costs

2025-01-14 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:3911b6366ee49dffe2f16578093b49664b3a2d72 commit r12-10901-g3911b6366ee49dffe2f16578093b49664b3a2d72 Author: Jan Hubicka Date: Wed Sep 4 09:19:08 2024 +0200 Zen5 tuning part 5: update instruction latencies in x86-tune-costs there is nothing exciting in this pa

[gcc r13-9309] Zen5 tuning part 5: update instruction latencies in x86-tune-costs

2025-01-12 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:f10d381dfc983ea32e5f72faadc7eb8126f114f6 commit r13-9309-gf10d381dfc983ea32e5f72faadc7eb8126f114f6 Author: Jan Hubicka Date: Wed Sep 4 09:19:08 2024 +0200 Zen5 tuning part 5: update instruction latencies in x86-tune-costs there is nothing exciting in this pat

[gcc r12-10888] Zen5 tuning part 4: update reassocation width

2025-01-07 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:4d7efc031fbd925565b049670bf755aca21bd2e3 commit r12-10888-g4d7efc031fbd925565b049670bf755aca21bd2e3 Author: Jan Hubicka Date: Tue Sep 3 18:20:34 2024 +0200 Zen5 tuning part 4: update reassocation width Zen5 has 6 instead of 4 ALUs and the integer multiplicati

[gcc r13-9285] Zen5 tuning part 4: update reassocation width

2025-01-07 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:52507e15aa31bc66e99f2273306f1b45be919bba commit r13-9285-g52507e15aa31bc66e99f2273306f1b45be919bba Author: Jan Hubicka Date: Tue Sep 3 18:20:34 2024 +0200 Zen5 tuning part 4: update reassocation width Zen5 has 6 instead of 4 ALUs and the integer multiplicatio

[gcc r12-10887] Zen5 tuning part 3: scheduler tweaks

2025-01-07 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:f0718f1d7815c7845243a182c66f4a454efbfb72 commit r12-10887-gf0718f1d7815c7845243a182c66f4a454efbfb72 Author: Jan Hubicka Date: Tue Sep 3 16:26:16 2024 +0200 Zen5 tuning part 3: scheduler tweaks this patch adds support for new fussion in znver5 documented in th

[gcc r13-9284] Zen5 tuning part 3: scheduler tweaks

2025-01-07 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:7392e9e480afe3143e72a99f7b5ac99b2f49c284 commit r13-9284-g7392e9e480afe3143e72a99f7b5ac99b2f49c284 Author: Jan Hubicka Date: Tue Sep 3 16:26:16 2024 +0200 Zen5 tuning part 3: scheduler tweaks this patch adds support for new fussion in znver5 documented in the

[gcc r15-5694] improve std::deque::_M_reallocate_map

2024-11-26 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:1729147f2b1a05ad2a4e8e2d91d1d2ba914039d6 commit r15-5694-g1729147f2b1a05ad2a4e8e2d91d1d2ba914039d6 Author: Jan Hubicka Date: Tue Nov 26 13:52:09 2024 +0100 improve std::deque::_M_reallocate_map Looking into reason why we still do throw_bad_alloc in clang bina

[gcc r15-5365] Mark asm statements as necessary in ipa-fnsummary

2024-11-17 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:bd59f2eeacd41b91e4e79b32dda83cc60d499e25 commit r15-5365-gbd59f2eeacd41b91e4e79b32dda83cc60d499e25 Author: Jan Hubicka Date: Sun Nov 17 15:48:29 2024 +0100 Mark asm statements as necessary in ipa-fnsummary I forgot to mark asm statements as necessary in ipa-f

[gcc r15-5362] ipa-modref bits for unsequenced and reproducible

2024-11-17 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:addf02282026cc23d24fc4445b47b408d484d1e7 commit r15-5362-gaddf02282026cc23d24fc4445b47b408d484d1e7 Author: Jan Hubicka Date: Sun Nov 17 11:54:10 2024 +0100 ipa-modref bits for unsequenced and reproducible C attributes reproducible and unsequenced implies that

[gcc r15-5359] Avoid expicit builtion list in tree-ssa-dce

2024-11-17 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:cc33f880e553d1aa94d19a349ad755f34c33de9e commit r15-5359-gcc33f880e553d1aa94d19a349ad755f34c33de9e Author: Jan Hubicka Date: Sat Nov 16 23:45:57 2024 +0100 Avoid expicit builtion list in tree-ssa-dce while working on -fmalloc-dce I noticed that tree-ssa-dce.c

[gcc r15-5358] Minor cleanup to cxx_init_decl_processing

2024-11-16 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:101f8c73d575b4746e49b0ea35eb6cc24de1dfdc commit r15-5358-g101f8c73d575b4746e49b0ea35eb6cc24de1dfdc Author: Jan Hubicka Date: Sat Nov 16 23:44:13 2024 +0100 Minor cleanup to cxx_init_decl_processing gcc/cp/ChangeLog: * decl.cc (cxx_build_opera

[gcc r15-5336] Ignore conditions guarding __builtin_unreachable in inliner metrics

2024-11-16 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:cee7d080d5c2a5fb8125878998b742c040ec88b4 commit r15-5336-gcee7d080d5c2a5fb8125878998b742c040ec88b4 Author: Jan Hubicka Date: Sat Nov 16 14:04:32 2024 +0100 Ignore conditions guarding __builtin_unreachable in inliner metrics This extends my last year attempt t

[gcc r15-5322] Fix type of malloc parameter in trans-expr.cc

2024-11-15 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:d5af5657fa4173dd8d3155e44fe6a1e3914cde8f commit r15-5322-gd5af5657fa4173dd8d3155e44fe6a1e3914cde8f Author: Jan Hubicka Date: Fri Nov 15 15:51:14 2024 +0100 Fix type of malloc parameter in trans-expr.cc gcc/fortran/ChangeLog: * trans-expr.cc (

[gcc r15-5281] Fix common.opt.urls

2024-11-14 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:236104908257d779b06894379f99e007a1c4c897 commit r15-5281-g236104908257d779b06894379f99e007a1c4c897 Author: Jan Hubicka Date: Thu Nov 14 17:29:14 2024 +0100 Fix common.opt.urls gcc/ChangeLog: * common.opt.urls: Fix. Diff: --- gcc/common.opt.

[gcc r15-5256] New testcase for operator new/delete removal.

2024-11-14 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:269b6477744f25ab0a89b364295b86a4bce392f9 commit r15-5256-g269b6477744f25ab0a89b364295b86a4bce392f9 Author: Jan Hubicka Date: Thu Nov 14 17:08:03 2024 +0100 New testcase for operator new/delete removal. * g++.dg/tree-ssa/dce-1.C: New test. Diff: ---

[gcc r15-5255] Remove allocations which are used only for NULL pointer check and free

2024-11-14 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:7828dc070510f8f5c06765858815fa6e5d2d1ec6 commit r15-5255-g7828dc070510f8f5c06765858815fa6e5d2d1ec6 Author: Jan Hubicka Date: Thu Nov 14 17:01:12 2024 +0100 Remove allocations which are used only for NULL pointer check and free Extend tree-ssa-dse to remove me

[gcc r15-5238] Add testcases for std::vector optimization

2024-11-14 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:20faea418d580d4db99aa5293c976ab16867aca5 commit r15-5238-g20faea418d580d4db99aa5293c976ab16867aca5 Author: Jan Hubicka Date: Thu Nov 14 14:19:04 2024 +0100 Add testcases for std::vector optimization gcc/testsuite/ChangeLog: PR tree-optimizati

[gcc r15-5176] Verify that empty std::vector is optimized away

2024-11-12 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2264b68796aa6f1a609987cf7edb9183bf070e7e commit r15-5176-g2264b68796aa6f1a609987cf7edb9183bf070e7e Author: Jan Hubicka Date: Tue Nov 12 15:58:02 2024 +0100 Verify that empty std::vector is optimized away With __builtin_operator_new we now can optimize away un

[gcc r12-10736] Zen5 tuning part 1: avoid FMA chains

2024-09-30 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:be6334fffdf2a7df3b7f92ea933b804664dfc383 commit r12-10736-gbe6334fffdf2a7df3b7f92ea933b804664dfc383 Author: Jan Hubicka Date: Tue Sep 3 13:38:33 2024 +0200 Zen5 tuning part 1: avoid FMA chains testing matrix multiplication benchmarks shows that FMA on a criti

[gcc r13-9068] Zen5 tuning part 2: disable gather and scatter

2024-09-30 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:456719b5c0705a6c2065fc261f41d0c2a30f3045 commit r13-9068-g456719b5c0705a6c2065fc261f41d0c2a30f3045 Author: Jan Hubicka Date: Tue Sep 3 15:07:41 2024 +0200 Zen5 tuning part 2: disable gather and scatter We disable gathers for zen4. It seems that gather has im

[gcc r13-9065] Re-add m_ZNVER4 to X86_TUNE_AVOID_256FMA_CHAINS

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:ad9ba1eccec5086b84f1030fb3e87947242ba904 commit r13-9065-gad9ba1eccec5086b84f1030fb3e87947242ba904 Author: Jan Hubicka Date: Sun Sep 29 02:10:14 2024 +0200 Re-add m_ZNVER4 to X86_TUNE_AVOID_256FMA_CHAINS * config/i386/x86-tune.def (X86_TUNE_AVOID_256F

[gcc r13-9064] Zen5 tuning part 1: avoid FMA chains

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:7c0c772e4fb89bf4d9bc09f7d8e41c6bc0b0e093 commit r13-9064-g7c0c772e4fb89bf4d9bc09f7d8e41c6bc0b0e093 Author: Jan Hubicka Date: Tue Sep 3 13:38:33 2024 +0200 Zen5 tuning part 1: avoid FMA chains testing matrix multiplication benchmarks shows that FMA on a critic

[gcc r12-10733] Fixup unaligned load/store cost for znver5

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:c77b1c833e84b62928a729556c502e1311782b2d commit r12-10733-gc77b1c833e84b62928a729556c502e1311782b2d Author: Richard Biener Date: Tue Jul 16 10:45:27 2024 +0200 Fixup unaligned load/store cost for znver5 Currently unaligned YMM and ZMM load and store costs are

[gcc r12-10732] Add AMD znver5 processor enablement with scheduler model

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:54806268b47775449c7e237f8f03e922d6da26f6 commit r12-10732-g54806268b47775449c7e237f8f03e922d6da26f6 Author: Jan Hubicka Date: Mon Mar 18 10:22:44 2024 +0100 Add AMD znver5 processor enablement with scheduler model 2024-02-14 Jan Hubicka Kar

[gcc r14-10721] Zen5 tuning part 4: update reassocation width

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:b17cb7ed709ea7250eaa4ddc4a713ebbb6b94b37 commit r14-10721-gb17cb7ed709ea7250eaa4ddc4a713ebbb6b94b37 Author: Jan Hubicka Date: Tue Sep 3 18:20:34 2024 +0200 Zen5 tuning part 4: update reassocation width Zen5 has 6 instead of 4 ALUs and the integer multiplicati

[gcc r14-10720] Zen5 tuning part 3: fix typo in previous patch

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2eade72b0e2ac9dd18ef517bc3b868157f1ddf48 commit r14-10720-g2eade72b0e2ac9dd18ef517bc3b868157f1ddf48 Author: Jan Hubicka Date: Tue Sep 3 17:25:05 2024 +0200 Zen5 tuning part 3: fix typo in previous patch gcc/ChangeLog: * config/i386/x86-tune-s

[gcc r14-10719] Zen5 tuning part 3: scheduler tweaks

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2c01292411044adbd67f79355c1e24decd2fd3c0 commit r14-10719-g2c01292411044adbd67f79355c1e24decd2fd3c0 Author: Jan Hubicka Date: Tue Sep 3 16:26:16 2024 +0200 Zen5 tuning part 3: scheduler tweaks this patch adds support for new fussion in znver5 documented in th

[gcc r13-9063] Fixup unaligned load/store cost for znver5

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:22f4e4a5043fa69c54b1010d04bcd77958646189 commit r13-9063-g22f4e4a5043fa69c54b1010d04bcd77958646189 Author: Richard Biener Date: Tue Jul 16 10:45:27 2024 +0200 Fixup unaligned load/store cost for znver5 Currently unaligned YMM and ZMM load and store costs are

[gcc r13-9062] Add AMD znver5 processor enablement with scheduler model

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:499afa8e6899d8e866bbd1e6cc340e5a52557883 commit r13-9062-g499afa8e6899d8e866bbd1e6cc340e5a52557883 Author: Jan Hubicka Date: Mon Mar 18 10:22:44 2024 +0100 Add AMD znver5 processor enablement with scheduler model 2024-02-14 Jan Hubicka Kart

[gcc r14-10718] Zen5 tuning part 2: disable gather and scatter

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:3d0a91130eceaf428387ba314cfdfceb99b51709 commit r14-10718-g3d0a91130eceaf428387ba314cfdfceb99b51709 Author: Jan Hubicka Date: Tue Sep 3 15:07:41 2024 +0200 Zen5 tuning part 2: disable gather and scatter We disable gathers for zen4. It seems that gather has i

[gcc r13-9061] Reduce recursive inlining of always_inline functions

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:2532944e3588cf69bce019eaf03de9c63b78568f commit r13-9061-g2532944e3588cf69bce019eaf03de9c63b78568f Author: Jan Hubicka Date: Tue May 14 12:58:56 2024 +0200 Reduce recursive inlining of always_inline functions this patch tames down inliner on (mutiply) self-re

[gcc r14-10717] Zen5 tuning part 1: avoid FMA chains

2024-09-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:fce2fe0406aa66c5d6f6465984a6af9ccc63370d commit r14-10717-gfce2fe0406aa66c5d6f6465984a6af9ccc63370d Author: Jan Hubicka Date: Tue Sep 3 13:38:33 2024 +0200 Zen5 tuning part 1: avoid FMA chains testing matrix multiplication benchmarks shows that FMA on a criti

[gcc r15-3441] Zen5 tuning part 5: update instruction latencies in x86-tune-costs

2024-09-04 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:4292297a0f938ffc953422fa246ff00fe345fe3d commit r15-3441-g4292297a0f938ffc953422fa246ff00fe345fe3d Author: Jan Hubicka Date: Wed Sep 4 09:19:08 2024 +0200 Zen5 tuning part 5: update instruction latencies in x86-tune-costs there is nothing exciting in this pat

[gcc r15-3427] Zen5 tuning part 4: update reassocation width

2024-09-03 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:f0ab3de6ec0e3540f2e57f3f5628005f0a4e3fa5 commit r15-3427-gf0ab3de6ec0e3540f2e57f3f5628005f0a4e3fa5 Author: Jan Hubicka Date: Tue Sep 3 18:20:34 2024 +0200 Zen5 tuning part 4: update reassocation width Zen5 has 6 instead of 4 ALUs and the integer multiplicatio

[gcc r15-3425] Zen5 tuning part 3: fix typo in previous patch

2024-09-03 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:910e1769a0653ac32bd8c1d6aabb39c797d5d773 commit r15-3425-g910e1769a0653ac32bd8c1d6aabb39c797d5d773 Author: Jan Hubicka Date: Tue Sep 3 17:25:05 2024 +0200 Zen5 tuning part 3: fix typo in previous patch gcc/ChangeLog: * config/i386/x86-tune-sc

[gcc r15-3420] Zen5 tuning part 3: scheduler tweaks

2024-09-03 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:e2125a600552bc6e0329e3f1224eea14804db8d3 commit r15-3420-ge2125a600552bc6e0329e3f1224eea14804db8d3 Author: Jan Hubicka Date: Tue Sep 3 16:26:16 2024 +0200 Zen5 tuning part 3: scheduler tweaks this patch adds support for new fussion in znver5 documented in the

[gcc r15-3417] Zen5 tuning part 2: disable gather and scatter

2024-09-03 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:d82edbe92eed53a479736fcbbe6d54d0fb42daa4 commit r15-3417-gd82edbe92eed53a479736fcbbe6d54d0fb42daa4 Author: Jan Hubicka Date: Tue Sep 3 15:07:41 2024 +0200 Zen5 tuning part 2: disable gather and scatter We disable gathers for zen4. It seems that gather has im

[gcc r15-3414] Zen5 tuning part 1: avoid FMA chains

2024-09-03 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:d6360b4083695970789fd65b9c515c11a5ce25b4 commit r15-3414-gd6360b4083695970789fd65b9c515c11a5ce25b4 Author: Jan Hubicka Date: Tue Sep 3 13:38:33 2024 +0200 Zen5 tuning part 1: avoid FMA chains testing matrix multiplication benchmarks shows that FMA on a critic

[gcc/devel/nothrow-detection] Fix handling of types

2024-08-27 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:c16d4a0ae162abc00d97bb73e598ca00d16cf555 commit c16d4a0ae162abc00d97bb73e598ca00d16cf555 Author: Jan Hubicka Date: Tue Aug 27 13:50:32 2024 +0200 Fix handling of types * ipa-devirt.cc (odr_equivalent_or_derived_p): New. * ipa-utils.h (odr_

[gcc r15-2376] Fix ICE with -fdump-tree-moref

2024-07-29 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:b3176b620ff29a06c90992ca3d29f3cffd459537 commit r15-2376-gb3176b620ff29a06c90992ca3d29f3cffd459537 Author: Jan Hubicka Date: Mon Jul 29 10:49:49 2024 +0200 Fix ICE with -fdump-tree-moref gcc/ChangeLog: PR ipa/116055 * ipa-modref.c

[gcc r14-10515] Fix ICE with -fdump-tree-moref

2024-07-29 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:98baaa17561ca299eefc98f469f4326e551604c9 commit r14-10515-g98baaa17561ca299eefc98f469f4326e551604c9 Author: Jan Hubicka Date: Mon Jul 29 10:48:34 2024 +0200 Fix ICE with -fdump-tree-moref gcc/ChangeLog: PR ipa/116055 * ipa-modref.

[gcc r15-2207] Fix handling of ICF_NOVOPS in ipa-modref

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:efcbe7b985e24ac002a863afd609c44a67761195 commit r15-2207-gefcbe7b985e24ac002a863afd609c44a67761195 Author: Jan Hubicka Date: Mon Jul 22 23:01:50 2024 +0200 Fix handling of ICF_NOVOPS in ipa-modref As shown in somewhat convoluted testcase, ipa-modref is mistre

[gcc r14-10495] Fix modref's iteraction with store merging

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:9ddd5f88e60972147dff74b48658e2b12040d468 commit r14-10495-g9ddd5f88e60972147dff74b48658e2b12040d468 Author: Jan Hubicka Date: Mon Jul 22 19:00:39 2024 +0200 Fix modref's iteraction with store merging Hi, this patch fixes wrong code in case store-merging i

[gcc r15-2205] Fix modref's iteraction with store merging

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:14074773350ffed7efdebbc553adf0f23b572e87 commit r15-2205-g14074773350ffed7efdebbc553adf0f23b572e87 Author: Jan Hubicka Date: Mon Jul 22 19:00:39 2024 +0200 Fix modref's iteraction with store merging Hi, this patch fixes wrong code in case store-merging in

[gcc r14-10489] Fix modref_eaf_analysis::analyze_ssa_name handling of values dereferenced to function call parameter

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:27ef3a0779e551ca116c56c431436c8d2191b253 commit r14-10489-g27ef3a0779e551ca116c56c431436c8d2191b253 Author: Jan Hubicka Date: Mon Jul 22 18:08:08 2024 +0200 Fix modref_eaf_analysis::analyze_ssa_name handling of values dereferenced to function call parameters

[gcc r14-10488] Fix accounting of offsets in unadjusted_ptr_and_unit_offset

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:f2e98084792821c3849074867d5b007c49028854 commit r14-10488-gf2e98084792821c3849074867d5b007c49028854 Author: Jan Hubicka Date: Mon Jul 22 18:05:26 2024 +0200 Fix accounting of offsets in unadjusted_ptr_and_unit_offset unadjusted_ptr_and_unit_offset accidentall

[gcc r14-10487] Compare loop bounds in ipa-icf

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:c5397d343ff1365fcebcf3ebabe140608874aac3 commit r14-10487-gc5397d343ff1365fcebcf3ebabe140608874aac3 Author: Jan Hubicka Date: Mon Jul 22 18:01:57 2024 +0200 Compare loop bounds in ipa-icf Hi, this testcase shows another poblem with missing comparators for

[gcc r14-10486] Reduce recursive inlining of always_inline functions

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:9a7d668fc58f817027ec7f9fa7e20a6dce08bddb commit r14-10486-g9a7d668fc58f817027ec7f9fa7e20a6dce08bddb Author: Jan Hubicka Date: Tue May 14 12:58:56 2024 +0200 Reduce recursive inlining of always_inline functions this patch tames down inliner on (mutiply) self-r

[gcc r15-2202] Fix modref_eaf_analysis::analyze_ssa_name handling of values dereferenced to function call parameter

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:cf8ffc58aad3127031c229a75cc4b99c8ace25e0 commit r15-2202-gcf8ffc58aad3127031c229a75cc4b99c8ace25e0 Author: Jan Hubicka Date: Mon Jul 22 18:08:08 2024 +0200 Fix modref_eaf_analysis::analyze_ssa_name handling of values dereferenced to function call parameters

[gcc r15-2201] Fix accounting of offsets in unadjusted_ptr_and_unit_offset

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:391f46f10b0586c074014de82efe76787739bb0c commit r15-2201-g391f46f10b0586c074014de82efe76787739bb0c Author: Jan Hubicka Date: Mon Jul 22 18:05:26 2024 +0200 Fix accounting of offsets in unadjusted_ptr_and_unit_offset unadjusted_ptr_and_unit_offset accidentally

[gcc r15-2200] Compare loop bounds in ipa-icf

2024-07-22 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:0d19fbc7b0760ce665fa6a88cd40cfa0311358d7 commit r15-2200-g0d19fbc7b0760ce665fa6a88cd40cfa0311358d7 Author: Jan Hubicka Date: Mon Jul 22 18:01:57 2024 +0200 Compare loop bounds in ipa-icf Hi, this testcase shows another poblem with missing comparators for

[gcc r15-581] Fix points_to_local_or_readonly_memory_p wrt TARGET_MEM_REF

2024-05-16 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:96d53252aefcbc2fe419c4c3b4bcd3fc03d4d187 commit r15-581-g96d53252aefcbc2fe419c4c3b4bcd3fc03d4d187 Author: Jan Hubicka Date: Thu May 16 15:33:55 2024 +0200 Fix points_to_local_or_readonly_memory_p wrt TARGET_MEM_REF TARGET_MEM_REF can be used to offset constan

[gcc r15-512] Avoid pointer compares on TYPE_MAIN_VARIANT in TBAA

2024-05-15 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:9b7cad5884f21cc5783075be0043777448db3fab commit r15-512-g9b7cad5884f21cc5783075be0043777448db3fab Author: Jan Hubicka Date: Wed May 15 14:14:27 2024 +0200 Avoid pointer compares on TYPE_MAIN_VARIANT in TBAA while building more testcases for ipa-icf I noticed

[gcc r15-482] Reduce recursive inlining of always_inline functions

2024-05-14 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:1ec49897253e093e1ef6261eb104ac0c111bac83 commit r15-482-g1ec49897253e093e1ef6261eb104ac0c111bac83 Author: Jan Hubicka Date: Tue May 14 12:58:56 2024 +0200 Reduce recursive inlining of always_inline functions this patch tames down inliner on (mutiply) self-rec

[gcc r14-10093] Remove repeated information in -ftree-loop-distribute-patterns doc

2024-04-23 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:6f0a646dd2fc59e9c9cde63718b36085f84a19ba commit r14-10093-g6f0a646dd2fc59e9c9cde63718b36085f84a19ba Author: Jan Hubicka Date: Tue Apr 23 15:51:42 2024 +0200 Remove repeated information in -ftree-loop-distribute-patterns doc We have: -ftree-loo

[gcc r14-9705] Hash operands of PHI in ipa-icf

2024-03-28 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:0923fe2d4808c16b72c1d1bfe28220dd326d8b76 commit r14-9705-g0923fe2d4808c16b72c1d1bfe28220dd326d8b76 Author: Jan Hubicka Date: Thu Mar 28 13:24:54 2024 +0100 Hash operands of PHI in ipa-icf This patch fixes cache colision on function whose body differs only by

[gcc r14-9516] Add missing config/i386/zn4zn5.md file

2024-03-18 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:dfc9d1cc8353bdd7fbc37bc10bb3fd40f49fa4af commit r14-9516-gdfc9d1cc8353bdd7fbc37bc10bb3fd40f49fa4af Author: Jan Hubicka Date: Mon Mar 18 14:24:10 2024 +0100 Add missing config/i386/zn4zn5.md file gcc/ChangeLog: * config/i386/zn4zn5.md: Add fil

[gcc r14-9515] Add AMD znver5 processor enablement with scheduler model

2024-03-18 Thread Jan Hubicka via Gcc-cvs
https://gcc.gnu.org/g:d0aa0af9a9b7dd709a8c7ff6604ed6b7da0fc23a commit r14-9515-gd0aa0af9a9b7dd709a8c7ff6604ed6b7da0fc23a Author: Jan Hubicka Date: Mon Mar 18 10:22:44 2024 +0100 Add AMD znver5 processor enablement with scheduler model 2024-02-14 Jan Hubicka Kart