[gcc r15-4566] Fix ICE due to isa mismatch for the builtins.

2024-10-23 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:403e361d5aa620e77c9832578b2409a0fdd79d96 commit r15-4566-g403e361d5aa620e77c9832578b2409a0fdd79d96 Author: liuhongt Date: Tue Oct 22 01:54:40 2024 -0700 Fix ICE due to isa mismatch for the builtins. gcc/ChangeLog: PR target/117240

[gcc r14-10852] Fix ICE due to subreg:us_truncate.

2024-10-30 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:71a0cf699b6a2dc03abec53aeafab8b70db2bb07 commit r14-10852-g71a0cf699b6a2dc03abec53aeafab8b70db2bb07 Author: liuhongt Date: Tue Oct 29 02:09:39 2024 -0700 Fix ICE due to subreg:us_truncate. Force_operand issues an ICE when input is (subreg:DI (us_truncate:

[gcc r15-4775] Fix ICE due to subreg:us_truncate.

2024-10-30 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:bc0eeccf27a084461a2d5661e23468350acb43da commit r15-4775-gbc0eeccf27a084461a2d5661e23468350acb43da Author: liuhongt Date: Tue Oct 29 02:09:39 2024 -0700 Fix ICE due to subreg:us_truncate. Force_operand issues an ICE when input is (subreg:DI (us_truncate:V

[gcc r15-4954] Support vector float_truncate for SF to BF.

2024-11-05 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:a17acf4f25f0ce9b8dce24f25867500a3b093b57 commit r15-4954-ga17acf4f25f0ce9b8dce24f25867500a3b093b57 Author: liuhongt Date: Wed Oct 23 00:51:00 2024 -0700 Support vector float_truncate for SF to BF. Generate native instruction whenever possible, otherwise use v

[gcc r15-4955] Support vector float_extend from __bf16 to float.

2024-11-05 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:648bd1fcc6acfc56e08f4ad8146a80910cfacfd7 commit r15-4955-g648bd1fcc6acfc56e08f4ad8146a80910cfacfd7 Author: liuhongt Date: Wed Oct 23 23:51:20 2024 -0700 Support vector float_extend from __bf16 to float. It's supported by vector permutation with zero vector.

[gcc r12-10793] Fix ICE due to subreg:us_truncate.

2024-10-30 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:d0a932fb53ccdf5155db90632901c55446b8 commit r12-10793-gd0a932fb53ccdf5155db90632901c55446b8 Author: liuhongt Date: Tue Oct 29 02:09:39 2024 -0700 Fix ICE due to subreg:us_truncate. Force_operand issues an ICE when input is (subreg:DI (us_truncate:

[gcc r13-9157] Fix ICE due to subreg:us_truncate.

2024-10-30 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:28ea5a4ec3e9e49439fdb912ef4edeebfdae881d commit r13-9157-g28ea5a4ec3e9e49439fdb912ef4edeebfdae881d Author: liuhongt Date: Tue Oct 29 02:09:39 2024 -0700 Fix ICE due to subreg:us_truncate. Force_operand issues an ICE when input is (subreg:DI (us_truncate:V

[gcc r15-5071] Guard truncate from vector float to vector __bf16 with !flag_rounding_math && HONOR_NANS (BFmode).

2024-11-10 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:de867e8da30bf5e0cb51c3946ec43c3c4778d4a0 commit r15-5071-gde867e8da30bf5e0cb51c3946ec43c3c4778d4a0 Author: liuhongt Date: Wed Nov 6 18:15:42 2024 -0800 Guard truncate from vector float to vector __bf16 with !flag_rounding_math && HONOR_NANS (BFmode). hw inst

[gcc r12-10784] Fix ICE due to isa mismatch for the builtins.

2024-10-23 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:ab84a8a4b78990942e006e9f060dc2705f2c6d8f commit r12-10784-gab84a8a4b78990942e006e9f060dc2705f2c6d8f Author: liuhongt Date: Tue Oct 22 01:54:40 2024 -0700 Fix ICE due to isa mismatch for the builtins. gcc/ChangeLog: PR target/117240

[gcc r15-3885] Define VECTOR_STORE_FLAG_VALUE

2024-09-25 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:78eef8919e2f2973ed7750ba66f5726e70614d07 commit r15-3885-g78eef8919e2f2973ed7750ba66f5726e70614d07 Author: liuhongt Date: Mon Sep 23 11:06:04 2024 +0800 Define VECTOR_STORE_FLAG_VALUE gcc/ChangeLog: * config/i386/i386.h (VECTOR_STORE_FLAG_VAL

[gcc r15-4371] Adjust testcase to avoid scan FIX in REG_EQUIV.

2024-10-15 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:a8b4ea1bcc10b5253992f4b932aec6862aef32fa commit r15-4371-ga8b4ea1bcc10b5253992f4b932aec6862aef32fa Author: liuhongt Date: Tue Oct 15 11:17:20 2024 +0800 Adjust testcase to avoid scan FIX in REG_EQUIV. Also add hard_float target to avoid failed on arm-eabi.

[gcc r15-4400] Don't lower vpcmpu to pcmpgt since the latter is for signed comparison.

2024-10-16 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:21e2cd65add9070292313f8e12e8731d0aa2c869 commit r15-4400-g21e2cd65add9070292313f8e12e8731d0aa2c869 Author: liuhongt Date: Tue Oct 8 16:18:31 2024 +0800 Don't lower vpcmpu to pcmpgt since the latter is for signed comparison. r15-1737-gb06a108f0fbffe lower AVX5

[gcc r15-4399] Canonicalize (vec_merge (fma: op2 op1 op3) (match_dup 1)) mask) to (vec_merge (fma: op1 op2 op3) (ma

2024-10-16 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:edf4db8355dead3413bad64f6a89bae82dabd0ad commit r15-4399-gedf4db8355dead3413bad64f6a89bae82dabd0ad Author: liuhongt Date: Mon Oct 14 13:09:59 2024 +0800 Canonicalize (vec_merge (fma: op2 op1 op3) (match_dup 1)) mask) to (vec_merge (fma: op1 op2 op3) (match_dup 1)) ma

[gcc r15-4398] Canonicalize (vec_merge (fma op2 op1 op3) op1 mask) to (vec_merge (fma op1 op2 op3) op1 mask).

2024-10-16 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:330782a1b6cfe881ad884617ffab441aeb1c2b5c commit r15-4398-g330782a1b6cfe881ad884617ffab441aeb1c2b5c Author: liuhongt Date: Mon Oct 14 17:16:13 2024 +0800 Canonicalize (vec_merge (fma op2 op1 op3) op1 mask) to (vec_merge (fma op1 op2 op3) op1 mask). For x86 ma

[gcc r15-5639] Fix uninitialized operands[2] in vec_unpacks_hi_v4sf.

2024-11-24 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:ba4cf2e296d8d5950c3d356fa6b6efcad00d0189 commit r15-5639-gba4cf2e296d8d5950c3d356fa6b6efcad00d0189 Author: liuhongt Date: Thu Nov 21 23:57:38 2024 -0800 Fix uninitialized operands[2] in vec_unpacks_hi_v4sf. It could cause weired spill in RA when register pres

[gcc r14-10979] Fix uninitialized operands[2] in vec_unpacks_hi_v4sf.

2024-11-25 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:4a63cc6de77481878ec31e1e6ac30e22c50b063a commit r14-10979-g4a63cc6de77481878ec31e1e6ac30e22c50b063a Author: liuhongt Date: Thu Nov 21 23:57:38 2024 -0800 Fix uninitialized operands[2] in vec_unpacks_hi_v4sf. It could cause weired spill in RA when register pre

[gcc r15-5489] Add microarchtecture tunable for pass_align_tight_loops [PR117438]

2024-11-19 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:6350e956d1a74963a62bedabef3d4a1a3f2d4852 commit r15-5489-g6350e956d1a74963a62bedabef3d4a1a3f2d4852 Author: MayShao-oc Date: Thu Nov 7 10:57:02 2024 +0800 Add microarchtecture tunable for pass_align_tight_loops [PR117438] Hi Hongtao: Add m_CASCADELAK, a

[gcc r13-9216] Fix uninitialized operands[2] in vec_unpacks_hi_v4sf.

2024-11-25 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:0eb8c19cb45fc004b7039fa22ff9021604d80dbc commit r13-9216-g0eb8c19cb45fc004b7039fa22ff9021604d80dbc Author: liuhongt Date: Thu Nov 21 23:57:38 2024 -0800 Fix uninitialized operands[2] in vec_unpacks_hi_v4sf. It could cause weired spill in RA when register pres

[gcc r12-10832] Fix uninitialized operands[2] in vec_unpacks_hi_v4sf.

2024-11-25 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:89a27cf6b1354cc80d834d71f7a3aa137d605e94 commit r12-10832-g89a27cf6b1354cc80d834d71f7a3aa137d605e94 Author: liuhongt Date: Thu Nov 21 23:57:38 2024 -0800 Fix uninitialized operands[2] in vec_unpacks_hi_v4sf. It could cause weired spill in RA when register pre

[gcc r15-6097] Fix inaccuracy in cunroll/cunrolli when considering what's innermost loop.

2024-12-10 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:ee2f19b0937b5efc0b23c4319cbd4a38b27eac6e commit r15-6097-gee2f19b0937b5efc0b23c4319cbd4a38b27eac6e Author: liuhongt Date: Mon Dec 2 01:54:59 2024 -0800 Fix inaccuracy in cunroll/cunrolli when considering what's innermost loop. r15-919-gef27b91b62c3aa removed

[gcc r15-6844] Refactor ix86_expand_vecop_qihi2.

2025-01-12 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:0e05b793fba2a9bea9f0fbb1f068679f5dadf514 commit r15-6844-g0e05b793fba2a9bea9f0fbb1f068679f5dadf514 Author: liuhongt Date: Wed Jan 8 23:11:17 2025 -0800 Refactor ix86_expand_vecop_qihi2. Since there's regression to use vpermq, and it's manually disabled by

[gcc r15-6940] Fix typo to avoid ICE.

2025-01-16 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:3872daa5767622d1f8b086050996c85604db7514 commit r15-6940-g3872daa5767622d1f8b086050996c85604db7514 Author: liuhongt Date: Wed Jan 15 19:09:24 2025 -0800 Fix typo to avoid ICE. gcc/ChangeLog: PR target/118489 * config/i386/sse.md (

[gcc r15-8283] Mark gcc.target/i386/apx-ndd-tls-1b.c as xfail.

2025-03-18 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:be671ec1f30ecd55aaff09048afb2a619018cb8a commit r15-8283-gbe671ec1f30ecd55aaff09048afb2a619018cb8a Author: liuhongt Date: Sun Mar 16 22:28:44 2025 -0700 Mark gcc.target/i386/apx-ndd-tls-1b.c as xfail. It looks like the testcase is fragile, it's supposed to ch

[gcc r15-8461] Use ix86_fp_comparison_operator in cbranchbf4 to avoid ICE.

2025-03-19 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:62a6cafd7f55c6e88a9780b91039257572038535 commit r15-8461-g62a6cafd7f55c6e88a9780b91039257572038535 Author: liuhongt Date: Mon Mar 17 22:47:11 2025 -0700 Use ix86_fp_comparison_operator in cbranchbf4 to avoid ICE. *jcc only supports ix86_fp_comparison_operator

[gcc r15-9473] Revert documents from r11-344-g0fec3f62b9bfc0

2025-04-14 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:fa58ff249a0e63a721ccb6d770c86523d84a212a commit r15-9473-gfa58ff249a0e63a721ccb6d770c86523d84a212a Author: liuhongt Date: Sun Apr 13 19:40:51 2025 -0700 Revert documents from r11-344-g0fec3f62b9bfc0 gcc/ChangeLog: PR target/108134

[gcc r16-46] Generate 2 FMA instructions in ix86_expand_swdivsf.

2025-04-20 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:e1098c7b08d9e6018f60dae7a14c5ad621618223 commit r16-46-ge1098c7b08d9e6018f60dae7a14c5ad621618223 Author: hongtao.liu Date: Thu Apr 17 09:07:55 2025 +0200 Generate 2 FMA instructions in ix86_expand_swdivsf. When FMA is available, N-R step can be rewritten with

[gcc r16-91] Accept allones or 0 operand for vcond_mask op1.

2025-04-22 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:f72a2d221539cede358f2487b94bc370c6fc44b5 commit r16-91-gf72a2d221539cede358f2487b94bc370c6fc44b5 Author: liuhongt Date: Sun Mar 30 20:15:41 2025 -0700 Accept allones or 0 operand for vcond_mask op1. Since ix86_expand_sse_movcc will simplify them into a simple

[gcc r16-105] target: [PR103750] Also handle avx512 kmask & immediate 15 or 3 when VF is 4/2.

2025-04-23 Thread hongtao Liu via Gcc-cvs
https://gcc.gnu.org/g:599bca27dc37b3f7979bd6af30a357104f2b90c1 commit r16-105-g599bca27dc37b3f7979bd6af30a357104f2b90c1 Author: liuhongt Date: Mon Apr 7 23:50:53 2025 -0700 target: [PR103750] Also handle avx512 kmask & immediate 15 or 3 when VF is 4/2. Since the upper bits are al

<    1   2