date:20210428

Re: Recent change breaking Wrestrict8 on various targets

2021-04-28 Thread Richard Biener via Gcc-patches

On Tue, Apr 27, 2021 at 10:22 PM Jeff Law wrote: > > This change: > > d8e1f1d24179690fd9c0f63c27b12e030010d9ea is the first bad commit > commit d8e1f1d24179690fd9c0f63c27b12e030010d9ea > Author: Richard Biener > Date: Wed Apr 7 12:09:44 2021 +0200 > > tree-optimization/99912 - schedule DSE

[PATCH] ipa/100308 - properly update the callgraph when pruning EH in IPA CP

2021-04-28 Thread Richard Biener

This makes sure to fall into the delete_unreachable_blocks_update_callgraph handling to remove blocks becoming unreachable when removing EH edges by open-coding gimple_purge_dead_eh_edges. This fixes an ICE seen with gfortran.dg/gomp/pr88933.f90 when enhancing DSE. Bootstrap & regtest running on

[Ada] Fix loss of optimization of array iteration due to inlining

2021-04-28 Thread Eric Botcazou

This helps loop-invariant motion to hoist complicated offset computations. Tested on x86-64/Linux, applied on the mainline. 2021-04-28 Eric Botcazou * gcc-interface/trans.c (language_function): Add comment. (loop_info_d): Add fndecl and invariants fields. (find_loop_f

[PATCH][v2] ipa/100308 - properly update the callgraph when pruning EH in IPA CP

2021-04-28 Thread Richard Biener

This makes sure to fall into the delete_unreachable_blocks_update_callgraph handling to remove blocks becoming unreachable when removing EH edges by tracking blocks to need EH cleanup and doing that after releasing dominance info. This fixes an ICE seen with gfortran.dg/gomp/pr88933.f90 when enhan

[Ada] Get rid of useless temporary for call to pure function

2021-04-28 Thread Eric Botcazou

This avoids creating a useless temporary for a call to a pure function with good properties by using the RSO. Tested on x86-64/Linux, applied on the mainline. 2021-04-28 Eric Botcazou * gcc-interface/trans.c (is_array_of_scalar_type): New predicate. (find_decls_r): New functi

Re: [PATCH 1/1] PR100281 Fix SImode pointer handling

2021-04-28 Thread Richard Biener via Gcc-patches

On Wed, Apr 28, 2021 at 8:54 AM Andreas Krebbel via Gcc-patches wrote: > > The problem appears to be triggered by two locations in the front-end > where non-POINTER_SIZE pointers aren't handled right now. > > 1. An assertion in strip_typedefs is triggered because the alignment > of the types don't

Re: [PATCH 1/1] PR100281 Fix SImode pointer handling

2021-04-28 Thread Andreas Krebbel via Gcc-patches

On 4/28/21 10:12 AM, Richard Biener wrote: > On Wed, Apr 28, 2021 at 8:54 AM Andreas Krebbel via Gcc-patches > wrote: >> >> The problem appears to be triggered by two locations in the front-end >> where non-POINTER_SIZE pointers aren't handled right now. >> >> 1. An assertion in strip_typedefs is

[Ada] Avoid creating useless local bounds around calls

2021-04-28 Thread Eric Botcazou

This prevents the compiler from creating useless local bounds around calls that take a parameter of an unconstrained array type when the bounds already exist somewhere else for the actual parameter. Tested on x86-64/Linux, applied on the mainline. 2021-04-28 Eric Botcazou * gcc-inter

Re: add ASM_OUTPUT_MAX_SKIP_ALIGN to i386.h

2021-04-28 Thread Uros Bizjak via Gcc-patches

On Tue, Apr 27, 2021 at 5:34 PM Alexandre Oliva wrote: > > > Several i386 align tests expect p2align to be used, but not all > configurations define ASM_OUTPUT_MAX_SKIP_ALIGN, even when > HAVE_GAS_MAX_SKIP_P2ALIGN. > > i386.h has an equivalent ASM_OUTPUT_MAX_SKIP_PAD that is used in > i386.c, so I

Re: [PATCH 1/1] PR100281 Fix SImode pointer handling

2021-04-28 Thread Andreas Krebbel via Gcc-patches

On 4/28/21 10:22 AM, Andreas Krebbel via Gcc-patches wrote: > On 4/28/21 10:12 AM, Richard Biener wrote: >> On Wed, Apr 28, 2021 at 8:54 AM Andreas Krebbel via Gcc-patches >> wrote: >>> >>> The problem appears to be triggered by two locations in the front-end >>> where non-POINTER_SIZE pointers ar

[PATCH] RISC-V: For '-march' and '-mabi' options, add 'Negative' property mentions itself.

2021-04-28 Thread Geng Qi via Gcc-patches

When use multi-lib riscv-tool-chain. A bug is triggered when there are two '-march' at command line. riscv64-unknown-elf-gcc -march=rv32gcp -mabi=ilp32f -march=rv32gcpzp64 HelloWorld.c /lhome/gengq/riscv64-linux-ptest/lib/gcc/riscv64-unknown-elf/10.2.0/../../../../riscv64-unknown-elf/bin/ld: /lh

Re: add ASM_OUTPUT_MAX_SKIP_ALIGN to i386.h

2021-04-28 Thread Uros Bizjak via Gcc-patches

On Tue, Apr 27, 2021 at 5:34 PM Alexandre Oliva wrote: > > > Several i386 align tests expect p2align to be used, but not all > configurations define ASM_OUTPUT_MAX_SKIP_ALIGN, even when > HAVE_GAS_MAX_SKIP_P2ALIGN. > > i386.h has an equivalent ASM_OUTPUT_MAX_SKIP_PAD that is used in > i386.c, so I

Re: [PATCH] ipa/100308 - properly update the callgraph when pruning EH in IPA CP

2021-04-28 Thread Richard Biener via Gcc-patches

On Wed, Apr 28, 2021 at 10:50 AM Richard Biener wrote: > > This makes sure to fall into the delete_unreachable_blocks_update_callgraph > handling to remove blocks becoming unreachable when removing EH edges > by open-coding gimple_purge_dead_eh_edges. > > This fixes an ICE seen with gfortran.dg/go

[PATCH] doc/options.texi: Fix the discription of 'Negative'.

2021-04-28 Thread Geng Qi via Gcc-patches

gcc/ChangeLog: * doc/options.texi (Negative): Fix the discription so that it matches the code implementation of prune_options(). --- gcc/doc/options.texi | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/gcc/doc/options.texi b/gcc/doc/options.texi index 205762

[PATCH] tree-optimization/100292 - avoid invalid GIMPLE from vector lowering

2021-04-28 Thread Richard Biener

We have to avoid folding the condition when building a COND_EXPR since we no longer gimplify the whole thing. The folding done at COND_EXPR build time will deal with possible simplifications. Bootstrapped on x86_64-unknown-linux-gnu, testing in progress. 2021-04-28 Richard Biener PR

[Ada] Extend compile-time evaluation in case statements to all objects

2021-04-28 Thread Pierre-Marie de Rodat

A comment in Analyze_Case_Statement describes an "interesting optimization" that was applied to all assignable objects. This patch extends it to all objects, in particular, to formal parameters of mode IN. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * sem_ch5.adb (Analyze_

[Ada] Incorrect error with Default_Value on private/modular type

2021-04-28 Thread Pierre-Marie de Rodat

This patch fixes a bug where if a private type has a full type that is a modular type with a Default_Value specified, then creating objects of that type causes the compiler to incorrectly say, "illegal operand for numeric conversion". Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/

[Ada] Bad handling of 'Valid_Scalars and arrays

2021-04-28 Thread Pierre-Marie de Rodat

When using 'Valid_Scalars on unconstrained arrays, the expanded code includes unchecked conversion to the unconstrained base type, which may lead to incorrect code being generated. Fixed by replacing Validated_View by Get_Fullest_View except for records where it is still needed. We also take this

[Ada] Update reference with description of type resolution

2021-04-28 Thread Pierre-Marie de Rodat

A reference in the comment about the type resolution described in Sem_Ch4 refers to description with a header "Handling of Overload Resolution", but this has been moved to Sem_Type. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * sem_res.ads: Update reference in comment.

[Ada] Remove unused subprograms

2021-04-28 Thread Pierre-Marie de Rodat

Remove dead code found by GNATcoverage: * Protected objects do not exist in Ada 83. * The whole last part of Scan_ALI is no longer used. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * sem_ch3.adb (Analyze_Object_Declaration): Remove dead code. * ali.ads, ali.adb (S

[Ada] Hashed container Cursor type predefined equality non-conformance

2021-04-28 Thread Pierre-Marie de Rodat

The RM states (A.18-4-18/2, A.18.7-17/2, et al) that "the predefined "=" operator for type Cursor returns True if both cursors are No_Element, or designate the same element in the same container." In some cases, GNAT's implementation violates this requirement. This was due to the component "Positi

[Ada] Improve error message for ghost in predicate

2021-04-28 Thread Pierre-Marie de Rodat

It may be surprising to users that a ghost entity is not allowed to appear in a predicate, which is a kind of assertion. Explain this in a continuation message, as well as the possible fixes. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * ghost.adb (Check_Ghost_Context): Ad

[Ada] Crash on inherited component in type extension in generic unit.

2021-04-28 Thread Pierre-Marie de Rodat

Compiler aborts on an instance body that has a reference to a selected component of a local type extension, when the component is inherited from an ancestor type declared outside of the generic being instantiated. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * exp_ch3.adb (

[Ada] Incorrect discriminant check on call to access to subprogram

2021-04-28 Thread Pierre-Marie de Rodat

When calling an access to a subprogram taking an unconstrained discriminated record as parameter, we fail to pass the extra constrained actual parameter, which would lead to spurious or missed discriminant checks. At the same time we noticed that GNAT sometimes generates trees of the form: (Fie

[Ada] Crash with declare expression used in a postcondition

2021-04-28 Thread Pierre-Marie de Rodat

This is preliminary work to properly handle a declare expression used in a postcondition. This first part is adding guards against missing types. The second part will be about setting the proper type when currently missing. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * sem

[Ada] Remove double initialization of interpretation tables

2021-04-28 Thread Pierre-Marie de Rodat

Global variable Sem_Type.Headers, which is used in type resolution, was initialized twice: by elaboration and by Init_Interp_Tables (called by Gnat1drv). Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * fmap.ads (Reset_Tables): Remove outdated references to GNSA/ASIS.

[Ada] Assert failure on complex code with private type and discriminant

2021-04-28 Thread Pierre-Marie de Rodat

Recent improvements in the handling of discriminants and discriminant checks made an assert failure appear on complex code. It appears that the assertions in einfo.adb (Discriminant_Constraint) is too strict. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * einfo.adb (Discri

[Ada] More precise error about aspects conflicting with Static

2021-04-28 Thread Pierre-Marie de Rodat

When a Pre/Post aspect is conflicting with Static aspect, the error message now says 'aspect "Pre"' or 'aspect "Post"' and not just "this aspect". Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * sem_ch13.adb (Analyze_Aspect_Static): Use aspect name in the error messa

[Ada] Use spans instead of locations for compiler diagnostics

2021-04-28 Thread Pierre-Marie de Rodat

Change the internal data structure that stores the compiler diagnostics to store spans instead of locations, where a span is a triplet of a main location, and a first-last pair of locations. Also change the main procedures for reporting an error in Errout to use spans instead of locations whenever

[Ada] AI12-0397: Default_Initial_Condition expressions for derived types

2021-04-28 Thread Pierre-Marie de Rodat

AI12-0397 specifies new rules for the resolution of expressions for the Default_Intial_Condition aspect, based on defining that a reference to the current instance of a type with DIC is a notional (nonabstract) formal derived type (making DIC similar to how Postcondition and Type_Invariant are defi

[Ada] Refactor repeated checks for the expression of aspect Static

2021-04-28 Thread Pierre-Marie de Rodat

Check for the expression of the aspect Static were done inside branches corresponding to inside-a-generic and outside-a-generic. Now this check occurs only once. Code cleanup; behaviour is unaffected. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * sem_ch13.adb (Analyze_Aspe

[Ada] Eliminate early roundoff error for Long_Long_Float on x86

2021-04-28 Thread Pierre-Marie de Rodat

This overcomes the lack of fused multiply-add instruction on the x87 FPU by doing an iterated addition with exact error handling for the last digit taken into account for the mantissa. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * libgnat/s-valrea.adb (Fast2Sum): New funct

[Ada] Consistent diagnostic on missing -gnat2020 switch for aspects

2021-04-28 Thread Pierre-Marie de Rodat

Reuse Error_Msg_Ada_2020_Feature for messages that reference the currently processed aspect using the '%' insertion character. Cleanup only; behaviour is not affected. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * sem_ch13.adb (Analyze_Aspect_Static): Reuse Error_M

[Ada] Remove redundant assignment in Formal_Is_Used_Once

2021-04-28 Thread Pierre-Marie de Rodat

The counter variable in routine Formal_Is_Used_Once is initialized with 0, then is possibly incremented to 1, and when incremented again, the traversal is abandoned. This second increment can only make the counter equal 2; there is no need to explicitly re-assign it. Code cleanup only; semantics i

[Ada] Install_Restricted_Handlers: define Prio parameter as Interrupt_Priority

2021-04-28 Thread Pierre-Marie de Rodat

System.Interrupt.Install_Restricted_Handlers takes a parameter Prio. It was defined as Any_Priority when it should be the more narrower Interrupt_Priority subtype as handlers can only have priorities in the Interrupt_Priority range. Using the more narrower subtype assists in unit testing. Tested o

[Ada] Simplify data structures for overloaded interpretations

2021-04-28 Thread Pierre-Marie de Rodat

The data structure for overloaded interpretations included a hash table with a subtle implementation. It is now replaced with a generic hash provided by the GNAT.HTable. This is only a code cleanup; behaviour of the compiler is not affected. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/

[Ada] Fix computation of Prec/Succ of zero without denormals

2021-04-28 Thread Pierre-Marie de Rodat

The result must be the Small instead of the Tiny in this case. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * libgnat/s-fatgen.adb: Add use clause for Interfaces.Unsigned_16 and Interfaces.Unsigned_32. (Small16): New constant. (Small32): Likewise.

[Ada] Fix recent optimization in evaluation of selected component for GNATprove

2021-04-28 Thread Pierre-Marie de Rodat

As GNATprove uses the AST after semantic analysis without expansion, the recent optimization in the evaluation of selected components for aggregates was leading to incorrect AST where possible run-time errors in the evaluation of the (aggregate) prefix were not taken into account. Fixed by only ena

[Ada] Adjust List_Length description

2021-04-28 Thread Pierre-Marie de Rodat

The comment for List_Length says "It is an error to call this function with No_List (No_List is not considered to be the same as an empty list).". It legitimately happens in practice, so adjust the comment to allow this. Tested on x86_64-pc-linux-gnu, committed on trunk gcc/ada/ * nlist

New template for 'gcc' made available

2021-04-28 Thread Translation Project Robot

Hello, gentle maintainer. This is a message from the Translation Project robot. (If you have any questions, send them to .) A new POT file for textual domain 'gcc' has been made available to the language teams for translation. It is archived as: https://translationproject.org/POT-files/gcc

Re: [Ping] AVR CC0 conversion

2021-04-28 Thread John Paul Adrian Glaubitz

Hi Senthil! > On Mon, Apr 26, 2021 at 9:20 AM Senthil Kumar Selvaraj via Gcc-patches > wrote: >> >> Hi, >> >> This is >> https://gcc.gnu.org/pipermail/gcc-patches/2021-January/563638.html, >> rebased against latest gcc master. The only change is modification of >> avr_md_asm_adjust's signature to

[PATCH] target/100312 - make x86 masked load builtins pure

2021-04-28 Thread Richard Biener

This arranges for the x86 AVX and AVX2 masked load builtins to be pure to enable dead code elimination and more appropriate alias analysis. Bootstrapped and tested on x86_64-unknown-linux-gnu. OK for trunk? Thanks, Richard. 2021-04-28 Richard Biener PR target/100312 * config

sel-sched: fix UB in init_regs_for_mode [PR100311]

2021-04-28 Thread Richard Earnshaw via Gcc-patches

init_regs_for_mode iterates over all hard regs for the machine to test if the reg is OK for the mode, but an arithmetic overflow can lead to testing elements beyond the end of the arrays allocated for fixed and global registers. Clearly, if a mode requiring multiple hard regs needs one beyond

Re: sel-sched: fix UB in init_regs_for_mode [PR100311]

2021-04-28 Thread Jakub Jelinek via Gcc-patches

On Wed, Apr 28, 2021 at 12:04:45PM +0100, Richard Earnshaw wrote: > init_regs_for_mode iterates over all hard regs for the machine to test if > the reg is OK for the mode, but an arithmetic overflow can lead to testing > elements beyond the end of the arrays allocated for fixed and global > registe

Re: [PATCH] target/100312 - make x86 masked load builtins pure

2021-04-28 Thread Uros Bizjak via Gcc-patches

On Wed, Apr 28, 2021 at 1:02 PM Richard Biener wrote: > > This arranges for the x86 AVX and AVX2 masked load builtins to be > pure to enable dead code elimination and more appropriate alias > analysis. > > Bootstrapped and tested on x86_64-unknown-linux-gnu. OK for trunk? > > Thanks, > Richard. >

Re: [PATCH] target/100312 - make x86 masked load builtins pure

2021-04-28 Thread Richard Biener

On Wed, 28 Apr 2021, Uros Bizjak wrote: > On Wed, Apr 28, 2021 at 1:02 PM Richard Biener wrote: > > > > This arranges for the x86 AVX and AVX2 masked load builtins to be > > pure to enable dead code elimination and more appropriate alias > > analysis. > > > > Bootstrapped and tested on x86_64-unk

Re: [PATCH 1/1] PR100281 Fix SImode pointer handling

2021-04-28 Thread Andreas Krebbel via Gcc-patches

On 4/28/21 10:12 AM, Richard Biener wrote: > On Wed, Apr 28, 2021 at 8:54 AM Andreas Krebbel via Gcc-patches > wrote: >> >> The problem appears to be triggered by two locations in the front-end >> where non-POINTER_SIZE pointers aren't handled right now. >> >> 1. An assertion in strip_typedefs is

[committed] libstdc++: Add missing noexcept on std::thread member function [PR 100298]

2021-04-28 Thread Jonathan Wakely via Gcc-patches

The new inline definition of std::thread::hardware_concurrency() for non-gthreads targets is missing the noexcept-specifier that is on the declaration. libstdc++-v3/ChangeLog: PR libstdc++/100298 * include/bits/std_thread.h (thread::hardware_concurrency): Add missing noexc

Re: [PATCH] target/100312 - make x86 masked load builtins pure

2021-04-28 Thread Uros Bizjak via Gcc-patches

On Wed, Apr 28, 2021 at 1:46 PM Richard Biener wrote: > > On Wed, 28 Apr 2021, Uros Bizjak wrote: > > > On Wed, Apr 28, 2021 at 1:02 PM Richard Biener wrote: > > > > > > This arranges for the x86 AVX and AVX2 masked load builtins to be > > > pure to enable dead code elimination and more appropria

About implementation of the Negative property of options.

2021-04-28 Thread gengqi-linux via Gcc-patches

I have been fixing a bug. It involved the Negative property of options, and I have some confusion about it. gcc/optc-gen.awk: 383 neg = opt_args("Negative", flags[i]); if (neg != "") idx = indices[neg] else { if (flag_set_p("RejectNegative", flags[i])) idx = -1; else { if (opts[i

Re: Recent change breaking Wrestrict8 on various targets

2021-04-28 Thread Jeff Law via Gcc-patches

On 4/28/2021 1:10 AM, Richard Biener wrote: On Tue, Apr 27, 2021 at 10:22 PM Jeff Law wrote: This change: d8e1f1d24179690fd9c0f63c27b12e030010d9ea is the first bad commit commit d8e1f1d24179690fd9c0f63c27b12e030010d9ea Author: Richard Biener Date: Wed Apr 7 12:09:44 2021 +0200 tree

New German PO file for 'gcc' (version 11.1.0)

2021-04-28 Thread Translation Project Robot

Hello, gentle maintainer. This is a message from the Translation Project robot. A revised PO file for textual domain 'gcc' has been submitted by the German team of translators. The file is available at: https://translationproject.org/latest/gcc/de.po (This file, 'gcc-11.1.0.de.po', has jus

[PATCH RFA (diagnostic)] c++: -Wdeprecated-copy and #pragma diagnostic [PR94492]

2021-04-28 Thread Jason Merrill via Gcc-patches

-Wdeprecated-copy was depending only on the state of the warning at the point where we call the function, making it hard to use #pragma diagnostic to suppress the warning for a particular implicitly declared function. But checking whether the warning is enabled at the location of the implicit dec

Re: [PATCH] target/100312 - make x86 masked load builtins pure

2021-04-28 Thread Richard Biener

On Wed, 28 Apr 2021, Uros Bizjak wrote: > On Wed, Apr 28, 2021 at 1:46 PM Richard Biener wrote: > > > > On Wed, 28 Apr 2021, Uros Bizjak wrote: > > > > > On Wed, Apr 28, 2021 at 1:02 PM Richard Biener wrote: > > > > > > > > This arranges for the x86 AVX and AVX2 masked load builtins to be > > >

[PATCH 1/20] aarch64: Use RTL builtin for vmull[_high]_p8 intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the vmull[_high]_p8 Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu and aarch64_be-none-elf - no issues. Ok for master? Thanks,

[PATCH 2/20] aarch64: Use RTL builtin for vq[r]dmulh[q]_n intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the vq[r]dmulh[q]_n Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/Chan

Re: [Patch] OpenMP: Add iterator support to Fortran's depend; add affinity clause

2021-04-28 Thread Jakub Jelinek via Gcc-patches

On Tue, Apr 27, 2021 at 03:36:38PM +0200, Tobias Burnus wrote: > OpenMP 5's iterator can be used for > - depend clause > - affinity clause > - mapping (unsupported and not touched) > > (a) This patch add the iterator support to the Fortran FE > and adds support for it to the depend clause. > > (b

[PATCH 3/20] aarch64: Use RTL builtins for vpaddq intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the vpaddq Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeLog: 2

[PATCH 4/20] aarch64: Use RTL builtins for [su]paddl[q] intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the [su]paddl[q] Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeL

[PATCH 5/20] aarch64: Use RTL builtins for vpadal_[su]32 intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the vpadal_[su]32 Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/Change

[committed] libstdc++: Reduce output of 'make doc-pdf-doxygen'

2021-04-28 Thread Jonathan Wakely via Gcc-patches

Use '@' to prevent Make from echoing the recipe, so that users don't see this every time: if [ -f ${doxygen_pdf} ]; then mv ${doxygen_pdf} ${api_pdf} ; echo ":: PDF file is ${api_pdf}"; else echo "... error"; grep -F 'LaTeX Error' ${doxygen_outdir}/latex/refman.log; grep -F

[PATCH 6/20] aarch64: Use RTL builtins for polynomial vsli[q]_n intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the vsli[q]_n_p* Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeL

Re: GCC 12 Ranger plans

2021-04-28 Thread Andrew MacLeod via Gcc-patches

On 4/28/21 2:24 AM, Richard Biener wrote: The final thing I plan to incorporate is some documentation of ranger technology. There are a lot of features and data available in the Ranger ecosystem that could be useful elsewhere. No one really knows about them, and rather than trying to write ev

[PATCH 7/20] aarch64: Use RTL builtins for polynomial vsri[q]_n intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the vsri[q]_n_p* Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeL

Re: [Patch] offload-defaulted: Config option to silently ignore uninstalled offload compilers

2021-04-28 Thread Jakub Jelinek via Gcc-patches

On Tue, Apr 27, 2021 at 12:22:44PM +0200, Tobias Burnus wrote: > If configured with --enable-offload-defaulted, configured but not installed > offload compilers and libgomp plugins are silently ignored. Useful for > distribution compilers where those are in separate optional packages. > > 2021-04

Re: sel-sched: fix UB in init_regs_for_mode [PR100311]

2021-04-28 Thread Richard Earnshaw via Gcc-patches

On 28/04/2021 12:22, Jakub Jelinek via Gcc-patches wrote: On Wed, Apr 28, 2021 at 12:04:45PM +0100, Richard Earnshaw wrote: init_regs_for_mode iterates over all hard regs for the machine to test if the reg is OK for the mode, but an arithmetic overflow can lead to testing elements beyond the

Re: [PATCH 1/20] aarch64: Use RTL builtin for vmull[_high]_p8 intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the vmull[_high]_p8 Neon intrinsics to use RTL > builtins rather than inline assembly code, allowing for better scheduling and > optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu and > aa

New Ukrainian PO file for 'gcc' (version 11.1.0)

2021-04-28 Thread Translation Project Robot

Hello, gentle maintainer. This is a message from the Translation Project robot. A revised PO file for textual domain 'gcc' has been submitted by the Ukrainian team of translators. The file is available at: https://translationproject.org/latest/gcc/uk.po (This file, 'gcc-11.1.0.uk.po', has

Re: [Patch] offload-defaulted: Config option to silently ignore uninstalled offload compilers

2021-04-28 Thread Matthias Klose

On 4/27/21 12:22 PM, Tobias Burnus wrote: > This is based on Jakub's patch* which is used with many distributions – and > is has > to be maintained by all of them; otherwise issues like lp #1878760 might > creep in, > as discussed in #gcc yesterday. - As I am a huge fan of reducing code > duplic

Re: sel-sched: fix UB in init_regs_for_mode [PR100311]

2021-04-28 Thread Jakub Jelinek via Gcc-patches

On Wed, Apr 28, 2021 at 03:06:53PM +0100, Richard Earnshaw wrote: > We could of course, change the patch below into a checking assert of the > inverse condition: that would help to catch issues like this. I have nothing against an assert, but am not sure that sel-sched is the best spot for that, b

Re: [PATCH 2/20] aarch64: Use RTL builtin for vq[r]dmulh[q]_n intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the vq[r]dmulh[q]_n Neon intrinsics to use > RTL builtins rather than inline assembly code, allowing for better scheduling > and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > i

Re: [PATCH 3/20] aarch64: Use RTL builtins for vpaddq intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the vpaddq Neon intrinsics to use RTL > builtins rather than inline assembly code, allowing for better scheduling > and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > issues. >

[PATCH 8/20] aarch64: Use RTL builtins for v[q]tbl intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the v[q]tbl Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeLog:

[PATCH 9/20] aarch64: Use RTL builtins for v[q]tbx intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the v[q]tbx Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan --- gcc/ChangeLog:

[PATCH][omp, simt] Fix expand_GOMP_SIMT_*

2021-04-28 Thread Tom de Vries

Hi, When running the test-case included in this patch using an nvptx accelerator, it fails in execution. The problem is that the expansion of GOMP_SIMT_XCHG_BFLY is optimized away during pass_jump as "trivially dead insns". This is caused by this code in expand_GOMP_SIMT_XCHG_BFLY: ... class e

Re: [PATCH 4/20] aarch64: Use RTL builtins for [su]paddl[q] intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the [su]paddl[q] Neon intrinsics to use > RTL builtins rather than inline assembly code, allowing for better > scheduling and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > issu

[PATCH 10/20] aarch64: Use RTL builtins for FP ml[as]_n intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the floating-point vml[as][q]_n Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu - no issues. Ok for master? Thanks, Jonathan -

Re: [PATCH 5/20] aarch64: Use RTL builtins for vpadal_[su]32 intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the vpadal_[su]32 Neon intrinsics to use > RTL builtins rather than inline assembly code, allowing for better > scheduling and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > iss

Re: [PATCH][omp, simt] Fix expand_GOMP_SIMT_*

2021-04-28 Thread Jakub Jelinek via Gcc-patches

On Wed, Apr 28, 2021 at 04:30:30PM +0200, Tom de Vries wrote: > gcc/ChangeLog: > > 2021-04-28 Tom de Vries > > PR target/100232 > * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC) > (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED) > (expand_GOMP_SIMT_VOTE_ANY, e

[PATCH 11/20] aarch64: Use RTL builtins for FP ml[as] intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the floating-point vml[as][q] Neon intrinsics to use RTL builtins rather than relying on the GCC vector extensions. Using RTL builtins allows control over the emission of fmla/fmls instructions (which we don't want here.) With this commit, the code generated by

[PATCH] AArch64: Cleanup aarch64_classify_symbol

2021-04-28 Thread Wilco Dijkstra via Gcc-patches

Use a GOT indirection for extern weak symbols instead of a literal - this is the same as PIC/PIE and mirrors LLVM behaviour. Ensure PIC/PIE use the same offset limits for symbols that don't use the GOT. Passes bootstrap and regress. OK for commit? ChangeLog: 2021-04-27 Wilco Dijkstra

[PATCH 12/20] aarch64: Use RTL builtins for FP ml[as][q]_lane intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the floating-point vml[as][q]_lane Neon intrinsics to use RTL builtins rather than relying on the GCC vector extensions. Using RTL builtins allows control over the emission of fmla/fmls instructions (which we don't want here.) With this commit, the code generat

Re: [PATCH 6/20] aarch64: Use RTL builtins for polynomial vsli[q]_n intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the vsli[q]_n_p* Neon intrinsics to use RTL > builtins rather than inline assembly code, allowing for better scheduling > and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > issu

[PATCH 13/20] aarch64: Use RTL builtins for FP ml[as][q]_laneq intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the floating-point vml[as][q]_laneq Neon intrinsics to use RTL builtins rather than relying on the GCC vector extensions. Using RTL builtins allows control over the emission of fmla/fmls instructions (which we don't want here.) With this commit, the code genera

Re: [PATCH 7/20] aarch64: Use RTL builtins for polynomial vsri[q]_n intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the vsri[q]_n_p* Neon intrinsics to use RTL > builtins rather than inline assembly code, allowing for better scheduling > and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > issu

[PATCH 14/20] testsuite: aarch64: Add fusion tests for FP vml[as] intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch adds compilation tests to make sure that the output of vmla/vmls floating-point Neon intrinsics (fmul, fadd/fsub) is not fused into fmla/fmls instructions. Ok for master? Thanks, Jonathan --- gcc/testsuite/ChangeLog: 2021-02-16 Jonathan Wright * gcc.targ

[PATCH 15/20] aarch64: Use RTL builtins for vcvtx intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch rewrites the vcvtx Neon intrinsics to use RTL builtins rather than inline assembly code, allowing for better scheduling and optimization. Regression tested and bootstrapped on aarch64-none-linux-gnu and aarch64_be-none-elf - no issues. Ok for master? Thanks, Jonathan

Re: [PATCH 8/20] aarch64: Use RTL builtins for v[q]tbl intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the v[q]tbl Neon intrinsics to use RTL > builtins rather than inline assembly code, allowing for better scheduling > and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > issues. >

[PATCH 16/20] aarch64: Relax aarch64_hn2 RTL pattern

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch implements the v[r]addhn2 and v[r]subhn2 Neon intrinsic RTL patterns using a vec_concat of a register_operand and an ADDSUBHN unspec - instead of just an ADDSUBHN2 unspec. This more relaxed pattern allows for more aggressive combinations and ultimately better code genera

[PATCH 17/20] aarch64: Relax aarch64_qshrnn2_n RTL pattern

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch implements the saturating right-shift and narrow high Neon intrinsic RTL patterns using a vec_concat of a register_operand and a VQSHRN_N unspec - instead of just a VQSHRN2_N unspec. This more relaxed pattern allows for more aggressive combinations and ultimately better

[PATCH 18/20] aarch64: Update attributes of arm_fp16.h intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch updates the attributes of all intrinsics defined in arm_fp16.h to be consistent with the attributes of the intrinsics defined in arm_neon.h. Specifically, this means updating the attributes from: __extension__ static __inline __attribute__ ((__always_inline__)) to:

Re: [PATCH 9/20] aarch64: Use RTL builtins for v[q]tbx intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the v[q]tbx Neon intrinsics to use RTL > builtins rather than inline assembly code, allowing for better scheduling > and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-gnu - no > issues. >

Re: [PATCH] target/100312 - make x86 masked load builtins pure

2021-04-28 Thread Uros Bizjak via Gcc-patches

On Wed, Apr 28, 2021 at 3:32 PM Richard Biener wrote: > > On Wed, 28 Apr 2021, Uros Bizjak wrote: > > > On Wed, Apr 28, 2021 at 1:46 PM Richard Biener wrote: > > > > > > On Wed, 28 Apr 2021, Uros Bizjak wrote: > > > > > > > On Wed, Apr 28, 2021 at 1:02 PM Richard Biener > > > > wrote: > > > > >

Re: [PATCH 10/20] aarch64: Use RTL builtins for FP ml[as]_n intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the floating-point vml[as][q]_n Neon > intrinsics to use RTL builtins rather than inline assembly code, allowing > for better scheduling and optimization. > > Regression tested and bootstrapped on aarch64-none-linux-

[committed] libstdc++: Define __cpp_lib_constexpr_string macro

2021-04-28 Thread Jonathan Wakely via Gcc-patches

As noted in r11-1339-gb6ab9ecd550227684643b41e9e33a4d3466724d8 we define a non-standard __cpp_lib_constexpr_char_traits feature test macro to indicate support for P0426R1 and P1032R1. At some point last year the __cpp_lib_constexpr_string macro was retconned to indicate support for those papers. Th

Re: [PATCH 11/20] aarch64: Use RTL builtins for FP ml[as] intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the floating-point vml[as][q] Neon intrinsics > to use RTL builtins rather than relying on the GCC vector extensions. > Using RTL builtins allows control over the emission of fmla/fmls > instructions (which we don't

Re: [PATCH] AArch64: Cleanup aarch64_classify_symbol

2021-04-28 Thread Richard Sandiford via Gcc-patches

Wilco Dijkstra via Gcc-patches writes: > Use a GOT indirection for extern weak symbols instead of a literal - this is > the same as > PIC/PIE and mirrors LLVM behaviour. Just to check: I guess this part is an optimisation, because it means that we can share the GOT entry with other TUs. Is that

Re: [PATCH 12/20] aarch64: Use RTL builtins for FP ml[as][q]_lane intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch rewrites the floating-point vml[as][q]_lane Neon > intrinsics to use RTL builtins rather than relying on the GCC vector > extensions. Using RTL builtins allows control over the emission of > fmla/fmls instructions (which we d

[PATCH 19/20] aarch64: Update attributes of arm_acle.h intrinsics

2021-04-28 Thread Jonathan Wright via Gcc-patches

Hi, As subject, this patch updates the attributes of all intrinsics defined in arm_acle.h to be consistent with the attributes of the intrinsics defined in arm_neon.h. Specifically, this means updating the attributes from: __extension__ static __inline __attribute__ ((__always_inline__)) to:

Re: [PATCH 14/20] testsuite: aarch64: Add fusion tests for FP vml[as] intrinsics

2021-04-28 Thread Richard Sandiford via Gcc-patches

Jonathan Wright via Gcc-patches writes: > Hi, > > As subject, this patch adds compilation tests to make sure that the output > of vmla/vmls floating-point Neon intrinsics (fmul, fadd/fsub) is not fused > into fmla/fmls instructions. > > Ok for master? > > Thanks, > Jonathan > > --- > > gcc/testsui

1 2 >

1 - 100 of 175 matches

Mail list logo