https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
--- Comment #11 from Richard Biener ---
Just an update on costs:
t.c:1:35: note: === vect_compute_single_scalar_iteration_cost ===
0x483e120 *_3 1 times scalar_load costs 12 in body
0x483e120 _4 + r_16 1 times scalar_stmt costs 12 in body
and
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
Jakub Jelinek changed:
What|Removed |Added
Target Milestone|8.3 |8.4
--- Comment #10 from Jakub Jelinek
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
--- Comment #9 from Richard Biener ---
Split out the target cost issue to PR89114, it'll improve code-gen for the
unwanted vectorization a bit at least. That's independent of the
vectorizer cost issue.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
Segher Boessenkool changed:
What|Removed |Added
CC||segher at gcc dot gnu.org
--- Comme
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
--- Comment #7 from Richard Biener ---
Author: rguenth
Date: Fri Jan 25 12:46:24 2019
New Revision: 268264
URL: https://gcc.gnu.org/viewcvs?rev=268264&root=gcc&view=rev
Log:
2019-01-25 Richard Biener
PR tree-optimization/89049
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
Richard Biener changed:
What|Removed |Added
CC|segher at gcc dot gnu.org |
--- Comment #6 from Richard Bie
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
Richard Biener changed:
What|Removed |Added
CC||segher at gcc dot gnu.org
--- Comment #
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
--- Comment #4 from Richard Biener ---
With -mtune=core-avx2 we do
vmovups (%rdi), %xmm1
vmovups (%rdi), %ymm3
...
vextractf128 $0x1, %ymm3, %xmm1
with -mtune=intel the even more weird
vmovups (%rdi), %xmm1
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
--- Comment #3 from Richard Biener ---
In the assembly I notice
vinsertf128 $0x1, 16(%rdi), %ymm4, %ymm2
...
vextractf128 $0x1, %ymm2, %xmm1
somehow we fail to elide the initial %ymm2 build with the upper half
extraction
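
To illustrate why the vinsertf128 / vextractf128 pair quoted in comment #3 looks redundant, here is a small portable C model of the two instructions. It is only a sketch, not GCC code and not the testcase from the PR: the types ymm_model and xmm_model and all function names are hypothetical, and float lanes are an assumption. It shows that inserting 16 bytes into the upper half and then extracting the upper half gives the same value as a plain 128-bit load from the same address, regardless of what the lower half held.

```c
#include <string.h>

/* Hypothetical toy model: a 256-bit register as eight float lanes,
   a 128-bit register as four float lanes (float is an assumption). */
typedef struct { float lane[8]; } ymm_model;
typedef struct { float lane[4]; } xmm_model;

/* Models "vinsertf128 $0x1, mem, src, dst": take src and overwrite
   its upper four lanes with 16 bytes read from mem. */
static ymm_model insert_hi(ymm_model src, const float *mem)
{
    memcpy(&src.lane[4], mem, 4 * sizeof(float));
    return src;
}

/* Models "vextractf128 $0x1, src, dst": return the upper four lanes. */
static xmm_model extract_hi(ymm_model src)
{
    xmm_model dst;
    memcpy(dst.lane, &src.lane[4], sizeof dst.lane);
    return dst;
}

/* Models "vmovups mem, %xmm": a plain unaligned 128-bit load. */
static xmm_model load_xmm(const float *mem)
{
    xmm_model dst;
    memcpy(dst.lane, mem, sizeof dst.lane);
    return dst;
}

/* Returns 1 when insert-then-extract of the upper half agrees with a
   direct 128-bit load from the same address (base + 4 floats, i.e.
   byte offset 16 as in 16(%rdi)). */
static int hi_roundtrip_is_plain_load(ymm_model low_src, const float *base)
{
    xmm_model a = extract_hi(insert_hi(low_src, base + 4));
    xmm_model b = load_xmm(base + 4);
    return memcmp(a.lane, b.lane, sizeof a.lane) == 0;
}
```

Calling hi_roundtrip_is_plain_load with any low_src and any array of at least eight floats returns 1 in this model, which is the sense in which the initial 256-bit build could be elided when only the upper half is used afterwards.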
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
--- Comment #2 from Richard Biener ---
Created attachment 45531
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=45531&action=edit
scalar loop cost patch
I'm testing this patch (not fixing the testcase, just improving costs).
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
Richard Biener changed:
What|Removed |Added
Priority|P3 |P2
Status|UNCONFIRMED
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89049
Jakub Jelinek changed:
What|Removed |Added
Target||x86_64-linux
CC|
12 matches