Re: Char shifts promoted to int. Why?

2006-12-21 Thread Ayal Zaks
>Something along these lines may be useful to do in the vectorizer when we >get code like this: > > ((char)x) = ((char)( ((int)((char)x)) << ((int)c) ) ) >and don't feel like doing all the unpacking of chars to ints and then >packing the ints to chars after the shift. An alternative could be

Fw: Scheduling an early complete loop unrolling pass?

2007-02-07 Thread Ayal Zaks
... >Ah, right... I wonder if we can keep the loop structure in place, even >after completely unrolling the loop - I mean the 'struct loop' in >'current_loops' (not the actual CFG), so that the "SLP in loops" would have >a chance to at least consider vectorizing this "loop". Having a "loop" str

Re: RFC: vectorizer cost model

2007-03-08 Thread Ayal Zaks
> > > "Linthicum, Tony" <[EMAIL PROTECTED]> writes: ... > > > * What is the best way to access target level cost information? > > > > I'm sure you know that the loop code does this by generating RTL and > > asking for the cost of it (computation_cost in tree-ssa-loop-ivopts.c). > > Which should

Re: Improvements of the haifa scheduler

2007-03-09 Thread Ayal Zaks
s lists the way it shouldn't. > Lists are being copied, edited and deleted directly without interaction > with sched-deps.c . What the scheduler really needs is the following > set of primitives: >o FOR_EACH_DEP (insn, which_list, iterator, dep) - walk through > insn's w

Re: Ayal Zaks appointed Modulo Scheduler maintainer

2007-03-23 Thread Ayal Zaks
David Edelsohn <[EMAIL PROTECTED]> wrote on 23/03/2007 19:04:57: >I am pleased to announce that the GCC Steering Committee has > appointed Ayal Zaks as Modulo Scheduler maintainer. > >Please join me in congratulating Ayal on his new role. > Ayal, please update

Re: SoC Project: Propagating array data dependencies from Tree-SSA to RTL

2007-03-25 Thread Ayal Zaks
"Daniel Berlin" <[EMAIL PROTECTED]> wrote on 25/03/2007 05:19:41: > On 3/23/07, Alexander Monakov <[EMAIL PROTECTED]> wrote: > > Hello, > > > > > > I would be pleased to see Ayal Zaks as my mentor, because proposed > > improvement is

Fw: Store scheduling with DFA scheduler

2005-05-01 Thread Ayal Zaks
As you noted, when the scheduler decides between stores and adds it always prefers the adds (first at t = 5), due to its critical path heuristic. In Jon's example, stores seem "costly" as one cannot issue a load or store immediately following a store. Perhaps the scheduler could take the (resou

Re: Software pipelining capabilities

2005-06-15 Thread Ayal Zaks
> Vasanth <[EMAIL PROTECTED]> > I am using powerpc-eabi-gcc (3.4.1) and trying to retarget it for a > fully pipelined FPU. I have a DFA model for the FPU. I am looking at > the code produced for a simple FIR algorithm (a loop iterating over an > array, with a multiply-add operation per iterati

Re: Question about SMS scheduling windows

2011-07-27 Thread Ayal Zaks
(sorry for replicated submissions, had to convert to plain text) >2011/7/27 Revital1 Eres > >Hello Richard, > > >> I ask because in the final range: >> >> start = early_start; >> end = MIN (end, early_start + ii); >> /* Schedule the node close to it's predecessors. */ >>

Re: Question about SMS scheduling windows

2011-08-04 Thread Ayal Zaks
endencies is definitely important. > > Ayal Zaks writes: > >>> I ask because in the final range: > >>> > >>>       start = early_start; > >>>       end = MIN (end, early_start + ii); > >>>       /* Schedule the node close to it'

Re: Question about SMS scheduling windows

2011-08-08 Thread Ayal Zaks
2011/8/8 Richard Sandiford > > Ayal Zaks writes: > >> OK.  For the follow-on iv patch, it seemed easier to keep both bounds > >> inclusive for the loop, then make the "end" exclusive when setting the > >> out parameters.  (Note that there shouldn'

Re: A question about sched_analyze_insn in sched-deps.c

2011-08-15 Thread Ayal Zaks
>AFAIK SMS will not do speculative memory access. Right, SMS does no speculative memory access. Though that might not be a bad idea... Ayal. 2011/8/11 Revital Eres > > Hello, > > >> I appriciate explanation regarding the following piece of code in > >> sched_analyze_insn function (sched-deps.c)

Re: Solve transitive closure issue in modulo scheduling

2009-02-01 Thread Ayal Zaks
"Bingfeng Mei" wrote on 30/01/2009 14:44:01: > Hello, > I try to make modulo scheduling work more efficiently for our VLIW target. I > found one serious issue that prevents current SMS algorithm from achieving > high IPC is so-called "transitive closure" problem, where scheduling window is > only

RE: Solve transitive closure issue in modulo scheduling

2009-02-17 Thread Ayal Zaks
l, not many loops are candidates for software pipelining. Thanks. > I would suggest to check the actual compile-time and memory usage, and try to reduce them later if needed. Modulo-scheduling is not a compile-time cheap optimization in general. Ayal. > Cheers, > Bingfeng > > >

Re: [PATCH] SMS - Pass the actual schedulable rows to compute_split_row

2009-03-13 Thread Ayal Zaks
s, Ayal. > Thanks, > Revital > > * modulo-sched.c (sms_schedule_by_order): Pass the actual > schedulable rows to compute_split_row. > > [attachment "patch_sms_12_3.txt" deleted by Ayal Zaks/Haifa/IBM]

Re: Line insn notes in modulo-sched

2006-03-14 Thread Ayal Zaks
Steven Bosscher <[EMAIL PROTECTED]> wrote on 14/03/2006 01:32:09: > Hi Ayal, > > The SMS implementation in GCC, in modulo-sched.c, uses line notes > to find insn locations, see find_line_note. Why are you using > line notes instead of insn locators? Line notes are on the list > of Things That Sh

Re: Fw: Status of SEE and Autovectorization patches?

2006-05-03 Thread Ayal Zaks
> Roger Sayle <[EMAIL PROTECTED]> > Sent by: [EMAIL PROTECTED] > > 03/05/2006 05:03 PM > > To > > Mark Mitchell <[EMAIL PROTECTED]> > > cc > > Richard Henderson <[EMAIL PROTECTED]>, gcc mailing list , Leehod > Baruch <[EMAIL PROTECTED]>, Mircea Namolaru/Haifa/[EMAIL PROTECTED], > <[EMAIL PROTECTED]

Re: Fw: GCC 4.2 Status Report (2006-06-04)

2006-06-06 Thread Ayal Zaks
> This status report has been a long time coming, for which I apologize. > > As fwprop is no longer on the table for 4.2, and as the vectorizer > improvements seem to have stalled due to a combination of lack of review > and Dorit's leave, That is unfortunate. Dorit did make a sincere effort to p

Re: Fw: GCC 4.2 Status Report (2006-06-04)

2006-06-07 Thread Ayal Zaks
"Richard Guenther" <[EMAIL PROTECTED]> wrote on 06/06/2006 16:58:27: > On 6/6/06, Ayal Zaks <[EMAIL PROTECTED]> wrote: > > > > > This status report has been a long time coming, for which I apologize. > > > > > > As fwprop is no longer on t

Workshop on GCC for Research in Embedded and Parallel Systems

2007-06-26 Thread Ayal Zaks
Acronymed GREPS (... just what you were looking for), is to be held on September 16 in Brasov, Romania, co-located with PACT. We'd like to bring this workshop to your attention; the submission site is now open until July 24, after the upcoming GCC Developers' Summit. For more details see http://sy

Re: Does unrolling prevents doloop optimizations?

2007-06-30 Thread Ayal Zaks
Zdenek Dvorak <[EMAIL PROTECTED]> wrote on 30/06/2007 09:55:45: > Hello, > > > By "this change" I mean just commenting out the check in > > doloop_condition_get. After applying the patch that introduced DOLOOP > > patterns for SPU (http://gcc.gnu.org/ml/gcc-patches/2007-01/msg01470.html) > > we ne

Re: Does unrolling prevents doloop optimizations?

2007-06-30 Thread Ayal Zaks
Zdenek Dvorak <[EMAIL PROTECTED]> wrote on 30/06/2007 10:52:32: > Hello, > > > It doesn't seem that the number of iterations analysis from loop-iv.c deals > > with EQ closing branches. > > loop-iv works just fine for EQ closing branches. > Thanks for the clarification (I didn't see EQ in iv_numbe

Re: Does unrolling prevents doloop optimizations?

2007-06-30 Thread Ayal Zaks
Zdenek Dvorak <[EMAIL PROTECTED]> wrote on 30/06/2007 20:36:12: > Hello, > > > > > It doesn't seem that the number of iterations analysis from loop-iv.c > > deals > > > > with EQ closing branches. > > > > > > loop-iv works just fine for EQ closing branches. > > > > > > > Thanks for the clarificati

Fw: Failing matrix tests

2007-07-14 Thread Ayal Zaks
> Is anyone else seeing failures on the gcc.dg/matrix tests? I am getting > the following failures on IA64 HP-UX: > Hope Razya can help spot the cause, after she returns from vacation later this week. Ayal. > FAIL: gcc.dg/matrix/matrix-1.c scan-ipa-dump-times Flattened 3 dimensions 1 > FAIL: g

Re: Workshop on GCC for Research in Embedded and Parallel Systems

2007-08-01 Thread Ayal Zaks
t 13, 2007 Workshop date: September 16, 2007 Ayal Zaks/Haifa/IBM wrote on 26/06/2007 18:08:43: > Acronymed GREPS (... just what you were looking for), is to be held on > September 16 in Brasov, Romania, co-located with PACT. We'd like to bring this > workshop to your attention; the sub

Re: [RFC] Improve Tree-SSA if-conversion - convergence of efforts

2007-08-01 Thread Ayal Zaks
"Daniel Berlin" <[EMAIL PROTECTED]> wrote on 01/08/2007 18:27:35: > On 8/1/07, Tehila Meyzels <[EMAIL PROTECTED]> wrote: > > "Daniel Berlin" <[EMAIL PROTECTED]> wrote on 31/07/2007 18:00:57: > > > > > > > > I agree with you for conditional stores/loads. > > > > Great! > > > > > > > > The unconditi

Re: [RFC][modulo-sched] Fix scheduling order within a cycle

2007-11-13 Thread Ayal Zaks
only must_preceed insn 58 because the latency between 61->58 is > 14 which causes the end boundary of the window to be 0. > > Thanks, > Revital > > [attachment "test.c.172r.sms" deleted by Ayal Zaks/Haifa/IBM] Indeed the latency between 61->58 is 14 causing en

Re: [RFC][modulo-sched] Fix scheduling order within a cycle

2007-11-15 Thread Ayal Zaks
Revital1 Eres/Haifa/IBM wrote on 14/11/2007 18:46:14: > > > > > When scheduling insn 58, we calculate a window of possible cycles according > > to already scheduled predecessors and successors. This window looks like a > > parallelogram in general rather than a rectangle: in the first cycle there