On 22-09-14 10:28, Richard Biener wrote:
On Tue, 16 Sep 2014, Tom de Vries wrote:
On 09-09-14 12:56, Richard Biener wrote:
On Tue, 9 Sep 2014, Tom de Vries wrote:
On 18-08-14 14:16, Tom de Vries wrote:
On 06-08-14 17:10, Tom de Vries wrote:
We could insert a pass-group here that only deals
On Tue, 16 Sep 2014, Tom de Vries wrote:
> On 09-09-14 12:56, Richard Biener wrote:
> > On Tue, 9 Sep 2014, Tom de Vries wrote:
> >
> > > On 18-08-14 14:16, Tom de Vries wrote:
> > > > On 06-08-14 17:10, Tom de Vries wrote:
> > > > > We could insert a pass-group here that only deals with function
On September 16, 2014 5:34:15 PM CEST, Tom de Vries
wrote:
>On 09-09-14 12:56, Richard Biener wrote:
>> On Tue, 9 Sep 2014, Tom de Vries wrote:
>>
>>> On 18-08-14 14:16, Tom de Vries wrote:
On 06-08-14 17:10, Tom de Vries wrote:
> We could insert a pass-group here that only deals with fu
On 09-09-14 12:56, Richard Biener wrote:
On Tue, 9 Sep 2014, Tom de Vries wrote:
On 18-08-14 14:16, Tom de Vries wrote:
On 06-08-14 17:10, Tom de Vries wrote:
We could insert a pass-group here that only deals with functions that have
the
kernels directive, and do the auto-par thing in a pass_
On Tue, 9 Sep 2014, Tom de Vries wrote:
> On 18-08-14 14:16, Tom de Vries wrote:
> > On 06-08-14 17:10, Tom de Vries wrote:
> > > We could insert a pass-group here that only deals with functions that have
> > > the
> > > kernels directive, and do the auto-par thing in a pass_oacc_kernels (which
>
On 18-08-14 14:16, Tom de Vries wrote:
On 06-08-14 17:10, Tom de Vries wrote:
We could insert a pass-group here that only deals with functions that have the
kernels directive, and do the auto-par thing in a pass_oacc_kernels (which
should share the majority of the infrastructure with the parloop
On 06-08-14 17:10, Tom de Vries wrote:
The place after build_ealias is early enough to be before the lto-stream
write/read. I don't see how we can do this earlier. Before ealias, there's no
alias info, and one of the loops fails to be recognized as parallel.
Furthermore, pass_ch, pass_ccp, pass_l
Jakub,
I've looked into how to implement the openacc kernels directive in gcc.
In order to map the loopnests marked by the kernels directive efficiently on
accelerator hardware, we need parallelization and vectorization.
Focussing on paralellization for the moment, a possibility for paralelli