ot; directives when translating asynchronous
> kernels regions instead.
(Or rather, use structured 'data' (as we're now doing), but with
appropriate 'async' clauses.)
> The attached patch just adds a "wait" operation before the end of
> the enclosing data reg
This patch provides a workaround for unreliable operation of asynchronous
kernels regions on AMD GCN. At present, kernels regions are decomposed
into a series of parallel regions surrounded by a data region capturing
the data-movement clauses needed by the region as a whole:
#pragma acc kernels