Hi! On 2019-01-12T23:21:23+0100, Tom de Vries <tdevr...@suse.de> wrote: > Allow vector_length clauses to accept values larger than warp size.
> * testsuite/libgomp.oacc-c-c++-common/parallel-dims.c: Expect vector > length 2097152 to be reduced to 1024 instead of 32. > --- a/libgomp/testsuite/libgomp.oacc-c-c++-common/parallel-dims.c > +++ b/libgomp/testsuite/libgomp.oacc-c-c++-common/parallel-dims.c > @@ -350,7 +350,7 @@ int main () > int gangs_min, gangs_max, workers_min, workers_max, vectors_min, > vectors_max; > gangs_min = workers_min = vectors_min = INT_MAX; > gangs_max = workers_max = vectors_max = INT_MIN; > -#pragma acc parallel copy (vectors_actual) /* { dg-warning "using > vector_length \\(32\\), ignoring 2097152" "" { target > openacc_nvidia_accel_configured } } */ \ > +#pragma acc parallel copy (vectors_actual) /* { dg-warning "using > vector_length \\(1024\\), ignoring 2097152" "" { target > openacc_nvidia_accel_configured } } */ \ > vector_length (VECTORS) > { > if (acc_on_device (acc_device_host)) > @@ -361,7 +361,7 @@ int main () > else if (acc_on_device (acc_device_nvidia)) > { > /* The GCC nvptx back end enforces vector_length (32). */ > - vectors_actual = 32; > + vectors_actual = 1024; > } > else > __builtin_abort (); As obvious, pushed "[nvptx] Update comment in 'libgomp.oacc-c-c++-common/parallel-dims.c'" to master branch in commit e64d62c7008e6a4b0227fd25e071db8f0b3f1820, see attached. Grüße Thomas ----------------- Mentor Graphics (Deutschland) GmbH, Arnulfstrasse 201, 80634 München Registergericht München HRB 106955, Geschäftsführer: Thomas Heurung, Frank Thürauf
>From e64d62c7008e6a4b0227fd25e071db8f0b3f1820 Mon Sep 17 00:00:00 2001 From: Thomas Schwinge <tho...@codesourcery.com> Date: Sat, 5 Jun 2021 22:01:48 +0200 Subject: [PATCH] [nvptx] Update comment in 'libgomp.oacc-c-c++-common/parallel-dims.c' Small fix-up for r267889 (commit 2b9d9e393766d2fa6e2dd5f361d0db14872cf261) "[nvptx] Enable large vectors": > * testsuite/libgomp.oacc-c-c++-common/parallel-dims.c: Expect vector > length 2097152 to be reduced to 1024 instead of 32. libgomp/ * testsuite/libgomp.oacc-c-c++-common/parallel-dims.c <acc_device_nvidia>: Update comment. --- libgomp/testsuite/libgomp.oacc-c-c++-common/parallel-dims.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/libgomp/testsuite/libgomp.oacc-c-c++-common/parallel-dims.c b/libgomp/testsuite/libgomp.oacc-c-c++-common/parallel-dims.c index ef4917aafff..ef3dfda5fa5 100644 --- a/libgomp/testsuite/libgomp.oacc-c-c++-common/parallel-dims.c +++ b/libgomp/testsuite/libgomp.oacc-c-c++-common/parallel-dims.c @@ -385,7 +385,7 @@ int main () } else if (acc_on_device (acc_device_nvidia)) { - /* The GCC nvptx back end enforces vector_length (32). */ + /* The GCC nvptx back end reduces to vector_length (1024). */ vectors_actual = 1024; } else if (acc_on_device (acc_device_radeon)) -- 2.30.2