> Wild theory: Maybe the print statements introduce GPU synchronization?
> Does your observation change with multiple loops through the code?
>
> Also note that the profiler won't help you debug overlap. If it is
> active, all GPU activity is synchronous.
>
> Andreas

No. None of the above. The "Working.py" code runs overlapping using
the profiler including print statments.

Don't know where you want to go with the multiple loops comment ...
the code overlaps or not without any loops.

-Magnus


-- 

-----------------------------------------------
Magnus Paulsson
Assistant Professor
School of Computer Science, Physics and Mathematics
Linnaeus University
Phone: +46-480-446308
Mobile: +46-70-6942987

_______________________________________________
PyCUDA mailing list
[email protected]
http://lists.tiker.net/listinfo/pycuda

Reply via email to