> Wild theory: Maybe the print statements introduce GPU synchronization? > Does your observation change with multiple loops through the code? > > Also note that the profiler won't help you debug overlap. If it is > active, all GPU activity is synchronous. > > Andreas
No. None of the above. The "Working.py" code runs overlapping using the profiler including print statments. Don't know where you want to go with the multiple loops comment ... the code overlaps or not without any loops. -Magnus -- ----------------------------------------------- Magnus Paulsson Assistant Professor School of Computer Science, Physics and Mathematics Linnaeus University Phone: +46-480-446308 Mobile: +46-70-6942987 _______________________________________________ PyCUDA mailing list [email protected] http://lists.tiker.net/listinfo/pycuda
