Hi Everyone, I am running a truncated mean convolution but I keep running into cuCtxSynchronize failed type errors when the kernel is called numerous times. I guess that it's not a timeout issue in GPU issue because it's a compute only GPU.
So, I put it down to memory issues due to the use of list that I need to store in kernel for sorting/truncating values. Am I correct in thinking that the error can be avoided by using shared memory? Thanks, Sanjay Rana | Design Directorate Rural Payments Agency | Room 200 | North Gate House | Reading | RG1 1AF Jabber: 02077642065 | ext. 42065 | Email: [email protected]<mailto:[email protected]> Follow us on Twitter @Ruralpay ************************************************************************************ This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this email in error please notify the system manager. This footnote also confirms that this email message has been swept by Cisco ESA for the presence of computer viruses. ************************************************************************************
_______________________________________________ PyCUDA mailing list [email protected] https://lists.tiker.net/listinfo/pycuda
