Hi, I woder if I can ameliorate the pycuda code by editing the number of maximum threads in the gpuarray.py ? And I can't understand what's really happening when I use the methods of gpuarray to multiply a matrix with a scalar ? Is the scalar sent to the GPU for each element of the matrix or it's sent only the first time ? And is it sent as scalar or as gpuarray ?
Thanks -- View this message in context: http://pycuda.2962900.n2.nabble.com/Amelioration-of-GPU-speed-using-pycuda-functions-tp6167365p6167365.html Sent from the PyCuda mailing list archive at Nabble.com. _______________________________________________ PyCUDA mailing list [email protected] http://lists.tiker.net/listinfo/pycuda
