Hi Tobjan, On Mon, 7 Feb 2011 19:45:37 -0500, Tobjan Brejicz <[email protected]> wrote: > I'm have some problem with my testing of pycuda, which I don't understand. > > I want to understand example of how to pointwise multiply large vector (like > 10000 length) use pycuda. > > So I am trying the code from here: > > http://documen.tician.de/pycuda/ > > about "multiply_them". It work correctly for length 400 vector, but I > replace 400 with larger number, like 550, and now immediately it is not > work.
Blocks are limited to 512 threads. To submit more work, you likely want to use *both* blocks and grid of non-unit size. I've updated that doc example to show how you can specify the grid size. > So what is wrong? Is my idea wrong to use larger values, should I split up > problem more into smaller piece? Yes, exactly. HTH, Andreas
pgpgdztqm2EUt.pgp
Description: PGP signature
_______________________________________________ PyCUDA mailing list [email protected] http://lists.tiker.net/listinfo/pycuda
