Thank you so much for this help and very soon reply, I now understand what
On Mon, Feb 7, 2011 at 8:16 PM, Andreas Kloeckner
On Mon, 7 Feb 2011 19:45:37 -0500, Tobjan Brejicz <tobangi(a)gmail.com>
I'm have some problem with my testing of
pycuda, which I don't
I want to understand example of how to pointwise multiply large vector
10000 length) use pycuda.
So I am trying the code from here:
about "multiply_them". It work correctly for length 400 vector, but I
replace 400 with larger number, like 550, and now immediately it is not
Blocks are limited to 512 threads. To submit more work, you likely want to
*both* blocks and grid of non-unit size. I've updated that doc example
to show how you can specify the grid size.
So what is wrong? Is my idea wrong to use
larger values, should I split
problem more into smaller piece?