Thank you so much for this help and very soon reply, I now understand what to do!

-Tob

On Mon, Feb 7, 2011 at 8:16 PM, Andreas Kloeckner <lists@informa.tiker.net> wrote:

Hi Tobjan,

Blocks are limited to 512 threads. To submit more work, you likely want to use

On Mon, 7 Feb 2011 19:45:37 -0500, Tobjan Brejicz <tobangi@gmail.com> wrote:

> I'm have some problem with my testing of pycuda, which I don't understand.

>

> I want to understand example of how to pointwise multiply large vector (like

> 10000 length) use pycuda.

>

> So I am trying the code from here:

>

> http://documen.tician.de/pycuda/

>

> about "multiply_them". It work correctly for length 400 vector, but I

> replace 400 with larger number, like 550, and now immediately it is not

> work.

*both* blocks and grid of non-unit size. I've updated that doc example

to show how you can specify the grid size.

Yes, exactly.

> So what is wrong? Is my idea wrong to use larger values, should I split up

> problem more into smaller piece?

HTH,

Andreas