On Donnerstag 22 April 2010, Michael Rule wrote:
NV Tesla T10,
I assume its something I am doing wrong...
If you're using NV hardware, it helps to know the CUDA literature. You
must supply a local_size to partition your work into NV's thread
blocks. If you don't you're only submitting one thread block, which has
hardware size limits.
PS: Please keep replies on the list. Thanks.