[Hedge] Problem with Pycuda & hedge
Andreas Kloeckner
lists at informa.tiker.net
Thu Jul 28 23:42:12 PDT 2011
Hi Peter,
On Thu, 21 Jul 2011 11:53:29 +0200, Peter17 <peter017 at gmail.com> wrote:
> 2011/7/20 Paul Cazeaux <paul_cazeaux at brown.edu>:
> > - I had the same problem as Peter, with a "pycuda._driver.LogicError: cuFuncSetBlockShape failed: invalid value". More precisely, I get
> >     block = (
> >         given.devdata.smem_granularity,
> >         plan.parallelism.parallel,
> >         plan.aligned_image_dofs_per_microblock
> >         #//given.devdata.smem_granularity)
> > set to (512,1,0). Now I think that given.devdata.smem_granularity is always going to be 512 for my laptop's GeForce 9600 GT - so I hardcoded (512, 1, 1)
> > as the block shape, and it seems to work. Is there a big problem with that?
>
> Hi, Paul!
>
> Thanks for the tip! I didn't try to hardcode the value before. It is
> also working for me if I hardcode block to (512, 1, 1) in the 3
> functions function_prepare, function_prepared_call and
> function_prepared_timed_call of pycuda/pycuda/driver.py.
Paul and I just figured out that this was due to a bug in a patch that I
had accepted into PyCUDA in March. Should be better now. Sorry for the trouble.
Andreas
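
For anyone hitting the same "cuFuncSetBlockShape failed: invalid value" error: the block shape reported in the thread, (512,1,0), has a zero in its last entry, and CUDA requires every block dimension to be at least 1. Below is a minimal sketch of a pre-launch sanity check. It is not part of PyCUDA or hedge; the helper name check_block_shape is hypothetical, and the limits are assumed values for a compute capability 1.x device such as the GeForce 9600 GT mentioned above. On a real system they should be read from the device rather than hardcoded.

```python
# Limits below are ASSUMPTIONS for a compute capability 1.x device
# (e.g. a GeForce 9600 GT); query the actual device for real values.
MAX_THREADS_PER_BLOCK = 512
MAX_BLOCK_DIMS = (512, 512, 64)


def check_block_shape(block):
    """Return a list of problems with a CUDA block shape; empty if none found.

    Hypothetical helper for illustration, not a PyCUDA or hedge function.
    """
    if len(block) != 3:
        return ["expected 3 entries, got %d" % len(block)]

    problems = []
    threads = 1
    for axis, size, limit in zip("xyz", block, MAX_BLOCK_DIMS):
        if size < 1:
            # A zero (or negative) dimension is what triggers "invalid value".
            problems.append("block.%s is %d, must be at least 1" % (axis, size))
        elif size > limit:
            problems.append(
                "block.%s is %d, exceeds limit %d" % (axis, size, limit))
        threads *= size

    # Only check the total thread count if each dimension was itself valid.
    if not problems and threads > MAX_THREADS_PER_BLOCK:
        problems.append(
            "%d threads per block exceeds limit %d"
            % (threads, MAX_THREADS_PER_BLOCK))
    return problems


print(check_block_shape((512, 1, 0)))  # ['block.z is 0, must be at least 1']
print(check_block_shape((512, 1, 1)))  # []
```

A check like this only flags the symptom. Hardcoding (512, 1, 1) hides whatever produced the zero in the first place, so updating to a PyCUDA version with the fix is the better route.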