[PyCUDA] Parallel prefix scan
Andreas Kloeckner
lists at informa.tiker.net
Sun Mar 20 03:04:50 PDT 2011
Hi Nithin, Bryan, all,
On Wed, 23 Feb 2011 01:00:05 +0530, nithin s <nithin19484 at gmail.com> wrote:
> Wow.. thats quite a bit of work. .. though I must admit on first
> inspection of the kernel code it was evident that as the kernel goes
> up and down tree there is a great deal of waiting.
>
> Any how I don't quite have the energy to get another version working
> in the near future. I'm leaving it to the community to use the code or
> not.
Thanks very much for submitting this code.
PyCUDA and PyOpenCL now both have a scan implementation. I did not end
up using your code, but instead I ended up stealing directly from
thrust.
http://documen.tician.de/pycuda/array.html#module-pycuda.scan
http://documen.tician.de/pyopencl/array.html#module-pyopencl.scan
The code for both is actually one and the same file, using 'glue' called
py{opencl,cuda}._cluda...
Andreas
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://lists.tiker.net/pipermail/pycuda/attachments/20110320/45035804/attachment.pgp>
More information about the PyCUDA
mailing list