I'm getting ~7x speed-up on my GTX280:
*You may want to consider the "mem_alloc_host" (more at
On Wed, Jun 24, 2009 at 5:03 AM, Stanislav Ravas <ravas(a)tind.sk> wrote:
I'm new to both CUDA and PyCUDA. I'm trying to write binary
erosion/dilation accelerator module for my project, but they are slower
then scipy.ndimage's functions.
I don't know if i'm doing something wrong(as I said, I'm new), or nvidia
nvs140m in my notebook is just not fast enough.
It would be great if someone with more powerful card could try it, or
may be some guru :) could have a look into my sources?
Source is attached.
If I get it to work, I'll share it for all :)
Anyway, CUDA and PyCUDA are great work!
PyCUDA mailing list
Ph.D. Candidate, Brain & Computer Sciences
Massachusetts Institute of Technology, USA