In my application I need to perform a (relatively) simple reduction of
the form: sum(f(x[i], y[i])) over two device allocations x, y. If
possible I would very much like to use the rather nice reduction code
already in the pycuda.reduction module.
However, the module only appears to work with GPUArrays and no raw
device allocations. Is anyone aware of any simple workarounds for this
-- other than creating actual GPUArrays for my data?
Show replies by date