[PyOpenCL] Passing a floatn table to a kernel

Bogdan Opanchuk mantihor at gmail.com
Mon Dec 13 18:48:54 PST 2010


Hello David,

On Tue, Dec 14, 2010 at 2:54 AM, David Libault <david.libault at gmail.com> wrote:
> Thank you for your answer. I tried your proposition, and, as you say,
> it returns 16 bytes instead of 12.
>
> Strange bug in Apple's implementation...

I would not call that a bug; most probably they decided that improved
float3 fetching/storing speed (16 bytes can be transferred using the
single instruction and properly coalesced, as opposed to 12 bytes) is
worth increased storage space. If you were using "new float3[...]" to
allocate buffer, it would pass unnoticed, but when you allocated
memory in Python, you bumped into this design decision.

Best regards,
Bogdan



More information about the PyOpenCL mailing list