[PyOpenCL] Passing a floatn table to a kernel
Bogdan Opanchuk
mantihor at gmail.com
Mon Dec 13 18:48:54 PST 2010
Hello David,
On Tue, Dec 14, 2010 at 2:54 AM, David Libault <david.libault at gmail.com> wrote:
> Thank you for your answer. I tried your proposition, and, as you say,
> it returns 16 bytes instead of 12.
>
> Strange bug in Apple's implementation...
I would not call that a bug; most probably they decided that improved
float3 fetching/storing speed (16 bytes can be transferred using the
single instruction and properly coalesced, as opposed to 12 bytes) is
worth increased storage space. If you were using "new float3[...]" to
allocate buffer, it would pass unnoticed, but when you allocated
memory in Python, you bumped into this design decision.
Best regards,
Bogdan
More information about the PyOpenCL
mailing list