On 11/06/2012 03:49, Andreas Kloeckner wrote:
Timo Burggraf<TimoFleckenstein(a)web.de> writes:
> I am not sure, if this was understandable
I hope so. My problem is, that
> this process is not vectorizable, since I write with different Point Tupels
> into the same uint8 Entry.
> My question is: Is there a way to use a bool-linear- array in OpenCL? Or is
> there a better way at all?
If I understand your question, the problem is different threads doing
bit twiddles on the same uint8 at the same time ? Could you arrange your
threads so that one thread writes one entire uint8 and the thread
conflict is avoided ?
Otherwise perhaps you could use an atomic_or and arrange data in 32 bit