Look up OpenCL / CUDA coalesced memory access on Stack Overflow; there are
plenty of threads there.
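
Roughly, the hardware serves global memory in whole aligned segments (the
exact size depends on the device), so a lone 4-byte access still pulls in a
full segment. When neighbouring threads touch neighbouring addresses, those
accesses can be combined into a few transactions; when they are scattered,
each thread tends to pay for its own segment. Below is a minimal CUDA sketch
of the difference, not your actual kernel: the kernel names, the array size
and the stride of 32 are all made up for illustration. What OpenCL calls
local memory is called shared memory in CUDA.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Coalesced: consecutive threads read consecutive 4-byte elements, so the
// loads of neighbouring threads fall into the same memory segments.
__global__ void copy_coalesced(const float* in, float* out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = in[i];
}

// Strided: consecutive threads read elements `stride` apart, so each 4-byte
// load tends to land in a different segment. The writes stay coalesced.
__global__ void copy_strided(const float* in, float* out, int n, int stride) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        int j = (int)(((long long)i * stride) % n);  // wrap to stay in bounds
        out[i] = in[j];
    }
}

int main() {
    const int n = 1 << 24;   // hypothetical problem size (64 MiB per array)
    const int stride = 32;   // hypothetical stride, in elements
    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;

    float *in = nullptr, *out = nullptr;
    cudaMalloc(&in, n * sizeof(float));
    cudaMalloc(&out, n * sizeof(float));
    cudaMemset(in, 0, n * sizeof(float));

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);
    float ms_coalesced = 0.0f, ms_strided = 0.0f;

    // Warm-up launch so one-time start-up cost is not counted in the timings.
    copy_coalesced<<<blocks, threads>>>(in, out, n);
    cudaDeviceSynchronize();

    cudaEventRecord(start);
    copy_coalesced<<<blocks, threads>>>(in, out, n);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);
    cudaEventElapsedTime(&ms_coalesced, start, stop);

    cudaEventRecord(start);
    copy_strided<<<blocks, threads>>>(in, out, n, stride);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);
    cudaEventElapsedTime(&ms_strided, start, stop);

    printf("coalesced: %.3f ms, strided: %.3f ms\n", ms_coalesced, ms_strided);

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(in);
    cudaFree(out);
    return 0;
}
```

Build it with nvcc and compare the two timings on your own card. If your
access pattern cannot be made contiguous, the usual workaround is to stage
the data in local (shared) memory with coalesced loads and do the scattered
accesses there.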
On 14 Aug 2015 13:55, "Joe Haywood" <haywoojr(a)mercyhealth.com> wrote:
Will you explain this to me a little more?
"One that jumps to the eye is that you're accessing 4 bytes of memory in
an arbitrary place, but every time you're really loading up, and then
writing back, a whole page! That's why it's so slow, even without atomic
operations. The solution is local memory."