"Schock, Jonathan" <jonathan.schock(a)tum.de> writes:
I am not quite getting the function of the event system together with
I want to enqueue a non-blocking copy function which returns an event
for a kernel to wait on:
wev1 = cl.enqueue_copy(IOqueue, device_image, host_image,
is_blocking=False, origin=(0,0,0), region=host_image_shape)
Both queues are defined as OoO queues in the same context on the same
In my profiling I can see the start of the kernel, before the copying is
finished. Does that mean, I have to use blocking copies,
or am I doing something else wrong?
How are you measuring? What implementation is this on? (Only Intel CPU
supports OoO queues as of now, as far as I know.) Can you show code to
reproduce? FWIW, your code snippet looks correct to me, in the sense
that the kernel should see all results of the copy.