Keith Brafford <keith.brafford(a)gmail.com> writes:
When I run the particle animation demo with an Intel
or AMD CPU context it
only seems to be using 1 processor core--the task manager shows no more
than 1 of the hyperthreads being pegged. If I increase the number of
particles it works, but it runs slower and doesn't try to use any of the
other 7 threads. Is there some trick to get it to seamlessly use all of
the compute devices that the context offers?
When I run it on the GPU it seems to use all of the devices naturally.
First of all, what operating system are you using? Next, since you speak
of 'particles', I suspect you are speaking about a particular code that
uses PyOpenCL. Do you know what grid configuration this code is using?
If the grid consists of 1 workgroups, then only using 1 thread is
entirely expected (but doing this would not necessarily be common).
FWIW, the AMD CPU OpenCL ICD uses all my cores without needing to be