I have some PyCUDA code that uses two kernels. Each kernel runs fine on its
own, but when I run them together I get a memory error:
"LogicError: cuMemcpyDtoH failed: an illegal memory access was encountered".
In the second kernel, "DotKernel", I can't change the values of any shared
array or global array. Could you please have a look at the code? Thank you.