hello,,,

i'm trying to translate from cu code to pycuda but got unexpected result because the flops value is too far
i think it because i don't really know how to translate it in pycuda...

especially in CUDA_CALL_SAFE and measure the time..

regards