Is it possible to perform a matrix-vector multiply using the gpuarray.dot() method (or perhaps a better one)?
I can multiply individual matrix rows by a vector:
    gpuarray.dot(d_A[0, :], d_b)
but I don't seem to be able to multiply a whole matrix by a vector.
It returns a result, but it is just a bogus integer, not an array.
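The row-by-row approach described above can be extended to the whole matrix by looping over the rows. The following is only a minimal sketch: the helper name matvec_by_rows and its to_host parameter are hypothetical, and NumPy stands in for the GPU so that the example runs anywhere. Using it with PyCUDA assumes that gpuarray.dot() accepts row slices, as the post reports.

```python
import numpy as np


def matvec_by_rows(dot, matrix, vector, to_host=float):
    """Compute matrix @ vector with one dot product per matrix row.

    dot     -- function returning the dot product of two 1-D arrays
    to_host -- converts each dot result into a host-side scalar
    """
    n_rows = matrix.shape[0]
    result = np.empty(n_rows, dtype=np.float64)
    for i in range(n_rows):
        result[i] = to_host(dot(matrix[i, :], vector))
    return result


# CPU stand-in so the sketch runs without a GPU.
A = np.arange(6, dtype=np.float32).reshape(2, 3)
b = np.array([1.0, 2.0, 3.0], dtype=np.float32)
y = matvec_by_rows(np.dot, A, b)

# With PyCUDA (assumption: gpuarray.dot accepts row slices, as in the post):
#   y = matvec_by_rows(gpuarray.dot, d_A, d_b, to_host=lambda r: r.get())
```

This issues one reduction per row, so it is unlikely to be fast for large matrices. A cuBLAS-backed matrix-vector routine, for example through a wrapper library such as scikit-cuda, is probably a better fit in that case.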
Applications Support Specialist
IM&T | Scientific Computing
E Chris.Watkins(a)csiro.au T +61 3 9545 2369
Private Bag 10, Clayton South, Vic, 3169
I have a question about CUDA-capable device detection using PyCUDA.
Does the driver.Device.count() function detect each GPU on dual-GPU cards,
such as the NVIDIA K2 or K80, as a separate device? That is, if I have one
K2 card, does count() return 2?
Also, is there a device attribute that identifies the other GPU(s) on
the same card?
Sorry if this is a trivial question, but I wasn't able to find a definitive
answer online, nor do I have a dual-GPU card available at the moment.
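One way to check this on a machine that does have such a card is to enumerate the devices and group them by board. The sketch below assumes that the PyCUDA build exposes device_attribute.MULTI_GPU_BOARD_GROUP_ID (a CUDA 6.0+ driver attribute); the helper names and the example records are made up for illustration, and query_devices() has not been run on a dual-GPU card here.

```python
def group_by_board(devices):
    """Group (index, name, board_group_id) records by board group ID.

    Records whose group ID is None (attribute unavailable) all end up
    under the None key, meaning their board is unknown.
    """
    boards = {}
    for index, name, group_id in devices:
        boards.setdefault(group_id, []).append((index, name))
    return boards


def query_devices():
    """Enumerate CUDA devices with PyCUDA (needs a working CUDA driver)."""
    import pycuda.driver as drv

    drv.init()
    # Assumption: only present when PyCUDA was built against CUDA >= 6.0.
    group_attr = getattr(drv.device_attribute, "MULTI_GPU_BOARD_GROUP_ID", None)
    records = []
    for index in range(drv.Device.count()):
        dev = drv.Device(index)
        group_id = dev.get_attribute(group_attr) if group_attr is not None else None
        records.append((index, dev.name(), group_id))
    return records


# Hypothetical records: one dual-GPU board plus one single-GPU card.
# On a real system, call group_by_board(query_devices()) instead.
example = [(0, "Tesla K80", 0), (1, "Tesla K80", 0), (2, "GeForce GTX 960", 1)]
boards = group_by_board(example)
```

If both GPUs of a dual-GPU card appear as separate entries sharing one group ID, then count() is reporting them as separate devices.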
I imagine that one of the main things that PyOpenCL and PyCUDA get used
for is computation with large arrays. As such, I can imagine that many
of you are sympathetic to the cause of trying to come up with simpler
abstractions that nonetheless yield high-performance code for such
computations.
ARRAY'16 is a workshop that concerns itself with programming language
aspects of computing with arrays, including language design,
compilation, libraries, and performance optimization. More information
on the workshop (including the full call for papers) may be found here:
The workshop will be held June 14 in Santa Barbara, and the deadline for
submissions is April 1.
I would be delighted to see some submissions from the PyCUDA/PyOpenCL
crowd! (Disclaimer: I am on the organizing committee.)
I have a freshly installed Ubuntu 14.04.3 distribution with CUDA 7.0 and PyCUDA 2015.1.3. I'm using an NVIDIA GTX 960 card and the latest driver, 361.28. The installation went well, but when I run test_driver.py I get:
============================= test session starts ==============================
platform linux2 -- Python 2.7.6, pytest-2.8.7, py-1.4.31, pluggy-0.3.1
rootdir: /opt/pycuda-2015.1.3, inifile:
collected 23 items
===================== 22 passed, 1 xfailed in 5.91 seconds =====================
Can you please help me understand why one test is reported as xfailed?
Thank you very much,
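For context on the summary line above: in pytest, "xfailed" counts tests that are marked as expected failures and did fail, which is different from an ordinary failure. The sketch below illustrates the marker with two made-up tests; it does not show which test in test_driver.py carries the marker, or why.

```python
import pytest


def test_addition():
    # An ordinary test: counted under "passed" when the assertion holds.
    assert 1 + 1 == 2


@pytest.mark.xfail(reason="hypothetical known limitation")
def test_known_limitation():
    # Marked as an expected failure: when this assertion fails, pytest
    # reports the test as "xfailed" instead of "failed".
    assert 1 + 1 == 3
```

Running pytest with the -rx option also prints the reason attached to each xfailed test.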