On Tue, 22 May 2012 20:04:34 +0200, Ludwig Schmidt-Hackenberg <ludwig(a)iupr.com> wrote:
> Hi,
>
> I wanted to ask about the status/plans of including the new CUDA
> features introduced with Kepler into PyCUDA.
It seems all of the headline features would be supported without further work
on PyCUDA's side, namely nested parallelism, GPU Direct and improved queuing.
There is exactly one large-ish change in CUDA 5 from the driver API
side, and that is support for mip-mapped arrays. I personally don't view
this as a very pressing issue for most compute codes. It'll happen
eventually, but I'm not really in a hurry with it.
Hope that clarifies things,
Andreas
Hi,
I wanted to ask about the status/plans of including the new CUDA
features introduced with Kepler into PyCUDA.
Kind regards,
Ludwig
--
:::.: Ludwig Schmidt-Hackenberg
:::.: ludwig(a)iupr.com
:::.: AG Bildverstehen und Mustererkennung
:::.: Technische Universität Kaiserslautern
:::.: www.iupr.com
Hi,
I updated various packages (e.g. upgraded cuda to 4, most recent
pycuda, ubuntu nvidia-dev drivers) on ubuntu 11.10. Pycuda was working
fine before, but after rebuilding and reinstalling the newest version
I get:
ImportError: /usr/local/lib/python2.7/dist-packages/pycuda-2011.2.2-py2.7-linux-x86_64.egg/pycuda/_driver.so:
undefined symbol: cuMemAllocPitch_v2
when I import pycuda.gpuarray. Any ideas?
Thomas
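
One way to narrow down an "undefined symbol" error like the one above is to
check whether the CUDA driver library found by the dynamic loader exports the
symbol at all. The sketch below uses only ctypes; the helper name
library_exports and the library name "libcuda.so" are assumptions for
illustration, not something taken from this thread. A False result would
suggest that the installed driver library is older than the headers PyCUDA
was built against, but that is only one possible cause.

```python
import ctypes


def library_exports(library_name, symbol):
    """Return True if the shared library loads and exports `symbol`.

    `library_exports` is a hypothetical helper, not part of PyCUDA.
    """
    try:
        lib = ctypes.CDLL(library_name)
    except OSError:
        # The library was not found or could not be loaded on this machine.
        return False
    # ctypes looks symbols up lazily; a missing symbol raises AttributeError,
    # which hasattr() reports as False.
    return hasattr(lib, symbol)


# Example call (the library name is an assumption; on Linux it is often
# "libcuda.so" or "libcuda.so.1"):
# print(library_exports("libcuda.so", "cuMemAllocPitch_v2"))
```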
On Thu, 17 May 2012 16:28:33 -0700, Serge Rey <srey(a)asu.edu> wrote:
> hi all,
>
> we have been trying, unsuccessfully, to get pycuda installed on a new
> mac pro with a quadro fx 4800. we have nvidia toolkit and drivers
> installed correctly and can build the test binaries just fine.
>
> our error/log messages are below - if anyone has suggestions we would
> be most appreciative.
>
> thanks in advance.
Try building with
CC=gcc python setup.py build
? (shot in the dark)
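
For anyone scripting the build, the same suggestion can be sketched in
Python by overriding CC in the environment passed to setup.py. The helper
names build_env and run_build are made up for this example, and whether
switching to gcc actually fixes the clang errors in this report is not
confirmed here.

```python
import os
import subprocess
import sys


def build_env(compiler="gcc", base=None):
    """Return a copy of the environment with CC set to `compiler`."""
    env = dict(os.environ if base is None else base)
    env["CC"] = compiler  # equivalent to prefixing the command with CC=gcc
    return env


def run_build(source_dir, compiler="gcc"):
    """Run `python setup.py build` in source_dir with CC overridden."""
    return subprocess.call(
        [sys.executable, "setup.py", "build"],
        cwd=source_dir,
        env=build_env(compiler),
    )


# Example call (the path is a placeholder):
# run_build("/path/to/pycuda-2011.2.2")
```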
hi all,
we have been trying, unsuccessfully, to get pycuda installed on a new
mac pro with a quadro fx 4800. we have nvidia toolkit and drivers
installed correctly and can build the test binaries just fine.
our error/log messages are below - if anyone has suggestions we would
be most appreciative.
thanks in advance.
--
Sergio (Serge) Rey
Professor, School of Geographical Sciences and Urban Planning
GeoDa Center for Geospatial Analysis and Computation
Arizona State University
http://geoplan.asu.edu/rey
Editor, International Regional Science Review
http://irx.sagepub.com
pycuda-2011.2.2 python setup.py build
running build
running build_py
creating build
creating build/lib.macosx-10.5-x86_64-2.7
creating build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/__init__.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/_cluda.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/_mymako.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/autoinit.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/characterize.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/compiler.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/cumath.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/curandom.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/debug.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/driver.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/elementwise.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/gpuarray.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/reduction.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/scan.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
copying pycuda/tools.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda
creating build/lib.macosx-10.5-x86_64-2.7/pycuda/gl
copying pycuda/gl/__init__.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda/gl
copying pycuda/gl/autoinit.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda/gl
creating build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
copying pycuda/sparse/__init__.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
copying pycuda/sparse/cg.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
copying pycuda/sparse/coordinate.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
copying pycuda/sparse/inner.py -> build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
copying pycuda/sparse/operator.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
copying pycuda/sparse/packeted.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
copying pycuda/sparse/pkt_build.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/sparse
creating build/lib.macosx-10.5-x86_64-2.7/pycuda/compyte
copying pycuda/compyte/__init__.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/compyte
copying pycuda/compyte/array.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/compyte
copying pycuda/compyte/dtypes.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/compyte
copying pycuda/compyte/scan.py ->
build/lib.macosx-10.5-x86_64-2.7/pycuda/compyte
running build_ext
building '_driver' extension
creating build/temp.macosx-10.5-x86_64-2.7
creating build/temp.macosx-10.5-x86_64-2.7/src
creating build/temp.macosx-10.5-x86_64-2.7/src/cpp
creating build/temp.macosx-10.5-x86_64-2.7/src/wrapper
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/python
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/python/src
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/python/src/converter
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/python/src/object
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/smart_ptr
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/smart_ptr/src
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/thread
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/thread/src
creating build/temp.macosx-10.5-x86_64-2.7/bpl-subset/bpl_subset/libs/thread/src/pthread
/usr/bin/clang -fno-strict-aliasing -fno-common -dynamic -fwrapv -Wall
-g -DPYGPU_PACKAGE=pycuda -DHAVE_CURAND=1 -DBOOST_PYTHON_SOURCE=1
-DPYGPU_PYCUDA=1 -DBOOST_MULTI_INDEX_DISABLE_SERIALIZATION=1
-Dboost=pycudaboost -Isrc/cpp -Ibpl-subset/bpl_subset
-I/usr/local/cuda/include
-I/usr/local/Cellar/python/2.7.3/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/numpy/core/include
-I/usr/local/Cellar/python/2.7.3/Frameworks/Python.framework/Versions/2.7/include/python2.7
-c src/cpp/cuda.cpp -o
build/temp.macosx-10.5-x86_64-2.7/src/cpp/cuda.o -arch x86_64 -arch
i386 -isysroot /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX10.6.sdk
In file included from src/cpp/cuda.cpp:1:
In file included from src/cpp/cuda.hpp:29:
In file included from bpl-subset/bpl_subset/boost/foreach.hpp:78:
In file included from bpl-subset/bpl_subset/boost/range/rend.hpp:19:
In file included from bpl-subset/bpl_subset/boost/range/reverse_iterator.hpp:20:
In file included from
bpl-subset/bpl_subset/boost/iterator/reverse_iterator.hpp:12:
In file included from
bpl-subset/bpl_subset/boost/iterator/iterator_adaptor.hpp:15:
In file included from
bpl-subset/bpl_subset/boost/iterator/iterator_facade.hpp:26:
bpl-subset/bpl_subset/boost/type_traits/is_pod.hpp:45:35: error: 'T'
does not refer to a
value
BOOST_INTERNAL_IS_POD(T)
^
bpl-subset/bpl_subset/boost/type_traits/is_pod.hpp:26:47: note:
expanded from macro
'BOOST_INTERNAL_IS_POD'
#define BOOST_INTERNAL_IS_POD(T) BOOST_IS_POD(T)
^
bpl-subset/bpl_subset/boost/type_traits/intrinsics.hpp:139:37: note:
expanded from macro
'BOOST_IS_POD'
# define BOOST_IS_POD(T) __is_pod(T)
^
bpl-subset/bpl_subset/boost/config/suffix.hpp:424:72: note: expanded from macro
'BOOST_STATIC_CONSTANT'
# define BOOST_STATIC_CONSTANT(type, assignment) static const type
assignment
^
bpl-subset/bpl_subset/boost/type_traits/is_pod.hpp:38:20: note: declared here
template <typename T> struct is_pod_impl
^
bpl-subset/bpl_subset/boost/type_traits/is_pod.hpp:46:13: error: type
name requires a
specifier or qualifier
>::value));
^
bpl-subset/bpl_subset/boost/config/suffix.hpp:424:72: note: expanded from macro
'BOOST_STATIC_CONSTANT'
# define BOOST_STATIC_CONSTANT(type, assignment) static const type
assignment
^
In file included from src/cpp/cuda.cpp:1:
In file included from src/cpp/cuda.hpp:29:
In file included from bpl-subset/bpl_subset/boost/foreach.hpp:78:
In file included from bpl-subset/bpl_subset/boost/range/rend.hpp:19:
In file included from bpl-subset/bpl_subset/boost/range/reverse_iterator.hpp:20:
In file included from
bpl-subset/bpl_subset/boost/iterator/reverse_iterator.hpp:12:
In file included from
bpl-subset/bpl_subset/boost/iterator/iterator_adaptor.hpp:15:
In file included from
bpl-subset/bpl_subset/boost/iterator/iterator_facade.hpp:26:
bpl-subset/bpl_subset/boost/type_traits/is_pod.hpp:46:13: error: expected ')'
>::value));
^
bpl-subset/bpl_subset/boost/config/suffix.hpp:424:72: note: expanded from macro
'BOOST_STATIC_CONSTANT'
# define BOOST_STATIC_CONSTANT(type, assignment) static const type
assignment
^
bpl-subset/bpl_subset/boost/type_traits/is_pod.hpp:42:9: note: to match this '('
(::boost::type_traits::ice_or<
^
bpl-subset/bpl_subset/boost/config/suffix.hpp:424:72: note: expanded from macro
'BOOST_STATIC_CONSTANT'
# define BOOST_STATIC_CONSTANT(type, assignment) static const type
assignment
^
In file included from src/cpp/cuda.cpp:1:
In file included from src/cpp/cuda.hpp:29:
In file included from bpl-subset/bpl_subset/boost/foreach.hpp:78:
In file included from bpl-subset/bpl_subset/boost/range/rend.hpp:19:
In file included from bpl-subset/bpl_subset/boost/range/reverse_iterator.hpp:20:
In file included from
bpl-subset/bpl_subset/boost/iterator/reverse_iterator.hpp:12:
In file included from
bpl-subset/bpl_subset/boost/iterator/iterator_adaptor.hpp:15:
In file included from
bpl-subset/bpl_subset/boost/iterator/iterator_facade.hpp:26:
bpl-subset/bpl_subset/boost/type_traits/is_pod.hpp:46:20: error:
expected expression
>::value));
^
In file included from src/cpp/cuda.cpp:1:
In file included from src/cpp/cuda.hpp:34:
In file included from bpl-subset/bpl_subset/boost/python.hpp:11:
In file included from bpl-subset/bpl_subset/boost/python/args.hpp:8:
In file included from bpl-subset/bpl_subset/boost/python/detail/prefix.hpp:13:
In file included from
bpl-subset/bpl_subset/boost/python/detail/wrap_python.hpp:142:
In file included from
/usr/local/Cellar/python/2.7.3/Frameworks/Python.framework/Versions/2.7/include/python2.7/Python.h:126:
/usr/local/Cellar/python/2.7.3/Frameworks/Python.framework/Versions/2.7/include/python2.7/modsupport.h:27:65:
warning:
'format' attribute argument not supported: PyArg_ParseTuple
PyAPI_FUNC(int) PyArg_ParseTuple(PyObject *, const char *, ...)
Py_FORMAT_PARSETUPLE(...
^
/usr/local/Cellar/python/2.7.3/Frameworks/Python.framework/Versions/2.7/include/python2.7/pyport.h:871:57:
note:
expanded from macro 'Py_FORMAT_PARSETUPLE'
#define Py_FORMAT_PARSETUPLE(func,p1,p2) __attribute__((format(func,p1,p2)))
^
1 warning and 4 errors generated.
error: command '/usr/bin/clang' failed with exit status 1
➜ pycuda-2011.2.2
On Mon, 14 May 2012 15:20:41 -0700, "Eli Stevens (Gmail)" <wickedgrey(a)gmail.com> wrote:
> I've seen this error a few times, but it's not reproducible. Can
> anyone give any insight into what might be going wrong?
>
> Traceback (most recent call last):
> File "/home/elis/edit/work/dev/mms/common/util/threads.py", line 219, in run
> mod = cudahelper.compileSourceModule(kernel.code_str,
> kernel.buildOptions_list)
> File "/home/elis/edit/work/dev/mms/common/util/cudahelper.py", line
> 551, in compileSourceModule
> return SourceModule(const_src + textwrap.dedent(source),
> options=list(default_build_options | set(build_options)),
> nvcc=os.path.join(os.getenv('CUDA_HOME', '/usr/local/cuda'), 'bin',
> 'nvcc'), **kwargs)
> File "/home/elis/venv/dev/local/lib/python2.7/site-packages/pycuda-2011.2.2-py2.7-linux-x86_64.egg/pycuda/compiler.py",
> line 286, in __init__
> self.module = module_from_buffer(cubin)
> LaunchError: cuModuleLoadDataEx failed: launch failed -
>
> This happened in some (but not all, I don't think) of the threads that
> got launched at roughly the same time.
>
> A quick google search didn't turn up anything relevant. Since I can't
> reproduce it, I haven't had much traction on debugging it. Is the
> SourceModule threadsafe? We're using it from a number of python
> threads ATM, so if it's not threadsafe that could explain it. Any
> clues would be much appreciated. :)
Easy to test: does it run fine single-threaded? :) Also, if you're using
the prefork mechanism, that's definitely not thread-safe, because all
threads share a pipe to communicate with the pre-forked nvcc runner.
Andreas
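
If compilation does turn out to be the part that misbehaves under threads,
one conservative workaround is to let only one thread compile at a time. The
following is a minimal sketch using a plain threading.Lock; the names
serialized and _compile_lock are invented for this example, and it is an
assumption, not an established fix, that serializing the calls makes the
LaunchError go away.

```python
import functools
import threading

# A single process-wide lock shared by every wrapped function.
_compile_lock = threading.Lock()


def serialized(func):
    """Wrap `func` so that only one thread executes it at any moment."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        with _compile_lock:
            return func(*args, **kwargs)
    return wrapper


# Example use with PyCUDA (left commented out so the sketch runs without a GPU):
# from pycuda.compiler import SourceModule
# compile_module = serialized(SourceModule)
# mod = compile_module(kernel_source)
```

The lock only serializes the compile-and-load step; kernel launches from the
different threads are not affected by it.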
I've seen this error a few times, but it's not reproducible. Can
anyone give any insight into what might be going wrong?
Traceback (most recent call last):
File "/home/elis/edit/work/dev/mms/common/util/threads.py", line 219, in run
mod = cudahelper.compileSourceModule(kernel.code_str,
kernel.buildOptions_list)
File "/home/elis/edit/work/dev/mms/common/util/cudahelper.py", line
551, in compileSourceModule
return SourceModule(const_src + textwrap.dedent(source),
options=list(default_build_options | set(build_options)),
nvcc=os.path.join(os.getenv('CUDA_HOME', '/usr/local/cuda'), 'bin',
'nvcc'), **kwargs)
File "/home/elis/venv/dev/local/lib/python2.7/site-packages/pycuda-2011.2.2-py2.7-linux-x86_64.egg/pycuda/compiler.py",
line 286, in __init__
self.module = module_from_buffer(cubin)
LaunchError: cuModuleLoadDataEx failed: launch failed -
This happened in some (but not all, I don't think) of the threads that
got launched at roughly the same time.
A quick google search didn't turn up anything relevant. Since I can't
reproduce it, I haven't had much traction on debugging it. Is the
SourceModule threadsafe? We're using it from a number of python
threads ATM, so if it's not threadsafe that could explain it. Any
clues would be much appreciated. :)
Thanks,
Eli
Hi all,
just wanted to let you know that PyCUDA now has Japanese docs available,
thanks to Norio Saito.
http://translation.saitonorio.com/pycuda/
(link is also in the English docs)
Andreas
On Wed, 2 May 2012 09:11:28 +0900, SungHwan Choi <sunghwanchoi91(a)gmail.com> wrote:
> Thank you for the reply.
> After I sent the first email, I ran Dependency Walker. The program reported
> that driver.pyd cannot connect to python27.dll, msvcp90.dll and msvcr90.dll,
> but I do not know how to resolve this problem. Please share your know-how.
a) please keep the list cc'd
b) a quick resolution to this would be to search for these DLLs and dump
them into the same directory as _driver.pyd.
Andreas
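
Suggestion (b) can be sketched as a small script that searches a list of
directories for the named DLLs and copies each one next to the extension
module. The function name copy_dlls_next_to and the example paths are
assumptions made for illustration; the right search directories depend on
the local Python and Visual C++ runtime installation.

```python
import os
import shutil


def copy_dlls_next_to(target_file, dll_names, search_dirs):
    """Copy each named DLL found in search_dirs into target_file's directory.

    Returns the list of DLL names that were not found anywhere.
    """
    dest_dir = os.path.dirname(os.path.abspath(target_file))
    missing = []
    for name in dll_names:
        for directory in search_dirs:
            candidate = os.path.join(directory, name)
            if os.path.isfile(candidate):
                # Skip the copy if the DLL already sits in the destination.
                if os.path.abspath(candidate) != os.path.join(dest_dir, name):
                    shutil.copy2(candidate, dest_dir)
                break
        else:
            # No search directory contained this DLL.
            missing.append(name)
    return missing


# Example call (all paths are placeholders):
# copy_dlls_next_to(
#     r"C:\Python27\Lib\site-packages\pycuda\_driver.pyd",
#     ["python27.dll", "msvcp90.dll", "msvcr90.dll"],
#     [r"C:\Python27", r"C:\Windows\System32"],
# )
```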