Re: [Cython] OpenMP support

Dag Sverre Seljebotn Fri, 11 Mar 2011 04:16:09 -0800

On 03/11/2011 12:37 PM, Stefan Behnel wrote:

Dag Sverre Seljebotn, 11.03.2011 08:56:
Basically, I'm +1 to anything that can make me
pretend the GIL doesn't exist, even if it comes with a 2x performancehit:Because that will make me write parallell code (which I can't bebothered
to do in Cython currently), and I have 4 cores on the laptop I use for
debugging, so I'd still get a 2x speedup.
Perhaps the long-term solution is something like an "autogil" mode could
work where Cython automatically releases the GIL on blocks where it can
(such as a typed for-loop), and acquires it back when needed (an
exception-raising if-block within said for-loop).
I assume you mean this to become a decorator or other option writteninto the code.
And when doing
multi-threading, GIL-requiring calls are dispatched to a masterGIL-holding
thread (which would not be a worker thread, i.e. on 4 cores you'd have 4
workers + 1 GIL-holding support thread). So the advice for speedingup codeis simply "make sure your code is all typed", just like before, butpeople
can follow that advice without even having to learn about the GIL.
The GIL does not only protect the interpreter core. It also protects Clevel data structures in user code and keeps threaded code fromrunning amok. Releasing and acquiring it doesn't come for free either,so besides likely breaking code that was not specifically written tobe reentrant, releasing it automatically may also introduce aperformance penalty for many users.

The intention was that the GIL would be acquired in exceptionalcircumstances (doesn't matter for overall performance) or duringdebugging (again don't care about performance). But I agree the ideaneeds more thought on the possible pitfalls.

I'm very happy the GIL exists, and I'm against anything that tries todisable it automatically. Threading is an extremely dangerousprogramming model. The GIL has its gotchas, too, but it stillsimplifies it quite a bit. Actually, threading is so complex and easyto get wrong, that any threaded code should always be writtenspecifically to support threading. Explicitly acquiring and releasingthe GIL is really just a minor issue on that path.

I guess the point is that OpenMP takes that "extremely dangerousprogramming model" and makes it tractable, at least for a class oftrivial problems (not necessarily SIMD, but almost).

BTW, threading is often used simply because how how array data is laidout in memory. Typical usecase is every thread write to differentnon-overlapping blocks of the same array (and read from the same inputarrays that are not changed). Then you move on to step B, which does thesame, but perhaps blocks the arrays in a different way between threads.Then step C blocks the data in yet another way, etc. But at each stepit's just "input arrays, non-overlapping blocks in output arrays",global parameters, local loop counters.

(One doesn't need to use threads, there was another thread onmultiprocessing + shared memory arrays.)

Just saying that not all use of threads is "extremely dangerous", andOpenMP exists explicitly to dumb threading down for those cases.


Dag Sverre
_______________________________________________
cython-devel mailing list
cython-devel@python.org
http://mail.python.org/mailman/listinfo/cython-devel

Re: [Cython] OpenMP support

Reply via email to