How to choose between different concurrent method available in Python?

Question

There's different ways of doing concurrent in Python, below is a simple list:

process-based: process.Popen, multiprocessing.Process, old fashioned os.system, os.popen, os.exe*
thread-based: threading.Thread
microthread-based: greenlet

I know the difference between thread-based concurrency and process-based concurrency, and I know some (but not too much) about GIL's impact in CPython's thread support.

For a beginner who want to implement some level of concurrency, how to choose between them? Or, what's the general difference between them? Are there any more ways to do concurrent in Python?

I'm not sure if I'm asking the right question, please feel free to improve this question.

It sounds like you know the difference between the different types of concurrency, in which case you already know the answer to your question. Also, a more answerable question would identify the specific situation, in order to choose between them. There are plenty of websites discussing the general difference between concurrency methods — lxop
– lxop, Commented Jan 14, 2013 at 22:22
As a side note: You should not be using the os.* methods for process-based concurrency. If multiprocessing (or a similar third-party module) isn't what you want, and you just want to exec a child process, use the subprocess module. — abarnert
– abarnert, Commented Jan 14, 2013 at 22:26
+1 to @lxop. The key things we'd need to know before we could even begin to answer this are whether you're expecting to be IO-bound or CPU-bound, what kind of data sharing you need between your threads of execution (using the term loosely to mean all three), and whether you already inherently need a main loop for some reason (e.g., a GUI or a network server—in which case the answer may be "none of the above, do it in a single-threaded event loop"). — abarnert
– abarnert, Commented Jan 14, 2013 at 22:28
@abarnert I think your comment would be a good answer to this question. There's a huge gap between knowing the difference (like my current knowledge) and choosing the right method when dealing with a real problem. — yegle
– yegle, Commented Jan 15, 2013 at 4:03
@yegle: Well, I wrote it as a comment because I suck at brevity and conciseness unless it's forced on me. As you can see from the version I've now posted as an answer. :) — abarnert
– abarnert, Commented Jan 15, 2013 at 22:15

abarnert · Accepted Answer · 2013-01-15 21:36:45Z

The reason all three of these mechanisms exist is that they have different strengths and weaknesses.

First, if you have huge numbers of small, independent tasks, and there's no sensible way to batch them up (typically, this means you're writing a C10k server, but that's not the only possible case), microthreads win hands down. You can only run a few hundred OS threads or processes before everything either bogs down or just fails. So, either you use microthreads, or you give up on automatic concurrency and start writing explicit callbacks or coroutines. This is really the only time microthreads win; otherwise, they're just like OS threads except a few things don't work right.

Next, if your code is CPU-bound, you need processes. Microthreads are an inherently single-core solution; Threads in Python generally can't parallelize well because of the GIL; processes get as much parallelism as the OS can handle. So, processes will let your 4-core system run your code 4x as fast; nothing else will. (In fact, you might want to go farther and distribute across separate computers, but you didn't ask about that.) But if your code is I/O-bound, core-parallelism doesn't help, so threads are just as good as processes.

If you have lots of shared, mutable data, things are going to be tough. Processes require explicitly putting everything into sharable structures, like using multiprocessing.Array in place of list, which gets nightmarishly complicated. Threads share everything automatically—which means there are race conditions everywhere. Which means you need to think through your flow very carefully and use locks effectively. With processes, an experienced developers can build a system that works on all of the test data but has to be reorganized every time you give it a new set of inputs. With threads, an experienced developer can write code that runs for weeks before accidentally and silently scrambling everyone's credit card numbers.

Whichever of those two scares you more—do that one, because you understand the problem better. Or, if it's at all possible, step back and try to redesign your code to make most of the shared data independent or immutable. This may not be possible (without making things either too slow or too hard to understand), but think about it hard before deciding that.

If you have lots of independent data or shared immutable data, threads clearly win. Processes need either explicit sharing (like multiprocessing.Array again) or marshaling. multiprocessing and its third-party alternatives make marshaling pretty easy for the simple cases where everything is picklable, but it's still not as simple as just passing values around directly, and it's also a lot slower.

Unfortunately, most cases where you have lots of immutable data to pass around are the exact same cases where you need CPU parallelism, which means you have a tradeoff. And the best answer to this tradeoff may be OS threads on your current 4-core system, but processes on the 16-core system you have in 2 years. (If you organize things around, e.g., multiprocessing.ThreadPool or concurrent.futures.ThreadPoolExecutor, and trivially switch to Pool or ProcessPoolExecutor later—or even with a runtime configuration switch—that pretty much solves the problem. But this isn't always possible.)

Finally, if your application inherently requires an event loop (e.g., a GUI app or a network server), pick the framework you like first. Coding with, say, PySide vs. wx, or twisted vs. gevent, is a bigger difference than coding with microthreads vs. OS threads. And, once you've picked the framework, see how much you can take advantage of its event loop where you thought you needed real concurrency. For example, if you need some code to run every 30 seconds, don't start a thread (micro- or OS) for that, ask the framework to schedule it however it wants.

One thing I should have mentioned: C extension modules can release the GIL. And some important ones, like numpy, do so in many cases where it's useful to. Which means, in those cases, you can use threads instead of processes, and still get core parallelism.

Collectives™ on Stack Overflow

How to choose between different concurrent method available in Python?

1 Answer 1

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related