Python scalable chat server

Question

I've just begun learning sockets with Python. So I've written some examples of chat servers and clients. Most of what I've seen on the internet seems to use threading module for (asynchronous) handling of clients' connections to the server. I do understand that for a scalable server you need to use some additional tricks, because thousands of threads can kill the server (correct me if I'm wrong, but is it due to GIL?), but that's not my concern at the moment.

The strange thing is that I've found somewhere in Python documentation that creating subprocesses is the right way (unfortunately I've lost the reference, sorry :( ) for handling sockets.

So the question is: to use threading or multiprocessing? Or is there even better solution?

Please, give me the answer and explain the difference to me.

By the way: I do know that there are things like Twisted which are well-written. I'm not looking for a pre-made scalable server, I am instead trying to understand how to write one that can be scaled or will deal with at least 10k clients.

EDIT: The operating system is Linux.

@Ben James: Obviously it is easier if someone explains it to me rather then go through thousands of lines of code which I may not even understand. :) That's what stackoverflow is for, right? — freakish, Oct 21 '11 at 09:02
unfortunately, i fear that the answer depends heavily on the operating system you are running on. subprocesses may be the best for unix based system, when asynchronous i/o is the standard for windows systems... — Adrien Plisson, Oct 21 '11 at 09:03
@Adrien Plisson: Sorry, forgot to write it. It is Linux (I've edit the question). — freakish, Oct 21 '11 at 09:04

score 11 · Accepted Answer · answered Oct 21 '11 at 09:29

Facebook needed a scalable server so they wrote Tornado (which uses async). Twisted is also famously scalable (it also uses async). Gunicorn is also a top performer (it uses multiple processes). None of the fast, scalable tools that I know about uses threading.

An easy way to experiment with the different approaches is to start with the SocketServer module in the standard library: http://docs.python.org/library/socketserver.html . It lets you easily switch approaches by alternately inheriting from either ThreadingMixin or ForkingMixin.

Also, if you're interested in learning about the async approach, the easiest way to build your understanding is to read a blog post discussing the implementation of Tornado: http://golubenco.org/2009/09/19/understanding-the-code-inside-tornado-the-asynchronous-web-server-powering-friendfeed/

Good luck and happy computing :-)

Isnt it FriendFeed instead of Facebook? – Jesvin Jose Oct 21 '11 at 09:53 — Jesvin Jose, Oct 21 '11 at 09:53

score 0 · Answer 2 · answered Sep 26 '12 at 12:08

thousands of threads can kill the server (correct me if I'm wrong, but is it due to GIL?)

For one thing, GIL has nothing to do with no. of threads. If you're are doing IO within these threads, you could have hundreds of thousands of these threads without any problem from GIL or otherwise.

GIL comes into play when you have CPU intensive tasks.

See this very informative talk from David Beazly to know more about GIL.

Python scalable chat server

2 Answers2