
My client asked me to build a realtime application that can do chat and send images and videos, all in realtime. He asked me to come up with my own technology stack, so I did a lot of research and concluded that the easiest one to build would use the tech stack below (a rough sketch of how the Socket.io and Redis pieces wire together follows the list):

1) Node.js (runtime/language), with the cluster module to max out the CPU cores on a single server instance

2) Socket.io - realtime framework

3) Redis - pub/sub so messages fan out across multiple server instances

4) Nginx - to reverse proxy and load-balance across multiple servers

5) Amazon EC2 - to run the server

6) Amazon S3 and CloudFront - to store the images/videos and deliver them
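
To make the idea concrete, here is a minimal sketch of how I picture the Socket.io and Redis pieces fitting together on one instance (the port, Redis host, and event names are placeholders, and I have not benchmarked any of this):

```js
// One chat instance: socket.io attached to a plain HTTP server, with the
// socket.io-redis adapter so broadcasts also reach clients connected to
// the other instances behind the load balancer.
const http = require('http');
const server = http.createServer();
const io = require('socket.io')(server);
const redisAdapter = require('socket.io-redis');

io.adapter(redisAdapter({ host: '127.0.0.1', port: 6379 })); // assumed local Redis

io.on('connection', (socket) => {
  socket.on('chat message', (msg) => {
    io.emit('chat message', msg); // fanned out to every instance via Redis pub/sub
  });
});

server.listen(3000); // arbitrary port; Nginx would sit in front of several of these
```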

Correct me if I'm wrong about the above stack. My real question is: can the above tech stack scale to 1,000,000 messages per second (text, images, videos)?

If anyone has experience with Node.js and Socket.io, could you give me some insights on, or alternatives to, the above stack?

Regards,

SinusGob

  • If you want push notifications, I suggest APNs/GCM rather than socket.io; for the chat server itself, I suggest an open-source XMPP implementation, as WhatsApp and others use. – Mehdi Jul 09 '16 at 07:28
  • This is a bit of a naive question. Can a system that can handle 1,000,000 messages per second be built out of the pieces you've named? Yes. Do you or we have any idea how many servers, load balancers, bandwidth, network cards, other custom development, etc. it might take to get to that scale? No. There's very little specified here in terms of detail to go that far. – jfriend00 Jul 09 '16 at 07:29
  • If you could process a single message in 5 ms (a wild number pulled out of thin air, since you've provided no context at all about what the server needs to do), then you could do 200 messages/sec/core, which would need 5,000 cores, plus quite a bit of network bandwidth, to do 1,000,000 messages/sec. I'd suggest you start building proof-of-concept test harnesses that you can run tests against. That's the only way to really know if you can do what you need to do. Measure. – jfriend00 Jul 09 '16 at 07:31
  • @jfriend00 I know this is a bit of a naive question, but my client asked me whether it could scale up to 1 million messages per second, and I don't know what to reply. That's why I asked on SO. – sinusGob Jul 09 '16 at 07:31
  • @jfriend00 I will look into measuring a proof of concept soon, but the question is: if you were in my shoes, what kind of stack would you go with for this use case, i.e. realtime chat (text, images, videos)? – sinusGob Jul 09 '16 at 07:34
  • The only real answer is that it can be built, but you'd have to run a proof of concept study (at your client's expense) to really offer an accurate estimate of how much infrastructure would be needed to solve their specific problem at 1,000,000 messages/sec. And, you need to educate the client that anyone else who offers a different answer is blowing smoke up their xxxx because these aren't questions that can be answered with this level of information and no measuring/benchmarking of prototypes. A lot more details must be known, prototyped and measured. – jfriend00 Jul 09 '16 at 07:34
  • I agree with @jfriend00 that this is not the right question. I don't think the question should be "how many messages can the system handle", but instead "can I maintain a system that is handling 1m messages per second?" For example, sustaining 1m messages per second is fine, but what happens during deployments, how will you migrate data during failures, can you provide any sensible delivery guarantees at this rate, etc.? Sustaining high message volumes in isolation is relatively easy; providing continuity is hard. **Disclaimer: I am the co-founder of [Ably realtime](https://www.ably.io)** – Matthew O'Riordan Jul 10 '16 at 18:04

1 Answer


My real question is: can the above tech stack scale to 1,000,000 messages per second (text, images, videos)?

Sure it can. With the right design and enough hardware. The question your client should really be asking is not whether it can be made to go that big, but at what cost and with what practicality it can be done, and whether those are the best choices.

Let's look at each piece you've mentioned:

node.js - For an I/O centric app, it's an excellent choice for high scale and it can scale by deploying many CPUs in a cluster (both multi-process per server and multi-server). How practical this type of scale is depends a lot on what kind of shared data all these server processes need access to. Usually, the data store ultimately ends up being the harder bottleneck in scaling because it's easy to throw more servers at the request processing. It's not so easy to throw more hardware at a centralized data store. There are ways to do that, but it depends a lot on the demands of the app for how you do it and how hard it is.
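
For reference, a minimal sketch of the multi-process-per-server part using the built-in cluster module (the port and restart policy are placeholders, not a production setup):

```js
// Fork one worker per CPU core so a single machine's cores are all used.
const cluster = require('cluster');
const os = require('os');
const http = require('http');

if (cluster.isMaster) {
  os.cpus().forEach(() => cluster.fork());
  cluster.on('exit', () => cluster.fork()); // naive restart when a worker dies
} else {
  http.createServer((req, res) => {
    res.end('handled by pid ' + process.pid + '\n');
  }).listen(3000); // workers share the listening port via the master
}
```

If socket.io runs inside each worker, you also need sticky sessions (so the polling fallback keeps hitting the same process) and an adapter such as socket.io-redis so broadcasts and rooms span processes.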

socket.io - If you need efficient server push of smallish messages, then socket.io is probably the best way to go because it's the most efficient at pushing to the client. It is not great at all types of transport, though. For example, I wouldn't be moving large images or video around through socket.io, as there are more purpose-built ways to do that. So, the use of socket.io depends a lot on what exactly the app wants to use it for. If you wanted to push a video to a client, you could also push just a URL and have the client turn around and request the video via a regular HTTP URL using well-known high scale technology.
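
As a rough illustration of that URL-push approach (the event names and CloudFront domain below are made up):

```js
// Push a small JSON payload over the socket; receivers fetch the actual
// bytes over plain HTTPS from the CDN, not from the chat server.
const http = require('http');
const server = http.createServer().listen(3000); // arbitrary port
const io = require('socket.io')(server);

io.on('connection', (socket) => {
  // A sender announces media it has already uploaded elsewhere.
  socket.on('video ready', (videoKey) => {
    socket.broadcast.emit('new video', {
      url: 'https://dxxxxxxxxxxxx.cloudfront.net/' + videoKey // placeholder domain
    });
  });
});
```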

Redis - Again, great for some things, not great at everything. So, it really depends upon what you're trying to do. What I explained earlier is that the design of your data store and the number of transactions through it is probably where your real scale problems lie. If I were starting this job, I'd start with an understanding of the data storage needs for a server, transactions per second of various types, caching strategy, redundancy, fail-over, data persistence, etc... and design the high scale access to data first. I wouldn't be entirely sure redis was the preferred choice. I'd probably suggest you need a high scale database guy as a consultant early in the project.
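
If Redis does stay in the picture, one pattern that plays to its strengths is a bounded hot cache of recent messages per room, with a durable store of record living elsewhere. A minimal sketch, assuming the ioredis client and a local Redis (the key layout and limits are made up):

```js
const Redis = require('ioredis');
const redis = new Redis(); // assumes Redis on localhost:6379

// Keep only the most recent 100 messages per room as a hot cache.
async function cacheMessage(roomId, message) {
  const key = 'room:' + roomId + ':recent';
  await redis.lpush(key, JSON.stringify(message));
  await redis.ltrim(key, 0, 99); // bound memory use per room
}

// Newest-first list of whatever is currently cached for a room.
async function recentMessages(roomId) {
  const raw = await redis.lrange('room:' + roomId + ':recent', 0, -1);
  return raw.map((s) => JSON.parse(s));
}
```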

Nginx - Lots of high scale sites use nginx, so it's certainly a good tool. Whether it's exactly the right tool for you depends upon your design. I'd probably work on this part last because it seems less central to the design; once the rest of the system is laid out, you can then consider what you need here.

Amazon EC2 - One of several possible choices. These choices are hard to compare apples to apples. Large scale systems have been built out of EC2, so there is proof of concept there, and the general architecture seems an appropriate match. If you wanted to know where the real gremlins are, you'd need a consultant who has done high scale work on EC2.

Amazon S3 - I personally know some very high storage and bandwidth sites using S3 for both video and images. It works for that.
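
One common pattern that keeps large media off the chat servers entirely is to hand clients short-lived presigned S3 URLs for upload and serve downloads through CloudFront. A minimal sketch with the v2 aws-sdk (the bucket name, region, and key are placeholders):

```js
const AWS = require('aws-sdk');
const s3 = new AWS.S3({ region: 'us-east-1' }); // placeholder region

// The client PUTs the file straight to S3 with this URL, so the upload
// never passes through the chat servers.
function uploadUrlFor(key, contentType) {
  return s3.getSignedUrl('putObject', {
    Bucket: 'my-chat-media', // placeholder bucket
    Key: key,
    ContentType: contentType,
    Expires: 300             // URL valid for 5 minutes
  });
}

console.log(uploadUrlFor('videos/example.mp4', 'video/mp4'));
```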

So ... these are generally likely good tools to use if they are used in the right way. Redis would be a question mark depending upon the storage needs of the actual application (you've provided zero requirements, and a database can't be selected with zero requirements). A more reasoned answer would be based on putting together a high level set of requirements that analyze what the system needs to be able to do to serve 1,000,000 whatever. Those requirements could be compared with known capabilities for some of these pieces to start a ballpark on scaling a system. Then, you'd have to put together some benchmarking tests to run on certain pieces of the system. As much of the success or failure would depend upon how the app was built and how the tools were used as upon which tools were selected. You can likely scale successfully with many different types of tools. Heck, Facebook runs on PHP (well, a highly modified, customized PHP that is not really typical PHP at all at runtime).
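
As a trivial starting point for that benchmarking, a throwaway probe with socket.io-client can put rough numbers on a single box before anything fancier (the server URL, client count, and send rate below are arbitrary, and a real test would need realistic payloads and many load-generating machines):

```js
// Open N connections, have each send a small text message on a timer,
// and count how many broadcast deliveries arrive per second.
const io = require('socket.io-client');

const URL = 'http://localhost:3000'; // assumed test server
const CLIENTS = 50;
const SEND_EVERY_MS = 100;           // each client sends ~10 msgs/sec
let received = 0;

for (let i = 0; i < CLIENTS; i++) {
  const socket = io(URL, { transports: ['websocket'] });
  socket.on('connect', () => {
    setInterval(() => socket.emit('chat message', 'ping from ' + i), SEND_EVERY_MS);
  });
  socket.on('chat message', () => { received++; });
}

setInterval(() => {
  console.log(received + ' messages delivered in the last second');
  received = 0;
}, 1000);
```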

jfriend00