I've been looking at Esper (and Storm) for stream processing.. Esper seems to do exactly what I want.. i.e. roling means, medians, complex queries, etc... but one thing has me wondering.
How would I scale out to multiple instances with Esper?
As far as I understand, Storm handles distributed processing, but with Esper you're on your own.
I wouldn't need to do it for the forseable future, but as we grow, so would our data volumes, would then need to scale out as well. Most likely we would be deployed in Amazon EC2.
Would I need to run multiple servers and shard data before sending them to my Esper application?
Is there a more graceful way of handling it?
-Sajal