Since the application from which I require most updated data every time does not send change-notifications, I am required to poll it every second to get an update. Since there are thousands of items for which I need to get an update every second, I thought of designing an application with thousands of pollers [of course, I would welcome your suggestion for a better solution].
Got to know from this post that Java VM on each server can support a lot of threads. Just to mention that I'm not bound by a language.
Now, I'm trying to figure out how to make it scalable and make it work in distributed environment.
One idea is to create a master server having the list of thousands of items to poll. It passes a few of them to slave servers for polling while getting heartbeats from them. The other one is about servers talking to each other sharing index-ranges (probably for a file in S3) with each other about who's working on what items. I'm not even sure if any of them even work.
I couldn't find any frameworks that could help me with it. Or as a newbie, probably I don't know what to look for.
What would be your suggestion? Any pointers would help. Really appreciate it.