I wrote a simple script with 200 concurrent threads that makes http requests and processes the results. I need to do around 50,000,000 requests. Script is simple modification of "twistedless solution" suggested here (the first reply).
If I want to run it on a 10-nodes AWS cluster, how can I parallelize it? If I just copy that script onto one node and run it, what happens? Same as on a local machine?
Being new to all this, I am need a starting point. Thanks!