I am trying to access this portal from a Google Cloud instance: http://mca.gov.in/. I am not sure whether someone abused the site from GCP in the past or there is some other reason, but it looks like they have blocked every GCP IP address in existence. The website simply doesn't load from the instance.
I am using Selenium and need to set up a proxy dynamically, either at the browser level or at the server level. Can you suggest the best way to go about it? I need to download one file every day from this portal. Done manually, the entire thing takes less than 2 minutes.
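For reference, here is a minimal sketch of the browser-level approach I have in mind, assuming Chrome and Selenium 4. The proxy address is a placeholder (a documentation-range IP, not a real proxy), and `PROXY_URL`, `proxy_arguments` and `make_driver` are names I made up for illustration. The proxy is read at runtime, so it can be changed per run without touching the code.

```python
import os


def proxy_arguments(proxy_url):
    """Return the Chrome command-line switches that route traffic via proxy_url."""
    return [
        f"--proxy-server={proxy_url}",  # standard Chromium switch
        "--headless=new",               # no display on a cloud instance
    ]


def make_driver(proxy_url):
    """Start a Chrome session whose traffic goes through proxy_url."""
    # Imported lazily so the helper above works without Selenium installed.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    for argument in proxy_arguments(proxy_url):
        options.add_argument(argument)
    return webdriver.Chrome(options=options)


if __name__ == "__main__":
    # PROXY_URL is a hypothetical environment variable; the default below
    # is a placeholder address and will not work as an actual proxy.
    proxy_url = os.environ.get("PROXY_URL", "http://203.0.113.10:3128")
    driver = make_driver(proxy_url)
    try:
        driver.get("http://mca.gov.in/")
        print(driver.title)
    finally:
        driver.quit()
```

This only covers an unauthenticated HTTP proxy set per browser session. I don't know whether it is the best option compared with routing the whole instance through a proxy at the server level, which is part of what I am asking.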
The website's TOS, quoted here, permits automated scraping:
Acceptable use of MCA Searchable Databases
MCA searchable databases are designed to meet the needs of a wide range of users wishing to interrogate our information on-line. Due to limitations of equipment and bandwidth, they are not intended to be a source for bulk downloads.
Individuals, companies, IP addresses or blocks of IP addresses who deny or degrade service to other users by generating unusually high numbers of daily database accesses, whether generated manually or in an automated fashion, may be denied access to these services without notice.
Since my use case is to download one file every day, it should not be an issue as far as the TOS is concerned.