I am trying to trigger a lambda function object arrival in s3 alonng with object details like name and path. then trigger python script on EMR which will access the file which is on s3. Please let me know how i can trigger python script (may within pig /hive script?) to process the file on EMR which will trigger this action?
To copy the file to local once we have the details from lambda trigger for hive/pig script.
s3_client = boto3.client('s3')
s3_client.download_file('s3:n//<bucket name>/','', '/home/hadoop/data/')
please let me know how it can be done. Files will come every 30-40 mins.