I am using s3disctcp to copy 31,16,886 files(300 GB) from S3 to HDFS and it took 4 days to just copy 10,48,576 files .I killed the job and need to understand how can i reduce this time or what am i doing wrong.
s3-dist-cp --src s3://xml-prod/ --dest hdfs:///Output/XML/
Its on AWS EMR machine.