I have directories/files in S3 in the below structure.
root/
20180101/files.txt
20180102/files.txt
20180103/files.txt
Now i want to pass a date range as start_date=20180101 and end_date=20180102
. I want the pyspark code to read files from these directories included in the range. How can i achieve this.
**The range is configurable, i.e it can be 1 week/30days/90days