I have to load millions of XML files from S3 and process them in Spark, but the files must be loaded in a specific order. For example, I have added a time-series value to the name of each folder. Now I need to sort all the files by that time-series prefix and load them into Spark in the same sorted order.
The order of the files should not change while they are loaded into the Spark DataFrame.
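
To make the ordering I mean concrete, here is a minimal sketch in plain Python. It only sorts the paths and does not touch Spark yet. The bucket name, the folder layout (`<numeric timestamp>_batch/<file>.xml`), and the helper names `timestamp_key` and `sort_by_timestamp` are all hypothetical and only for illustration; my real folder names may look different.

```python
import re

# Assumption: each folder name starts with a numeric timestamp,
# e.g. "1700000002_batch". The real naming scheme may differ.
_PREFIX = re.compile(r"^(\d+)")


def timestamp_key(path: str) -> int:
    """Return the numeric timestamp prefix of the folder that holds the file."""
    # The folder is the second-to-last component of the path.
    folder = path.rstrip("/").split("/")[-2]
    match = _PREFIX.match(folder)
    if match is None:
        raise ValueError(f"no timestamp prefix in folder name: {folder!r}")
    return int(match.group(1))


def sort_by_timestamp(paths):
    """Sort paths by numeric folder timestamp, then by full path for ties."""
    return sorted(paths, key=lambda p: (timestamp_key(p), p))


# Hypothetical example paths, deliberately out of order.
paths = [
    "s3://my-bucket/1700000010_batch/a.xml",
    "s3://my-bucket/1700000002_batch/b.xml",
    "s3://my-bucket/999_batch/c.xml",
]
ordered = sort_by_timestamp(paths)
```

The sort is numeric rather than lexicographic, so a shorter prefix such as `999` comes before `1700000002`. What I do not know is how to make Spark keep this order once the files are in a DataFrame.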
Can we do this in Spark?