I have a huge csv file with 720,000,000 (720 million) lines. I want to sort it and my command is:
sort -T /tmp -S 50% --parallel=4 file.csv -o file_sorted.csv
Is there any other option that I can use to make it really fast?
Thanks!
I have a huge csv file with 720,000,000 (720 million) lines. I want to sort it and my command is:
sort -T /tmp -S 50% --parallel=4 file.csv -o file_sorted.csv
Is there any other option that I can use to make it really fast?
Thanks!
Use parallel sorting algorithms for huge data.
Useful topic: Which parallel sorting algorithm has the best average case performance?