Does Spark shuffle write all intermediate data to disk, or only that which will not fit in memory ("spill")?
In particular, if the intermediate data is small, will anything be written to disk, or will the shuffle be performed entirely using memory without writing anything to disk?
I've checked the docs and related StackOverflow questions, but they weren't clear on this precise question.