A colleague mentioned that Spark's DataFrameWriter class writes to a temporary location in S3 and then copies the output to the desired S3 location once the write is complete. I want to understand this behavior better, but I cannot locate the source code that implements it. I've been looking here:
Is the behavior described in this post what Spark actually does when writing to S3?
If so, a link to the location(s) in the source where this code path lives, along with an explanation of why this behavior is the default (and hidden from the end user), would be helpful.
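
To make the question concrete, here is a minimal sketch of the pattern as my colleague described it: write to a staging location first, then move the result to the final destination once the write is complete. This is only an illustration using a local directory in place of S3. The function name `write_then_move` and the `_staging` directory name are hypothetical and are not taken from Spark's source.

```python
import os
import shutil
import tempfile


def write_then_move(dest_dir, filename, data):
    """Write data to a staging directory, then move it into dest_dir."""
    # Hypothetical staging location; not a name taken from Spark's source.
    staging_dir = os.path.join(dest_dir, "_staging")
    os.makedirs(staging_dir, exist_ok=True)

    # Step 1: write the full output to the staging location.
    staged_path = os.path.join(staging_dir, filename)
    with open(staged_path, "w") as f:
        f.write(data)

    # Step 2: only after the write has finished, move the file to its
    # final location and remove the now-empty staging directory.
    final_path = os.path.join(dest_dir, filename)
    shutil.move(staged_path, final_path)
    os.rmdir(staging_dir)
    return final_path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as dest:
        path = write_then_move(dest, "part-00000.txt", "hello\n")
        print(os.path.basename(path))
        print(sorted(os.listdir(dest)))
```

On a local filesystem the move in step 2 is a cheap rename. What I would like to confirm is whether Spark follows a two-step flow like this when the destination is S3, and where that happens in the code.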