When I have queried a table and checked num of partitions available for that df is 14. The data size is more than 10 GB.
But when I see the table location there are 400 part files available which was created by spark while saving df as a table.
Ideally, num of partitions should be equal to num of output files, right?
Can someone please help me understand this scenario?