I have a set of gzip-compressed CSV files in S3. But they have the .csv extension, not .csv.gz. The issue is that when I try to read them using Pyspark, they do not read properly. I have tried many configurations, but with no luck.
Then I found similar issue in here(link). But here they have used Scala. I tried to implement this with Python, but I could not find the correct APIs for doing that.
Any help would be appreciated.
Implement Python code for read a compressed file with custom extension using Pyspark.