I have a large xlsx Excel file (56mb, 550k rows) from which I tried to read the first 10 rows. I tried using xlrd
, openpyxl
, and pyexcel-xlsx
, but they always take more than 35 mins because it loads the whole file in memory.
I unzipped the Excel file and found out that the xml
which contains the data I need is 800mb unzipped.
When you load the same file in Excel it takes 30 seconds. I'm wondering why it takes that much time in Python?