I am trying to write a pandas DataFrame
to an .xlsx
file where different numerical columns would have different formats. For example, some would show only two decimal places, some would show none, some would be formatted as percents with a "%" symbol, etc.
I noticed that DataFrame.to_html()
has a formatters
parameter that allows one to do just that, mapping different formats to different columns. However, there is no similar parameter on the DataFrame.to_excel()
method. The most we have is a float_format
that is global to all numbers.
I have read many SO posts that are at least partly related to my question, for example:
- Use the older
openpyxl
engine to apply formats one cell at a time. This is the approach with which I've had the most success. But it means writing loops to apply formats cell-by-cell, remembering offsets, etc. - Render percentages by changing the table data itself into strings. Going the route of altering the actual data inspired me to try dealing with decimal place formatting by calling
round()
on each column before writing to Excel - this works too, but I'd like to avoid altering the data. - Assorted others, mostly about date formats
Are there other more convenient Excel-related functions/properties in the pandas API that can help here, or something similar on openpyxl
, or perhaps some way to specify output format metadata directly onto each column in the DataFrame
that would then be interpreted downstream by different outputters?