I have a directory where i have xls files which are being updated and new xls files are being added.
I have combined all of the data using-
import os
import pandas as pd
def read_sheets(filename):
result = []
sheets = pd.read_excel(filename, sheet_name=None)
for name, sheet in sheets.items():
sheet['Sheetname'] = name
sheet['Row'] = sheet.index
result.append(sheet)
return pd.concat(result, ignore_index=True)
def read_files(filenames):
result = []
for filename in filenames:
file = read_sheets(filename)
file['Filename'] = filename
result.append(file)
return pd.concat(result, ignore_index=True)
files = ['1.xls', '2.xls','3.xls','4.xls','5.xls']
dfoo = read_files(files)
But I want to know if any changes are being made to these xls files, how can I automate appends to dfoo from these files and if new files are created lets say 6.xls or 7.xls later(which will have same column headers) how can that data also be appended to dfoo