I am trying to parse a table from a dropbox link (https://www.dropbox.com/s/i77mern7joxc9ur/TestResultCodelistVoC.xlsx). It is a .xlsx
table and I have tried two methods so far
METHOD 1
codeID_url = 'https://www.dropbox.com/s/i77mern7joxc9ur/TestResultCodelistVoC.xlsx'
tables = pd.read_html(codeID_url)
df_codeID = tables[0]
gives
ValueError: No tables found
Which makes sense, as, in the end, I am not parsing a table from a html page. The commands above work perfectly fine for the tables in this page (https://www.ecdc.europa.eu/en/covid-19/variants-concern)
METHOD 2
codeID_url = 'https://www.dropbox.com/s/i77mern7joxc9ur/TestResultCodelistVoC.xlsx'
data = pd.read_excel(codeID_url,'TestResultCodelistVoC')
gives:
XLRDError: Unsupported format, or corrupt file: Expected BOF record; found b'<!DOCTYP'
I did find a topic on this same error here, though all the answers are dealing with a local .xls
file, and in my case I am trying to parse a webpage/ link, which is in the end a .xls
file.
I also came across one solution using a dropbox token, though I first would like to try to download the aforementioned table without a dropbox account, if possible.