I want to use pandas to extract web table content.
Here are the code:
import pandas as pd
import requests
ids = [1722044,1722045,1722046,1722047]
str = 'http://vip.win007.com/AsianOdds_n.aspx?id={}'
for id in ids:
url = str.format(id)
asianodds = pd.read_html(url, header = 0)
asianodds[0]
df_NaN = asianodds[0]
#Data cleaning
asianodds = df_NaN.drop(df_NaN[df_NaN.多盘口 == '多盘口'].index)
#Delete Columns
asianodds.drop('多盘口', inplace=True, axis=1)
asianodds.drop('历史资料', inplace=True, axis=1)
# 澳门 data
macau_asianodds = asianodds.iloc[0:1]
asianodds.iloc[0:1].to_excel("c:/logs/test.xls")
print(macau_asianodds)
Expected output is 4 records. But, the Excel file had just one record.