-1

For example, here is an address: https://pesdb.net/pes2021/?id=44379
There seems to be no api call (I am pretty new to this but I checked XHR in network monitor and there are no relevant json calls).

1 Answers1

0

There's an example here of how to parse an html table, with just the Pandas/requests library.

According to the latest docs, you can skip the requests call in that answer, but you will need to install dependencies:

pip install lxml html5lib beautifulsoup4

then you can do something like this:

df_list = pd.read_html('https://pesdb.net/pes2021/?id=44379')    # this parses all the tables in webpages to a list
df = df_list[0]                   # the first table on the page
print(df)                         # this is your dataframe!

Generally, Beautiful Soup 4 is the most popular Python library for webscraping.

You can read some examples here

Alternatively you could perform a GET request to the site and manually parse the response. (Most difficult / pointless)

nihilok
  • 1,325
  • 8
  • 12