I'm working on analysis of soccer rating systems recently and got a data source from scoreboard.com.
After I parsed some sample data, realized that the data was not readable. Seems like it's in broken English texts.
Would you refer to following python code and sample result? Looking forward to see your help.
Thanks.
import requests
import lxml.html
import cssselect
from bs4 import BeautifulSoup
url = requests.get('https://www.scoreboard.com/soccer/england/premier-league-2016-2017/results/')
urlshow = url.text
print(urlshow)
-- sample of the result --
Premier League¬ZEE÷dYlOSQOD¬ZB÷198¬ZY÷England¬ZC÷fZHsKRg9¬ZD÷t¬ZE÷8Ai8InSt¬