I currently use the following method to clean websites.
http://www.example.com > example.com
https://www.example.com > example.com
http://example.com > example.com
However,
www.example.com > www.example.com
How can I make sure, www.example.com turns into example.com
import re
website = "http://www.example.com"
def clean_website(website):
"""
Transform http://google.com, https://google.com, http://www.google.com and
https:www.//google.com into google.com.
"""
url = re.compile(r"https?://(www\.)?")
return url.sub("", website).strip().strip("/")
clean_website(website)