Could someone explain why this works:
import pandas as pd
df = pd.DataFrame({
'brand': ['Yum Yum', 'Yum Yum', 'Indomie', 'Indomie', 'Indomie','Indomie','Indomie']
})
df['websites']= pd.Series(['http://imtiazconsultancy.co.uk', 'http://www.fidelitymortgageservices.com','https://willinghandsmi.com/','http://noname.co.za','https://nakaiindianjewelry.wixsite.com/nakaiindianjewerly',
'https://www.tranzact.net/?utm_source=google&utm_medium=organic&utm_campaign=gmb-local-listings&utm_content=charlotte-university', 'http://noname.co.ja'])
df['websites'] = df['websites'].str.extract(r"http(.*).com")
# df['websites'] = df['websites'].str.extract(r"http(.*).com|http(.*).uk|http(.*).za|http(.*).ja|http(.*).net|http(.*).site|http(.*).jp|http(.*).gov|http(.*).org|http(.*).edu")
print(df)
but this doesn't:
import pandas as pd
df = pd.DataFrame({
'brand': ['Yum Yum', 'Yum Yum', 'Indomie', 'Indomie', 'Indomie','Indomie','Indomie']
})
df['websites']= pd.Series(['http://imtiazconsultancy.co.uk', 'http://www.fidelitymortgageservices.com','https://willinghandsmi.com/','http://noname.co.za','https://nakaiindianjewelry.wixsite.com/nakaiindianjewerly',
'https://www.tranzact.net/?utm_source=google&utm_medium=organic&utm_campaign=gmb-local-listings&utm_content=charlotte-university', 'http://noname.co.ja'])
df['websites'] = df['websites'].str.extract(r"http(.*).com|http(.*).uk|http(.*).za|http(.*).ja|http(.*).net|http(.*).site|http(.*).jp|http(.*).gov|http(.*).org|http(.*).edu")
print(df)
The second version currently raises ValueError: Columns must be same length as key
I know I'm misusing the | operator but can't figure out what I need to change.
Thanks!
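
For reference, a minimal sketch of what seems to be going on. str.extract returns one column per capture group, so a pattern with ten (.*) groups produces a ten-column DataFrame, which cannot be assigned to the single 'websites' column. The single-group pattern yields only one column, which is why the first version works. The shortened URL list, the escaped dots, the https?:// prefix, and the names multi and single are assumptions made for illustration, not part of the original code.

```python
import pandas as pd

# Shortened sample of the URLs from the question (illustrative subset)
urls = pd.Series([
    'http://imtiazconsultancy.co.uk',
    'http://www.fidelitymortgageservices.com',
    'https://willinghandsmi.com/',
    'http://noname.co.za',
])

# Two alternatives, each with its own capture group: the result has
# two columns, one per group, with NaN wherever that group did not match.
multi = urls.str.extract(r"http(.*)\.com|http(.*)\.uk")
print(multi.shape)

# One capture group followed by a non-capturing alternation (?:...)
# of suffixes. expand=False returns a Series instead of a DataFrame.
# Assumption: the text between the scheme and the final suffix is wanted.
single = urls.str.extract(r"https?://(.*)\.(?:com|uk|za)", expand=False)
print(single)
```

With a single group, the result can be assigned back to one column as in the first version. More suffixes (net, org, and so on) would go inside the (?:...) group, separated by |.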