I read a lot of articles on this, but still I could not decide and find useful for all to have an answer, especially on Pandas behaviour in Python 2.7.
Is it better to 'recast' in str or in Unicode in Pandas, Python 2.7? (option 1 or option 2)
Option 1:
df = pd.DataFrame({'b':['ホテ','・旅館', 'ホテル']})
df= df.astype({ 'b': 'unicode'})
Option 2:
df = pd.DataFrame({'b':['ホテ','・旅館', 'ホテル']})
df= df.astype({ 'b': 'str'})
Based on references, everything should be put in Unicode before any processing?
References: Python str vs unicode types