I have a simple string in utf-8 encoding. I am performing stemming using nltk stemmer. But after stemming, it converts the string to unicode. How can I convert it back to utf-8 encoding? Following is the code.
from nltk.stem import SnowballStemmer
stemmer = SnowballStemmer('english')
string = "something i am writing"
string_before_Stem = string.split()
print string_before_Stem
['something', 'i', 'am', 'writing']
string = stemmer.stem(string)
string = string.split()
print string
[u'something', u'i', u'am', u'writ']