While Doing the count vectorization in Hindi, features names are getting automatically stemmed.
from sklearn.feature_extraction.text import CountVectorizer
test = []
test.append("हमें फिल्म बहुत अच्छी लगी ।")
test.append("फिल्म में कुछ बेहतरीन गाने हैं ।")
cv = CountVectorizer().fit(test)
print(cv.get_feature_names())
output: ['अच', 'बह', 'लग', 'हतर', 'हम']