i want to do a sentiment analysis on twitter tweet and for that i have test data set in csv file in which tweet column as input but how to create a training data and how to label the data and how to use as feature set.
please correct me if I'm wrong here.. i have to manually create the feature set and label each word?
till now i have done word tokenize and stop word and porter stemming to my test dataset. now from here i want apply my training set and as output i want positive and negative tweet result. please give me some correct direction..