I am not sure what the most effective way is to handle categorical variables in a regression problem.
My table looks like this:
Date      Category  Sales
1/1/2018  Shoes     200
1/2/2018  Shoes     300
1/1/2018  home      100
The problem I am dealing with is sales forecasting.
What is the best way to deal with the Category column: dummy variables (get_dummies) or a label encoder?
I used a label encoder followed by a standard scaler, but the fit was very poor.
After that I scaled all my inputs (date, category), with the exception of the y variable (Sales).
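
For reference, here is a minimal sketch of the two encodings I am comparing, applied to the three sample rows above. It assumes pandas and scikit-learn; the month/day/year date format and the `Category_` column prefix are my own assumptions for illustration, not part of my real pipeline.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# The three sample rows from the table; the date format is assumed to be month/day/year.
df = pd.DataFrame({
    "Date": pd.to_datetime(["1/1/2018", "1/2/2018", "1/1/2018"], format="%m/%d/%Y"),
    "Category": ["Shoes", "Shoes", "home"],
    "Sales": [200, 300, 100],
})

# Option 1: label encoding assigns each category an integer code.
# The codes follow sort order only, so they imply an ordering that the categories do not have.
label_codes = LabelEncoder().fit_transform(df["Category"])

# Option 2: dummy (one-hot) encoding creates one 0/1 column per category.
dummies = pd.get_dummies(df["Category"], prefix="Category", dtype=int)

print(label_codes)
print(dummies)
```

With label encoding, a linear model treats the integer codes as a numeric scale, which may be part of why the fit is poor; the dummy columns avoid that at the cost of one extra column per category.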