I want to one-hot encode a single column in my data. The column looks like this:
  app
0   a
1   b
2   c
3   a
I've performed:
pd.get_dummies(df, columns=['app'])
   app_a  app_b  app_c
0      1      0      0
1      0      1      0
2      0      0      1
3      1      0      0
In reality, though, the app column can also contain the value 'd', which does not appear in my training data. So I want get_dummies to produce an app_d column as well, even though no row in my data has the value 'd'.
Is there a way to one-hot encode my sample data above into a predefined set of columns? The result I want looks like this:
   app_a  app_b  app_c  app_d
0      1      0      0      0
1      0      1      0      0
2      0      0      1      0
3      1      0      0      0
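
One possible way to get this output, sketched here on the assumption that the full set of possible values is known in advance, is to turn the column into a pandas Categorical with explicit categories before calling get_dummies. The name ALL_APPS is made up for this illustration, and dtype=int is passed only so the result shows 0/1 rather than booleans on recent pandas versions.

```python
import pandas as pd

# Hypothetical list of every value the 'app' column may take,
# including 'd', which is absent from the training data.
ALL_APPS = ["a", "b", "c", "d"]

df = pd.DataFrame({"app": ["a", "b", "c", "a"]})

# Declare the categories up front so get_dummies emits one column
# per category, even for categories that never occur in the data.
df["app"] = pd.Categorical(df["app"], categories=ALL_APPS)

encoded = pd.get_dummies(df, columns=["app"], dtype=int)
print(encoded)
```

An alternative that may also work is to call get_dummies as before and then reindex the result against the predefined column names with fill_value=0.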