I have 2 questions about formatting data for contextual bandit model training.
If I have data such as below...
1:1:0.2 | d1:us d2:female d3:12
Question 1) I read from VW Wiki that each feature is optionally followed by a float. In case where I have categorical features (such as us, female) as values, what is the best way to re-format them? I am thinking that I would just not suffix them with a float let them have a default value of 1. I'm hoping this would achieve one-hot encoding.
Question 2) I've been wrongly training the model by logging the data as below
1:1:0.2 | us female 12
What I now realize is that "us", "female", and "12" are treated as features with default values as 1. Am I correct?