I'm using the AWS Machine Learning regression to predict the waiting time in a line of a restaurant, in a specific weekday/time. Today I have around 800k data.
Example Data:
restaurantID (rowID)weekDay (categorical)time (categorical)tablePeople (numeric)waitingTime (numeric - target)
1 sun 21:29 2 23
2 fri 20:13 4 43
...
I have two questions:
1) Should I use time as Categorical or Numeric? It's better to split into two fields: minutes and seconds?
2) I would like in the same model to get the predictions for all my restaurants.
Example: I expected to send the rowID identifier and it returns different predictions, based on each restaurant data (ignoring others data).
I tried, but it's returning the same prediction for any rowID. Why?
Should I have a model for each restaurant?