I have a data set of bike rentals in Washington, D.C.
Some of my variables are factors and some are continuous numerics.
I couldn't find a way to upload the dataset, so I hope the following explanation is enough:
I want to explain the rental "count" (which is numeric and continuous) using the climate.
To do that, I want to merge the following variables into one, called agg_climate:
- season (factor): 1 = Winter, 2 = Summer, 3 = Spring, 4 = Fall
- weather (factor): 1 = Good, 2 = Normal, 3 = Bad
- temp (continuous): measured in degrees
- atemp (continuous): measured in degrees
- windspeed (continuous): measured in mph
- humidity (continuous): measured in %

A sample of the data:

         datetime            season holiday     workingday  weather temp  atemp  humidity windspeed count hour daytime
    3201 2011-09-15 17:00:00 Summer Regular day Working day Bad     19.68 23.485 82       31.0009   261   17   After Noon
    377  2011-02-02 05:00:00 Winter Regular day Working day Bad     9.02  12.120 93       7.0015    3     5    Early Morning
    6103 2012-06-01 21:00:00 Spring Regular day Working day Bad     26.24 29.545 78       16.9979   85    21   Evening

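
Since I couldn't upload the file, here is a minimal sketch that recreates the three sample rows above, assuming the data lives in an R data frame. The name `bikes`, the `tz = "UTC"` setting, and the column types are my assumptions; the factor level order follows the coding listed above.

```r
# Recreate the three sample rows as a data frame.
# Assumptions: the object name `bikes`, the UTC time zone, and the column types.
bikes <- data.frame(
  datetime   = as.POSIXct(c("2011-09-15 17:00:00",
                            "2011-02-02 05:00:00",
                            "2012-06-01 21:00:00"), tz = "UTC"),
  # Level order follows the coding in the variable list (1 = Winter, ..., 4 = Fall)
  season     = factor(c("Summer", "Winter", "Spring"),
                      levels = c("Winter", "Summer", "Spring", "Fall")),
  holiday    = factor(rep("Regular day", 3)),
  workingday = factor(rep("Working day", 3)),
  # Level order follows the coding in the variable list (1 = Good, 2 = Normal, 3 = Bad)
  weather    = factor(rep("Bad", 3), levels = c("Good", "Normal", "Bad")),
  temp       = c(19.68, 9.02, 26.24),
  atemp      = c(23.485, 12.120, 29.545),
  humidity   = c(82, 93, 78),
  windspeed  = c(31.0009, 7.0015, 16.9979),
  count      = c(261, 3, 85),
  hour       = c(17, 5, 21),
  daytime    = factor(c("After Noon", "Early Morning", "Evening")),
  row.names  = c("3201", "377", "6103")
)

# Show the structure so the factor and numeric columns are visible
str(bikes)
```
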
A picture of the data table: https://ibb.co/SnphvBt
What is the proper way to do so? Thanks!