Iam doing the Bulldozer price calculation problem, using RandomForestRegressor.After removing all the missing values and converting all data into numeric, I try to fit and train the data into a model. The data set is pretty large about 412698 rows × 57 columns and using a 3gb Ram device.
here is my code
%%time
# Instantiate model
model = RandomForestRegressor(n_jobs=-1,
random_state=42)
# Fit the model
model.fit(df_tmp.drop("SalePrice", axis=1), df_tmp["SalePrice"])
The data set is available in Kaggle and I am also attaching its link.. https://www.kaggle.com/c/bluebook-for-bulldozers/data