According to H2O docs in FAQ of the DRF section, this note is mentioned on the "How does the algorithm handle missing values during training?" FAQ:
Note: Unlike in GLM, in DRF numerical values are handled the same way as categorical values. Missing values are not imputed with the mean, as is done by default in GLM.
I use a DRF Algorithm to solve a regression problem, but when I saw this note, I felt strange. If I convert all numerical value to categorical value to solve regression problem, I think that it is nonsense.
Here is My question.
- Do I need to convert all numerical values to categorical values to use DRF algorithm?
or
- Do I not need to convert all numerical values to categorical values to use DRF algorithm?
Thank you to read my question.