1

Please tell me how to split a node that has a numerical value, like suppose my parent node is temperature and it has numerical values 45.20, 33.10, 11.00, etc. How should I split such kind of numerical values? If I have a categorical column like temperature having a low and high value, I will split it low on the left side and high on the right side. But how should I split the column if it is numeric?

Navin Bondade
  • 43
  • 1
  • 4
  • https://datascience.stackexchange.com/q/24339/40260 – GoodDeeds Feb 12 '20 at 13:19
  • Read this, all the details of decision tree are explained within: https://www.analyticsvidhya.com/blog/2016/04/complete-tutorial-tree-based-modeling-scratch-in-python/ – Mohamed Ali JAMAOUI Feb 12 '20 at 13:19
  • Does this answer your question? [Decision tree using continuous variable](https://stackoverflow.com/questions/40889344/decision-tree-using-continuous-variable) – GoodDeeds Feb 12 '20 at 13:19

1 Answers1

1

There are discretization methods for converting numerical features into categories e.g. for using in Decision Trees. There are many supervised and unsupervised algorithms, from a simple Binning to Information Theoretic approaches like what Fayyad & Irani proposed. Follow this tutorial to learn how to discretize your features. The algorithm by Fayyad and Irani is explained in this course.

Disclaimer: I am the instructor of that course.

Kasra Manshaei
  • 137
  • 1
  • 11