I am trying to find the time complexity of a binary decision tree algorithm. I have understood that at each node, the complexity is bounded by the complexity of searching the best attribute O(m nlog n)
knowing that m
is the number of features and n
is the number of exemples in the training set. I think we should multiply O(m nlog n)
by the number of nodes to find the complexity of the whole algorithm, is it? I don't understand why in some resources, the complexity of the decision tree is considered only O(m nlog n)
!
Can anyone explain this? Is there any difference in the computation of the complexity, depending on whether categorical attributes or continuous are used?