1

Following the example from the official documentation, I am at this point in training a DecisionTree classifier:

arma::rowvec weights = arma::randu<arma::mat>(1, labels.size());
size_t numClasses = 2;

KFoldCV<DecisionTree<>, Accuracy> cv2(5, data, labels, numClasses, weights);
size_t minimumLeafSize = 12;
double weightedDecisionTreeAccuracy = cv2.Evaluate(minimumLeafSize);

My question: does DecisionTree work well if the samples from the 2 classes are not evenly distributed? How should I modify my code to take into account that in the training set there is a 75% samples from class 1 which narrowly distribute around a single value, while the remaining 25% are from class 2 and are quite sparse?

Gigioz
  • 367
  • 5
  • 19

0 Answers0