C4.5 is a decision tree algorithm invented by Ross Quinlan in 1993.
Questions tagged [c4.5]
33 questions
5
votes
1 answer
R packages/models that can handle NA's
I'm looking for R packages or machine learning models/algos like randomForest, glmnet, gbdt, etc that can handle NA's, as opposed to ignoring the row or column that has any instances of NA's. I'm not looking to impute. Any suggestions?

screechOwl
- 27,310
- 61
- 158
- 267
4
votes
1 answer
How does the C4.5 Algorithm handle continuous data?
I am implementing the C4.5 algorithm in .net, however I don't have clear idea of how it deals "continuous (numeric) data". Could someone give me a more detailed explanation?

user2210106
- 41
- 1
- 3
3
votes
1 answer
Transform from one decision tree (J48) classification to ensemble in python
I would like to implement the classification of the algorithm based on the paper. I have a single J48 (C4.5) decision tree (code mentioned down). I would like to run it for several (I_max) times over the dataset and calculate the C* = class…

Avi
- 2,247
- 4
- 30
- 52
2
votes
1 answer
is it possible to implement c4.5 algorithm in scikit-learn?
I've read in the documentation that sklearn uses CART algorithm for trees.
Are there specific attributes to change so that it becomes similar to a c4.5 implementation?

Lee Merlas
- 430
- 9
- 24
2
votes
0 answers
Information Gain in R
I found packages being used to calculating "Information Gain" for selecting main attributes in C4.5 Decision Tree and I tried using them to calculating "Information Gain".
But the results of calculation of each packages are different like the code…

Archimpressom
- 41
- 1
- 5
2
votes
1 answer
How to use RWeka package on a dataset?
So I generated a random dataset online and I need to apply the C4.5 algorithm on it.
I installed the RWeka package and all its dependencies but I do not know how to execute it.
Can somebody help me with links to tutorials? Anything apart from the…

Saksham Arora
- 71
- 2
- 9
2
votes
0 answers
Implemantation of Ensemble for C4.5 in R
I'm looking for a "voting” classifier which is obtained by splitting a training set into k fragments, building a C4.5-tree on each fragment separately, and using a majority vote of the trees as the output.
What is the the model I have to choose?…

Avi
- 2,247
- 4
- 30
- 52
1
vote
0 answers
How does pessimistic error pruning in C4.5 algorithm working?
I'm studing C4.5 algorithm and trying to make it in java by myself, but in the part of pruning, I don't understand what it compute in book of C4.5.
In this book,it said:"When N training cases are covered by a leaf, E of them incorrectly....For a…

hua
- 169
- 9
1
vote
0 answers
C4.5:When to stop growing tree?
I have a dataset include 10 continuous attributes and 500 cases. When I growing a decision tree, I got a problem: when to stop growing tree?
This is a tree that I build in java with c4.5, in attribute8, the subdataset is in second picture and sort…

hua
- 169
- 9
1
vote
1 answer
What does Number of leaves and Size of tree mean in Weka?
Could someone explain in plain English what the number of leaves and size of tree mean in Weka? I have made my decision tree, and this is what I got below, but I just need an explanation of what these values mean. Thank you.
Number of leaves:…
user12502105
1
vote
1 answer
How to make a prediction for an instance without creating an ARFF file for that instance in WEKA?
Last semester I had a project where when given a set of data on cars I had to build a model and use that model to make predictions from user inputted data (It involved a GUI and so on). The professor introduced Weka, but only in its GUI form. I'm…

Matthew Evans
- 13
- 4
1
vote
2 answers
Can I prevent the J48 classifier from splitting on the same field more than x times?
Using a dataset, Weka and the J48 classifier I've got the following tree:
And it splits off a lot on 'NumTweets' on the right side. Can I prevent J48 from doing more than a specified amount of splits on one field? Because this is obviously…

user3394131
- 199
- 13
1
vote
0 answers
R caret train() underperforming on J48 compared to manual parameter setting
I need to optimize the accuracy of the C4.5 algorithm on my churn dataset using RWeka's implementation (J48()). Therefore I am using the train() function of the caret package to help me determine the optimal parameter settings (for M and C).
I tried…

m3ph
- 53
- 6
1
vote
1 answer
How C4.5 algorithm handles data with same attributes but different results?
I'm trying to create a decision tree with C4.5 algorithm for a school project. The decision tree is for Haberman's Survival Data Set, attribute information is as follows.
Attribute Information:
1. Age of patient at time of operation (numerical)
2.…

razorx
- 115
- 1
- 5
1
vote
1 answer
Reduced Error Pruning Algorithm
I have a question about this algorithm:
Partition training data in “grow” and “validation” sets.
Build a complete tree from the “grow” data.
Until accuracy on validation set decreases do:
For each non-leaf node, n, in the tree do:
…

Nick
- 2,818
- 5
- 42
- 60