I have read several threads on here regarding the differences between h2o.predict() and h2o.performance() (see the link below).
How to interpret the probabilities (p0, p1) of the result of h2o.predict()
Can someone tell me which threshold h2o.predict() uses? Is it max f1? If so, is it the threshold from the training data, the validation data, or cross-validation?
I tried applying the validation thresholds for max f1 and max f0point5 to the test set (completely separate from the training and validation data), but the predicted class from h2o.predict() and the class obtained by applying the threshold do not match completely.
The closest match I got came from taking the max f0point5 threshold from the training data and applying it to the test set.
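To make the comparison concrete, here is a minimal sketch of the check I am running, written in plain Python rather than with the H2O API. The rows are made-up values in the shape of the predict, p0 and p1 columns, and the candidate thresholds and the "p1 >= threshold" rule are assumptions for illustration only, not something taken from the H2O documentation.

```python
# Hypothetical rows shaped like an h2o.predict() result for a binary model:
# (predict, p0, p1). All values are invented for illustration.
rows = [
    (1, 0.30, 0.70),
    (0, 0.80, 0.20),
    (1, 0.55, 0.45),
    (0, 0.62, 0.38),
]


def label_at(p1, threshold):
    """Return 1 when p1 reaches the threshold, else 0 (assumed rule)."""
    return 1 if p1 >= threshold else 0


def count_mismatches(rows, threshold):
    """Count rows whose reported label differs from thresholding p1."""
    return sum(
        1 for predict, _p0, p1 in rows if predict != label_at(p1, threshold)
    )


# Hypothetical candidate thresholds, e.g. read off different metric tables.
candidates = {"candidate A": 0.50, "candidate B": 0.40}
for name, threshold in candidates.items():
    print(name, threshold, count_mismatches(rows, threshold))
```

A threshold that reproduces the predict column exactly gives zero mismatches; none of the validation thresholds I tried does that on my real data.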
There is not much documentation on h2o.predict(). Also, is there a best practice for choosing the threshold, e.g. taking the mean of the training and validation thresholds?
Thanks in advance!