Basically, I am using some data mining algorithms from python sk-learn
library to do a classification.
However, I got some very un-balanced results, say, around 0.99
recall rate and less than 0.1
precision rate.
Conceptually classification algorithms rely on some "threshold" to make the decision, which means I should be able to balance the precision and recall rate simply through adjusting this "threshold".
However, I cannot find some APIs in sklearn
to help on this, so my question is: How can I manipulate the underlying "threshold" inside sklearn
library to balance the precision and recall rate?