In this note, I study how the precision of a binary classifier depends on the ratio of positive to negative cases in the test set, as well as the classifier's true and false-positive rates. This relationship allows prediction of how the precision-recall curve will change with , which seems not to be well known. It also allows prediction of how and the precision gain and recall gain measures of Flach and Kull (2015) vary with .
© 2021 Massachusetts Institute of Technology
Massachusetts Institute of Technology