A Uniform Convergence Bound for the Area Under the ROC Curve
Abstract
The area under the ROC curve (AUC) has been advocated as an evaluation criterion for the bipartite ranking problem. We study uniform convergence properties of the AUC; in particular, we derive a distribution-free uniform convergence bound for the AUC which serves to bound the expected accuracy of a learned ranking function in terms of its empirical AUC on the training sequence from which it is learned. Our bound is expressed in terms of a new set of combinatorial parameters that we term the bipartite rank-shatter coefficients; these play the same role in our result as do the standard VC-dimension related shatter coefficients (also known as the growth function) in uniform convergence results for the classification error rate. A comparison of our result with a recent uniform convergence result derived by Freund et al. [9] for a quantity closely related to the AUC shows that the bound provided by our result can be considerably tighter.
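For readers who want the empirical quantity made concrete, here is a minimal sketch. It assumes the usual pairwise definition of the empirical AUC: the fraction of positive-negative pairs that the ranking function orders correctly, with ties counted as one half. The function name `empirical_auc`, the tie convention, and the toy scores are illustrative assumptions, not taken from the paper.

```python
from typing import Callable, Sequence, TypeVar

X = TypeVar("X")


def empirical_auc(
    f: Callable[[X], float],
    positives: Sequence[X],
    negatives: Sequence[X],
) -> float:
    """Fraction of (positive, negative) pairs ranked correctly by f.

    A pair counts 1 if the positive instance scores strictly higher,
    0.5 if the two scores are equal, and 0 otherwise (assumed convention).
    """
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative instance")

    # Score each instance once so f is not re-evaluated inside the pair loop.
    pos_scores = [f(x) for x in positives]
    neg_scores = [f(x) for x in negatives]

    total = 0.0
    for sp in pos_scores:
        for sn in neg_scores:
            if sp > sn:
                total += 1.0
            elif sp == sn:
                total += 0.5
    return total / (len(pos_scores) * len(neg_scores))


# Toy usage: the instances are already scores, so f is the identity.
toy_auc = empirical_auc(lambda s: s, [0.9, 0.6, 0.4], [0.5, 0.4, 0.1])
```

In the toy usage, 7.5 of the 9 pairs are credited (one pair is a tie and one is misordered), so `toy_auc` is 7.5/9. A uniform convergence bound of the kind described above concerns how far this sample quantity can deviate from its expected value, simultaneously over all ranking functions in a class.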
Similar resources
A Uniform Convergence Bound for the Area Under the ROC Curve
The area under the ROC curve (AUC) has been advocated as an evaluation criterion for the bipartite ranking problem. We study uniform convergence properties of the AUC; in particular, we derive a distribution-free uniform convergence bound for the AUC which serves to bound the expected accuracy of a learned ranking function in terms of its empirical AUC on the traini...
Generalization Bounds for the Area Under an ROC Curve
We study generalization properties of the area under an ROC curve (AUC), a quantity that has been advocated as an evaluation criterion for bipartite ranking problems. The AUC is a different and more complex term than the error rate used for evaluation in classification problems; consequently, existing generalization bounds for the classification error rate cannot be used to draw conclusions abo...
Generalization Bounds for the Area Under the ROC Curve
We study generalization properties of the area under the ROC curve (AUC), a quantity that has been advocated as an evaluation criterion for the bipartite ranking problem. The AUC is a different term than the error rate used for evaluation in classification problems; consequently, existing generalization bounds for the classification error rate cannot be used to draw conclusions about the AUC. I...
Receiver Operating Characteristic (ROC) Curve Analysis for Medical Diagnostic Test Evaluation
This review provides the basic principle and rationale for ROC analysis of rating and continuous diagnostic test results versus a gold standard. Derived indexes of accuracy, in particular the area under the curve (AUC), have a meaningful interpretation for classifying diseased versus healthy subjects. The methods of estimating the AUC and testing it in a single diagnostic test and also comparative studies...
ARE for Testing; Convergence Rate of Kernel Density Estimation
The t-test is as defined in the previous lecture. It has slope 1/σ. The Mann-Whitney test rejects if $\frac{1}{nm} \sum_{i,j} I(X_i \le Y_j)$ is large. Note. The Mann-Whitney statistic has a relationship with the area under the ROC curve (AUC) for classification algorithms with a tunable parameter. The ROC plot has one axis for proportion of false positives and one axis for proportion of true positives; as we mov...