TY - JOUR
T1 - Bipartite Ranking
T2 - A Risk-Theoretic Perspective
AU - Menon, Aditya Krishna
AU - Williamson, Robert C.
N1 - Publisher Copyright:
© 2016 Aditya Krishna Menon and Robert C. Williamson.
PY - 2016/11/1
Y1 - 2016/11/1
N2 - We present a systematic study of the bipartite ranking problem, with the aim of explicating its connections to the class-probability estimation problem. Our study focuses on the properties of the statistical risk for bipartite ranking with general losses, which is closely related to a generalised notion of the area under the ROC curve: we establish alternate representations of this risk, relate the Bayes-optimal risk to a class of probability divergences, and characterise the set of Bayes-optimal scorers for the risk. We further study properties of a generalised class of bipartite risks, based on the p-norm push of Rudin (2009). Our analysis is based on the rich framework of proper losses, which are the central tool in the study of class-probability estimation. We show how this analytic tool makes transparent the generalisations of several existing results, such as the equivalence of the minimisers for four seemingly disparate risks from bipartite ranking and class-probability estimation. A novel practical implication of our analysis is the design of new families of losses for scenarios where accuracy at the head of ranked list is paramount, with comparable empirical performance to the p-norm push.
AB - We present a systematic study of the bipartite ranking problem, with the aim of explicating its connections to the class-probability estimation problem. Our study focuses on the properties of the statistical risk for bipartite ranking with general losses, which is closely related to a generalised notion of the area under the ROC curve: we establish alternate representations of this risk, relate the Bayes-optimal risk to a class of probability divergences, and characterise the set of Bayes-optimal scorers for the risk. We further study properties of a generalised class of bipartite risks, based on the p-norm push of Rudin (2009). Our analysis is based on the rich framework of proper losses, which are the central tool in the study of class-probability estimation. We show how this analytic tool makes transparent the generalisations of several existing results, such as the equivalence of the minimisers for four seemingly disparate risks from bipartite ranking and class-probability estimation. A novel practical implication of our analysis is the design of new families of losses for scenarios where accuracy at the head of ranked list is paramount, with comparable empirical performance to the p-norm push.
KW - Bayes-optimality
KW - Bipartite ranking
KW - Class-probability estimation
KW - Proper losses
KW - Ranking the best
UR - http://www.scopus.com/inward/record.url?scp=85008498724&partnerID=8YFLogxK
M3 - Review article
SN - 1532-4435
VL - 17
SP - 1
EP - 102
JO - Journal of Machine Learning Research
JF - Journal of Machine Learning Research
ER -