Boosting nearest neighbors for the efficient estimation of posteriors

Roberto D'Ambrosio*, Richard Nock, Wafa Bel Haj Ali, Frank Nielsen, Michel Barlaud

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

It is an admitted fact that mainstream boosting algorithms like AdaBoost do not perform well to estimate class conditional probabilities. In this paper, we analyze, in the light of this problem, a recent algorithm, unn, which leverages nearest neighbors while minimizing a convex loss. Our contribution is threefold. First, we show that there exists a subclass of surrogate losses, elsewhere called balanced, whose minimization brings simple and statistically efficient estimators for Bayes posteriors. Second, we show explicit convergence rates towards these estimators for unn, for any such surrogate loss, under a Weak Learning Assumption which parallels that of classical boosting results. Third and last, we provide experiments and comparisons on synthetic and real datasets, including the challenging SUN computer vision database. Results clearly display that boosting nearest neighbors may provide highly accurate estimators, sometimes more than a hundred times more accurate than those of other contenders like support vector machines.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2012, Proceedings
Pages314-329
Number of pages16
EditionPART 1
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event2012 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2012 - Bristol, United Kingdom
Duration: 24 Sept 201228 Sept 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume7523 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2012 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2012
Country/TerritoryUnited Kingdom
CityBristol
Period24/09/1228/09/12

Fingerprint

Dive into the research topics of 'Boosting nearest neighbors for the efficient estimation of posteriors'. Together they form a unique fingerprint.

Cite this