Defensive universal learning with experts

Jan Poland*, Marcus Hutter

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

16 Citations (Scopus)

Abstract

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. Prom this, we obtain a master algorithm for "reactive" experts problems, which means that the master's actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs - in a certain sense - almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow.

Original languageEnglish
Title of host publicationAlgorithmic Learning Theory - 16th International Conference, ALT 2005, Proceedings
Pages356-370
Number of pages15
DOIs
Publication statusPublished - 2005
Externally publishedYes
Event16th International Conference on Algorithmic Learning Theory, ALT 2005 - Singapore, Singapore
Duration: 8 Oct 200511 Oct 2005

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3734 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Algorithmic Learning Theory, ALT 2005
Country/TerritorySingapore
CitySingapore
Period8/10/0511/10/05

Fingerprint

Dive into the research topics of 'Defensive universal learning with experts'. Together they form a unique fingerprint.

Cite this