TY - GEN
T1 - Defensive universal learning with experts
AU - Poland, Jan
AU - Hutter, Marcus
PY - 2005
Y1 - 2005
N2 - This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. Prom this, we obtain a master algorithm for "reactive" experts problems, which means that the master's actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs - in a certain sense - almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow.
AB - This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. Prom this, we obtain a master algorithm for "reactive" experts problems, which means that the master's actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs - in a certain sense - almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow.
UR - http://www.scopus.com/inward/record.url?scp=33646515747&partnerID=8YFLogxK
U2 - 10.1007/11564089_28
DO - 10.1007/11564089_28
M3 - Conference contribution
SN - 354029242X
SN - 9783540292425
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 356
EP - 370
BT - Algorithmic Learning Theory - 16th International Conference, ALT 2005, Proceedings
T2 - 16th International Conference on Algorithmic Learning Theory, ALT 2005
Y2 - 8 October 2005 through 11 October 2005
ER -