TY - JOUR
T1 - Bad universal priors and notions of optimality
AU - Leike, Jan
AU - Hutter, Marcus
N1 - Publisher Copyright:
© 2015 J. Leike & M. Hutter.
PY - 2015
Y1 - 2015
N2 - A big open question of algorithmic information theory is the choice of the universal Turing machine (UTM). For Kolmogorov complexity and Solomonoff induction we have invariance theorems: the choice of the UTM changes bounds only by a constant. For the universally intelligent agent AIXI (Hutter, 2005) no invariance theorem is known. Our results are entirely negative: we discuss cases in which unlucky or adversarial choices of the UTM cause AIXI to misbehave drastically. We show that Legg-Hutter intelligence and thus balanced Pareto optimality is entirely subjective, and that every policy is Pareto optimal in the class of all computable environments. This undermines all existing optimality properties for AIXI. While it may still serve as a gold standard for AI, our results imply that AIXI is a relative theory, dependent on the choice of the UTM.
AB - A big open question of algorithmic information theory is the choice of the universal Turing machine (UTM). For Kolmogorov complexity and Solomonoff induction we have invariance theorems: the choice of the UTM changes bounds only by a constant. For the universally intelligent agent AIXI (Hutter, 2005) no invariance theorem is known. Our results are entirely negative: we discuss cases in which unlucky or adversarial choices of the UTM cause AIXI to misbehave drastically. We show that Legg-Hutter intelligence and thus balanced Pareto optimality is entirely subjective, and that every policy is Pareto optimal in the class of all computable environments. This undermines all existing optimality properties for AIXI. While it may still serve as a gold standard for AI, our results imply that AIXI is a relative theory, dependent on the choice of the UTM.
KW - AIXI
KW - Asymptotic Optimality
KW - Balanced Pareto Optimality
KW - General Reinforcement Learning
KW - Legg-Hutter Intelligence
KW - Universal Turing Machine
UR - http://www.scopus.com/inward/record.url?scp=84984698408&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84984698408
SN - 1532-4435
VL - 40
JO - Journal of Machine Learning Research
JF - Journal of Machine Learning Research
IS - 2015
T2 - 28th Conference on Learning Theory, COLT 2015
Y2 - 2 July 2015 through 6 July 2015
ER -