Bad universal priors and notions of optimality

Jan Leike, Marcus Hutter

    Research output: Contribution to journalConference articlepeer-review

    8 Citations (Scopus)

    Abstract

    A big open question of algorithmic information theory is the choice of the universal Turing machine (UTM). For Kolmogorov complexity and Solomonoff induction we have invariance theorems: the choice of the UTM changes bounds only by a constant. For the universally intelligent agent AIXI (Hutter, 2005) no invariance theorem is known. Our results are entirely negative: we discuss cases in which unlucky or adversarial choices of the UTM cause AIXI to misbehave drastically. We show that Legg-Hutter intelligence and thus balanced Pareto optimality is entirely subjective, and that every policy is Pareto optimal in the class of all computable environments. This undermines all existing optimality properties for AIXI. While it may still serve as a gold standard for AI, our results imply that AIXI is a relative theory, dependent on the choice of the UTM.

    Original languageEnglish
    JournalJournal of Machine Learning Research
    Volume40
    Issue number2015
    Publication statusPublished - 2015
    Event28th Conference on Learning Theory, COLT 2015 - Paris, France
    Duration: 2 Jul 20156 Jul 2015

    Fingerprint

    Dive into the research topics of 'Bad universal priors and notions of optimality'. Together they form a unique fingerprint.

    Cite this