Optimistic AIXI

Peter Sunehag*, Marcus Hutter

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    6 Citations (Scopus)

    Abstract

    We consider extending the AIXI agent by using multiple (or even a compact class of) priors. This has the benefit of weakening the conditions on the true environment that we need to prove asymptotic optimality. Furthermore, it decreases the arbitrariness of picking the prior or reference machine. We connect this to removing symmetry between accepting and rejecting bets in the rationality axiomatization of AIXI and replacing it with optimism. Optimism is often used to encourage exploration in the more restrictive Markov Decision Process setting and it alleviates the problem that AIXI (with geometric discounting) stops exploring prematurely.

    Original languageEnglish
    Title of host publicationArtificial General Intelligence - 5th International Conference, AGI 2012, Proceedings
    Pages312-321
    Number of pages10
    DOIs
    Publication statusPublished - 2012
    Event5th International Conference on Artificial General Intelligence, AGI 2012 - Oxford, United Kingdom
    Duration: 8 Dec 201211 Dec 2012

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume7716 LNAI
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference5th International Conference on Artificial General Intelligence, AGI 2012
    Country/TerritoryUnited Kingdom
    CityOxford
    Period8/12/1211/12/12

    Fingerprint

    Dive into the research topics of 'Optimistic AIXI'. Together they form a unique fingerprint.

    Cite this