Optimistic agents are asymptotically optimal

Peter Sunehag*, Marcus Hutter

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    9 Citations (Scopus)

    Abstract

    We use optimism to introduce generic asymptotically optimal reinforcement learning agents. They achieve, with an arbitrary finite or compact class of environments, asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.

    Original languageEnglish
    Title of host publicationAI 2012
    Subtitle of host publicationAdvances in Artificial Intelligence - 25th Australasian Joint Conference, Proceedings
    Pages15-26
    Number of pages12
    DOIs
    Publication statusPublished - 2012
    Event25th Australasian Joint Conference on Artificial Intelligence, AI 2012 - Sydney, NSW, Australia
    Duration: 4 Dec 20127 Dec 2012

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume7691 LNAI
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference25th Australasian Joint Conference on Artificial Intelligence, AI 2012
    Country/TerritoryAustralia
    CitySydney, NSW
    Period4/12/127/12/12

    Fingerprint

    Dive into the research topics of 'Optimistic agents are asymptotically optimal'. Together they form a unique fingerprint.

    Cite this