Death and suicide in universal artificial intelligence

Jarryd Martin*, Tom Everitt, Marcus Hutter

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    8 Citations (Scopus)

    Abstract

    Reinforcement learning (RL) is a general paradigm for studying intelligent behaviour, with applications ranging from artificial intelligence to psychology and economics. AIXI is a universal solution to the RL problem; it can learn any computable environment. A technical subtlety of AIXI is that it is defined using a mixture over semimeasures that need not sum to 1, rather than over proper probability measures. In this work we argue that the shortfall of a semimeasure can naturally be interpreted as the agent’s estimate of the probability of its death. We formally define death for generally intelligent agents like AIXI, and prove a number of related theorems about their behaviour. Notable discoveries include that agent behaviour can change radically under positive linear transformations of the reward signal (from suicidal to dogmatically self-preserving), and that the agent’s posterior belief that it will survive increases over time.

    Original languageEnglish
    Title of host publicationArtificial General Intelligence - 9th International Conference, AGI 2016, Proceedings
    EditorsBas Steunebrink, Pei Wang, Ben Goertzel
    PublisherSpringer Verlag
    Pages23-32
    Number of pages10
    ISBN (Print)9783319416489
    DOIs
    Publication statusPublished - 2016
    Event9th International Conference on Artificial General Intelligence, AGI 2016 - New York, United States
    Duration: 16 Jul 201619 Jul 2016

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume9782
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference9th International Conference on Artificial General Intelligence, AGI 2016
    Country/TerritoryUnited States
    CityNew York
    Period16/07/1619/07/16

    Fingerprint

    Dive into the research topics of 'Death and suicide in universal artificial intelligence'. Together they form a unique fingerprint.

    Cite this