A game-theoretic analysis of the off-switch game

Tobias Wängberg, Mikael Böörs, Elliot Catt, Tom Everitt*, Marcus Hutter

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    4 Citations (Scopus)

    Abstract

    The off-switch game is a game theoretic model of a highly intelligent robot interacting with a human. In the original paper by Hadfield-Menell et al. (2016b), the analysis is not fully game-theoretic as the human is modelled as an irrational player, and the robot’s best action is only calculated under unrealistic normality and soft-max assumptions. In this paper, we make the analysis fully game theoretic, by modelling the human as a rational player with a random utility function. As a consequence, we are able to easily calculate the robot’s best action for arbitrary belief and irrationality assumptions.

    Original languageEnglish
    Title of host publicationArtificial General Intelligence - 10th International Conference, AGI 2017, Proceedings
    EditorsTom Everitt, Ben Goertzel, Alexey Potapov
    PublisherSpringer Verlag
    Pages167-177
    Number of pages11
    ISBN (Print)9783319637020
    DOIs
    Publication statusPublished - 2017
    Event10th International Conference on Artificial General Intelligence, AGI 2017 - Melbourne, Australia
    Duration: 15 Aug 201718 Aug 2017

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume10414 LNAI
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference10th International Conference on Artificial General Intelligence, AGI 2017
    Country/TerritoryAustralia
    CityMelbourne
    Period15/08/1718/08/17

    Fingerprint

    Dive into the research topics of 'A game-theoretic analysis of the off-switch game'. Together they form a unique fingerprint.

    Cite this