Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective

Tom Everitt*, Marcus Hutter, Ramana Kumar, Victoria Krakovna

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    39 Citations (Scopus)

    Fingerprint

    Dive into the research topics of 'Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective'. Together they form a unique fingerprint.

    Computer Science

    Medicine and Dentistry