Abstract
In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are no examples demonstrating the known results regarding generalised discounting. We have added to the GRL simulation platform (AIXIjs) the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on an agent's policy. Using this, we investigate how geometric, hyperbolic and power discounting affect an informed agent in a simple MDP. We experimentally reproduce a number of theoretical results, and discuss some related subtleties. It was found that the agent's behaviour followed what is expected theoretically, assuming appropriate parameters were chosen for the Monte-Carlo Tree Search (MCTS) planning algorithm.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS |
Editors | E Durfee, M Winikoff, K Larson & S Das |
Place of Publication | TBC |
Publisher | IFAAMAS (International Foundation for Autonomous Agents and Multiagent Systems) |
Pages | 1589-1591 |
Edition | To be checked |
ISBN (Print) | 978-151085507-6 |
DOIs | |
Publication status | Published - 2017 |
Event | 16th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2017 - Sao Paulo, Brazil Duration: 1 Jan 2017 → … |
Conference
Conference | 16th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2017 |
---|---|
Period | 1/01/17 → … |
Other | May 8-12 2017 |