TY - GEN
T1 - Universal knowledge-seeking agents for stochastic environments
AU - Orseau, Laurent
AU - Lattimore, Tor
AU - Hutter, Marcus
PY - 2013
Y1 - 2013
N2 - We define an optimal Bayesian knowledge-seeking agent, KL-KSA, designed for countable hypothesis classes of stochastic environments and whose goal is to gather as much information about the unknown world as possible. Although this agent works for arbitrary countable classes and priors, we focus on the especially interesting case where all stochastic computable environments are considered and the prior is based on Solomonoff's universal prior. Among other properties, we show that KL-KSA learns the true environment in the sense that it learns to predict the consequences of actions it does not take. We show that it does not consider noise to be information and avoids taking actions leading to inescapable traps. We also present a variety of toy experiments demonstrating that KL-KSA behaves according to expectation.
AB - We define an optimal Bayesian knowledge-seeking agent, KL-KSA, designed for countable hypothesis classes of stochastic environments and whose goal is to gather as much information about the unknown world as possible. Although this agent works for arbitrary countable classes and priors, we focus on the especially interesting case where all stochastic computable environments are considered and the prior is based on Solomonoff's universal prior. Among other properties, we show that KL-KSA learns the true environment in the sense that it learns to predict the consequences of actions it does not take. We show that it does not consider noise to be information and avoids taking actions leading to inescapable traps. We also present a variety of toy experiments demonstrating that KL-KSA behaves according to expectation.
KW - Solomonoff induction
KW - Universal artificial intelligence
KW - algorithmic information theory
KW - exploration
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=84887449337&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-40935-6_12
DO - 10.1007/978-3-642-40935-6_12
M3 - Conference contribution
SN - 9783642409349
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 158
EP - 172
BT - Algorithmic Learning Theory - 24th International Conference, ALT 2013, Proceedings
T2 - 24th International Conference on Algorithmic Learning Theory, ALT 2013
Y2 - 6 October 2013 through 9 October 2013
ER -