TY - GEN
T1 - Optimistic agents are asymptotically optimal
AU - Sunehag, Peter
AU - Hutter, Marcus
PY - 2012
Y1 - 2012
N2 - We use optimism to introduce generic asymptotically optimal reinforcement learning agents. They achieve, with an arbitrary finite or compact class of environments, asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.
AB - We use optimism to introduce generic asymptotically optimal reinforcement learning agents. They achieve, with an arbitrary finite or compact class of environments, asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.
KW - Optimality
KW - Optimism
KW - Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=84871399605&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-35101-3_2
DO - 10.1007/978-3-642-35101-3_2
M3 - Conference contribution
SN - 9783642351006
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 15
EP - 26
BT - AI 2012
T2 - 25th Australasian Joint Conference on Artificial Intelligence, AI 2012
Y2 - 4 December 2012 through 7 December 2012
ER -