Identifying Reusable Early-Life Options

Aline Weber, Charles P. Martin, Jim Torresen, Bruno C. Da Silva

Research output: Chapter in Book/Report/Conference proceedingConference Paperpeer-review

1 Citation (Scopus)

Abstract

We introduce a method for identifying short-duration reusable motor behaviors, which we call early-life options, that allow robots to perform well even in the very early stages of their lives. This is important when agents need to operate in environments where the use of poor-performing policies (such as the random policies with which they are typically initialized) may be catastrophic. Our method augments the original action set of the agent with specially-constructed behaviors that maximize performance over a possibly infinite family of related motor tasks. These are akin to primitive reflexes in infant mammals - agents born with our early-life options, even if acting randomly, are capable of producing rudimentary behaviors comparable to those acquired by agents that actively optimize a policy for hundreds of thousands of steps. We also introduce three metrics for identifying useful early-life options and show that they result in behaviors that maximize both the option's expected return while minimizing the risk that executing the option will result in extremely poor performance. We evaluate our technique on three simulated robots tasked with learning to walk under different battery consumption constraints and show that even random policies over early-life options are already sufficient to allow for the agent to perform similarly to agents trained for hundreds of thousands of steps.

Original languageEnglish
Title of host publication2019 Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2019
EditorsAmir Aly, Estela Bicho, Sofiane Boucenna, Bruno Castro da Silva, Mohamed Chetouani, Angel P. del Pobil, Julien Diard, Stephane Doncieux, Tilbe Goksun, Angela Grimminger, Frank Guerin, Yoshinobu Hagiwara, Lorenzo Jamone, Sinan Kalkan, Bruno Lara, Clement Moulin-Frier, Shingo Murata, Takayuki Nagai, Yukie Nagai, Iris Nomikou, Masaki Ogino, Pierre-Yves Oudeyer, Alfredo F. Pereira, Alexandre Pitti, Joanna Raczaszek-Leonardi, Sebastian Risi, Benjamin Rosman, Yulia Sandamirskaya, Malte Schilling, Alessandra Sciutti, Patricia Shaw, Andrea Soltoggio, Michael Spranger, Tadahiro Taniguchi, Serge Thill, Jochen Triesch, Emre Ugur, Anna-Lisa Vollmer
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages335-340
Number of pages6
ISBN (Electronic)978-1-5386-8128-2
ISBN (Print)978-1-5386-8129-9
DOIs
Publication statusPublished - 30 Sept 2019
Externally publishedYes
Event9th Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2019 - Oslo, Norway
Duration: 19 Aug 201922 Aug 2019

Publication series

NameIEEE International Conference on Development and Learning, ICDL
PublisherIEEE
Number2019
ISSN (Print)2161-9484
ISSN (Electronic)2161-9484

Conference

Conference9th Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2019
Country/TerritoryNorway
CityOslo
Period19/08/1922/08/19

Fingerprint

Dive into the research topics of 'Identifying Reusable Early-Life Options'. Together they form a unique fingerprint.

Cite this