Abstract
We present our approach for robotic perception in cluttered scenes that led to winning the recent Amazon Robotics Challenge (ARC) 2017. Besides small objects with shiny and transparent surfaces, the biggest challenge of the 2017 competition was the introduction of unseen categories. In contrast to traditional approaches, which require large collections of annotated data and many hours of training, the task here was to obtain a robust perception pipeline with only a few minutes of data acquisition and training time. To that end, we present two strategies that we explored. One is a deep metric learning approach that works in three separate steps: semantic-agnostic boundary detection, patch classification and pixel-wise voting. The other is a fully supervised semantic segmentation approach with efficient dataset collection. We conduct an extensive analysis of the two methods on our ARC 2017 dataset. Interestingly, only a few examples of each class are sufficient to fine-tune even very deep convolutional neural networks for this specific task.
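To give a rough, illustrative picture of the second strategy's takeaway (that a few labeled examples per class suffice to fine-tune a very deep segmentation network in minutes), the sketch below adapts a torchvision FCN-ResNet-50 on a tiny stand-in dataset with a frozen backbone. The class count, image resolution, frozen-backbone scheme, and training schedule are assumptions for the sketch, not the pipeline described in the paper.

```python
# Hypothetical sketch of few-shot fine-tuning for dense per-pixel classification.
# Class count, image size, and the random stand-in data are placeholders, not
# the setup used in the paper.
import torch
import torch.nn as nn
from torchvision.models.segmentation import fcn_resnet50

NUM_CLASSES = 41      # e.g. item classes + background (placeholder value)
FEW_SHOT_IMAGES = 8   # only a handful of annotated frames

# Deep fully convolutional network; in practice the backbone would start from
# pretrained weights (weight loading omitted to keep the sketch version-independent).
model = fcn_resnet50(num_classes=NUM_CLASSES)

# Freeze the backbone so that only the segmentation head is adapted,
# which keeps training time short.
for p in model.backbone.parameters():
    p.requires_grad = False

optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad), lr=1e-4)
criterion = nn.CrossEntropyLoss()

# Stand-in for a small, quickly collected dataset: a few RGB images with
# dense per-pixel class labels.
images = torch.rand(FEW_SHOT_IMAGES, 3, 240, 320)
labels = torch.randint(0, NUM_CLASSES, (FEW_SHOT_IMAGES, 240, 320))

model.train()
for epoch in range(10):                 # a few epochs suffice on tiny data
    optimizer.zero_grad()
    logits = model(images)["out"]       # (N, C, H, W) class scores
    loss = criterion(logits, labels)
    loss.backward()
    optimizer.step()

model.eval()
with torch.no_grad():
    prediction = model(images[:1])["out"].argmax(dim=1)  # per-pixel labels
```

Freezing the backbone and training only the segmentation head is one common way to keep adaptation within a few minutes on limited data; the authors' actual training setup may differ.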
Original language | English |
---|---|
Title of host publication | 2018 IEEE International Conference on Robotics and Automation, ICRA 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1908-1915 |
Number of pages | 8 |
ISBN (Electronic) | 9781538630815 |
DOIs | |
Publication status | Published - 10 Sept 2018 |
Event | 2018 IEEE International Conference on Robotics and Automation, ICRA 2018 - Brisbane, Australia |
Duration | 21 May 2018 → 25 May 2018 |
Publication series
Name | Proceedings - IEEE International Conference on Robotics and Automation |
---|---|
ISSN (Print) | 1050-4729 |
Conference
Conference | 2018 IEEE International Conference on Robotics and Automation, ICRA 2018 |
---|---|
Country/Territory | Australia |
City | Brisbane |
Period | 21/05/18 → 25/05/18 |