Abstract
The problem of object detection and recognition is a notoriously difficult one, and one that has been the focus of much work in the computer vision and robotics communities. Most work has concentrated on systems that operate purely on visual inputs (i.e. , images) and largely ignores other sensor modalities. However, despite the great progress made down this track, the goal of high accuracy object detection for robotic platforms in cluttered real-world environments remains elusive. Instead of relying on information from the image alone, we present a method that exploits the multiple sensor modalities available on a robotic platform. In particular, our method augments a 2-d object detector with 3-d information from a depth sensor to produce a “multi-modal object detector.” We demonstrate our method on a working robotic system and evaluate its performance on a number of common household/office objects.
Original language | English |
---|---|
Title of host publication | Proceedings of European Conference on Computer Vision (ECCV 2008) |
Editors | Forsyth, David; Torr, Philip; Zisserman, Andrew (Eds.) |
Place of Publication | France |
Publisher | Springer |
Pages | 1-12 |
Edition | Peer Reviewed |
ISBN (Print) | 9783540886921 |
Publication status | Published - 2008 |
Event | 10th European Conference on Computer Vision (ECCV 2008) - Marseille France Duration: 1 Jan 2008 → … http://www.springer.com/computer/image+processing/book/978-3-540-88692-1 |
Conference
Conference | 10th European Conference on Computer Vision (ECCV 2008) |
---|---|
Period | 1/01/08 → … |
Other | October 12-18 2008 |
Internet address |