Knowledge Resource Center for Ecological Environment in Arid Area
Lifelong Robotic Object Perception | |
Collet Romea, Alvaro | |
出版年 | 2012 |
学位类型 | 博士 |
导师 | Hebert, Martial ; Srinivasa, Siddhartha |
学位授予单位 | Carnegie Mellon University |
英文摘要 | In this thesis, we study the topic of Lifelong Robotic Object Perception. We propose, as a long-term goal, a framework to recognize known objects and to discover unknown objects in the environment as the robot operates, for as long as the robot operates. We build the foundations for Lifelong Robotic Object Perception by focusing our study on the two critical components of this framework: 1) how to recognize and register known objects for robotic manipulation, and 2) how to automatically discover novel objects in the environment so that we can recognize them in the future. Our work on Object Recognition and Pose Estimation addresses two main challenges in computer vision for robotics: robust performance in complex scenes, and low latency for real-time operation. We present MOPED, a framework for Multiple Object Pose Estimation and Detection that integrates single-image and multi-image object recognition and pose estimation in one optimized, robust, and scalable framework. We extend MOPED to leverage RGBD images using an adaptive image-depth fusion model based on maximum likelihood estimates. We incorporate this model to each stage of MOPED to achieve object recognition robust to imperfect depth data. In Robotic Object Discovery, we address the challenges of scalability and robustness for long-term operation. As a first step towards Lifelong Robotic Object Perception, we aim to automatically process the raw video stream of an entire workday of a robotic agent to discover novel objects. The key to achieve this goal is to incorporate non-visual information—robotic metadata—in the discovery process. We encode the natural constraints and non-visual sensory information in service robotics to make long-term object discovery feasible. We introduce an optimized implementation, HerbDisc, that processes a video stream of 6 h 20 min of challenging human environments in under 19 min and discovers 206 novel objects. We tailor our solutions to the sensing capabilities and requirements in service robotics, with the goal of enabling our service robot, HERB, to operate autonomously in human environments. |
英文关键词 | Computer vision Object recognition Sensor fusion |
语种 | 英语 |
国家 | United States |
来源学科分类 | Robotics; Computer science |
URL | https://pqdtopen.proquest.com/doc/1268027626.html?FMT=AI |
来源机构 | Carnegie Mellon University |
资源类型 | 学位论文 |
条目标识符 | http://119.78.100.177/qdio/handle/2XILL650/246279 |
推荐引用方式 GB/T 7714 | Collet Romea, Alvaro. Lifelong Robotic Object Perception[D]. Carnegie Mellon University,2012. |
条目包含的文件 | 条目无相关文件。 |
个性服务 |
推荐该条目 |
保存到收藏夹 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[Collet Romea, Alvaro]的文章 |
百度学术 |
百度学术中相似的文章 |
[Collet Romea, Alvaro]的文章 |
必应学术 |
必应学术中相似的文章 |
[Collet Romea, Alvaro]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。