PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes

被引：0

作者：

Xiang, Yu ^{[1
,2
]}

Schmidt, Tanner ^{[2
]}

Narayanan, Venkatraman ^{[3
]}

Fox, Dieter ^{[1
,2
]}

机构：

[1] NVIDIA Res, Santa Clara, CA 95051 USA

[2] Univ Washington, Seattle, WA 98195 USA

[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

来源：

ROBOTICS: SCIENCE AND SYSTEMS XIV | 2018年

关键词：

RECOGNITION;

D O I：

暂无

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Estimating the 6D pose of known objects is important for robots to interact with the real world. The problem is challenging due to the variety of objects as well as the complexity of a scene caused by clutter and occlusions between objects. In this work, we introduce PoseCNN, a new Convolutional Neural Network for 6D object pose estimation. PoseCNN estimates the 3D translation of an object by localizing its center in the image and predicting its distance from the camera. The 3D rotation of the object is estimated by regressing to a quaternion representation. We also introduce a novel loss function that enables PoseCNN to handle symmetric objects. In addition, we contribute a large scale video dataset for 6D object pose estimation named the YCB-Video dataset. Our dataset provides accurate 6D poses of 21 objects from the YCB dataset observed in 92 videos with 133,827 frames. We conduct extensive experiments on our YCB-Video dataset and the OccludedLINEMOD dataset to show that PoseCNN is highly robust to occlusions, can handle symmetric objects, and provide accurate pose estimation using only color images as input. When using depth data to further refine the poses, our approach achieves state-of-the-art results on the challenging OccludedLINEMOD dataset. Our code and dataset are available at https://rse-lab.cs.washington.edu/projects/posecnn/.

引用

页数：10

共 50 条

[1] Accurate 6D Object Pose Estimation and Refinement in Cluttered Scenes
Jin, Yixiang
Rossiter, John Anthony
Veres, Sandor M.
[J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS (ROBOVIS), 2021, : 31 - 39
[2] 6D Object Pose Estimation in Cluttered Scenes from RGB Images
Xiao-Long Yang
Xiao-Hong Jia
Yuan Liang
Lu-Bin Fan
[J]. Journal of Computer Science and Technology, 2022, 37 : 719 - 730
[3] 6D Object Pose Estimation in Cluttered Scenes from RGB Images
Yang, Xiao-Long
Jia, Xiao-Hong
Liang, Yuan
Fan, Lu-Bin
[J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (03): : 719 - 730
[4] Learning latent geometric consistency for 6D object pose estimation in heavily cluttered scenes
Li, Qingnan
Hu, Ruimin
Xiao, Jing
Wang, Zhongyuan
Chen, Yu
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 70
[5] Graph neural network for 6D object pose estimation
Yin, Pengshuai
Ye, Jiayong
Lin, Guoshen
Wu, Qingyao
[J]. KNOWLEDGE-BASED SYSTEMS, 2021, 218
[6] Robust 6D Object Pose Estimation in Cluttered Scenes using Semantic Segmentation and Pose Regression Networks
Periyasamy, Arul Selvam
Schwarz, Max
Behnke, Sven
[J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 6660 - 6666
[7] 6D pose estimation of object based on fused region-level feature in cluttered scenes
Liu, Xiangpeng
Duanmu, Huiping
An, Kang
Wang, Wancheng
Song, Yaqing
Gu, Qingying
Yuan, Bo
Wang, Danning
[J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (07)
[8] Deep instance segmentation and 6D object pose estimation in cluttered scenes for robotic autonomous grasping
Wu, Yongxiang
Fu, Yili
Wang, Shuguo
[J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2020, 47 (04): : 593 - 606
[9] 6D Hybrid Pose Estimation in Cluttered Industrial Scenes for Robotic Grasping
Peng, Yueyan
Yang, Xuyun
Wei, Sheng
Gao, Xiang
Li, Wei
Wen, James Zhiging
[J]. 2022 INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, ROBOTICS AND CONTROL ENGINEERING, IARCE, 2022, : 19 - 23
[10] ConvPoseCNN: Dense Convolutional 6D Object Pose Estimation
Capellen, Catherine
Schwarz, Max
Behnke, Sven
[J]. PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 162 - 172

← 1 2 3 4 5 →