PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes

被引:0
|
作者
Xiang, Yu [1 ,2 ]
Schmidt, Tanner [2 ]
Narayanan, Venkatraman [3 ]
Fox, Dieter [1 ,2 ]
机构
[1] NVIDIA Res, Santa Clara, CA 95051 USA
[2] Univ Washington, Seattle, WA 98195 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
RECOGNITION;
D O I
暂无
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Estimating the 6D pose of known objects is important for robots to interact with the real world. The problem is challenging due to the variety of objects as well as the complexity of a scene caused by clutter and occlusions between objects. In this work, we introduce PoseCNN, a new Convolutional Neural Network for 6D object pose estimation. PoseCNN estimates the 3D translation of an object by localizing its center in the image and predicting its distance from the camera. The 3D rotation of the object is estimated by regressing to a quaternion representation. We also introduce a novel loss function that enables PoseCNN to handle symmetric objects. In addition, we contribute a large scale video dataset for 6D object pose estimation named the YCB-Video dataset. Our dataset provides accurate 6D poses of 21 objects from the YCB dataset observed in 92 videos with 133,827 frames. We conduct extensive experiments on our YCB-Video dataset and the OccludedLINEMOD dataset to show that PoseCNN is highly robust to occlusions, can handle symmetric objects, and provide accurate pose estimation using only color images as input. When using depth data to further refine the poses, our approach achieves state-of-the-art results on the challenging OccludedLINEMOD dataset. Our code and dataset are available at https://rse-lab.cs.washington.edu/projects/posecnn/.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Accurate 6D Object Pose Estimation and Refinement in Cluttered Scenes
    Jin, Yixiang
    Rossiter, John Anthony
    Veres, Sandor M.
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, COMPUTER VISION AND INTELLIGENT SYSTEMS (ROBOVIS), 2021, : 31 - 39
  • [2] 6D Object Pose Estimation in Cluttered Scenes from RGB Images
    Xiao-Long Yang
    Xiao-Hong Jia
    Yuan Liang
    Lu-Bin Fan
    [J]. Journal of Computer Science and Technology, 2022, 37 : 719 - 730
  • [3] 6D Object Pose Estimation in Cluttered Scenes from RGB Images
    Yang, Xiao-Long
    Jia, Xiao-Hong
    Liang, Yuan
    Fan, Lu-Bin
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (03): : 719 - 730
  • [4] Learning latent geometric consistency for 6D object pose estimation in heavily cluttered scenes
    Li, Qingnan
    Hu, Ruimin
    Xiao, Jing
    Wang, Zhongyuan
    Chen, Yu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 70
  • [5] Graph neural network for 6D object pose estimation
    Yin, Pengshuai
    Ye, Jiayong
    Lin, Guoshen
    Wu, Qingyao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 218
  • [6] Robust 6D Object Pose Estimation in Cluttered Scenes using Semantic Segmentation and Pose Regression Networks
    Periyasamy, Arul Selvam
    Schwarz, Max
    Behnke, Sven
    [J]. 2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 6660 - 6666
  • [7] 6D pose estimation of object based on fused region-level feature in cluttered scenes
    Liu, Xiangpeng
    Duanmu, Huiping
    An, Kang
    Wang, Wancheng
    Song, Yaqing
    Gu, Qingying
    Yuan, Bo
    Wang, Danning
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (07)
  • [8] Deep instance segmentation and 6D object pose estimation in cluttered scenes for robotic autonomous grasping
    Wu, Yongxiang
    Fu, Yili
    Wang, Shuguo
    [J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2020, 47 (04): : 593 - 606
  • [9] 6D Hybrid Pose Estimation in Cluttered Industrial Scenes for Robotic Grasping
    Peng, Yueyan
    Yang, Xuyun
    Wei, Sheng
    Gao, Xiang
    Li, Wei
    Wen, James Zhiging
    [J]. 2022 INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, ROBOTICS AND CONTROL ENGINEERING, IARCE, 2022, : 19 - 23
  • [10] ConvPoseCNN: Dense Convolutional 6D Object Pose Estimation
    Capellen, Catherine
    Schwarz, Max
    Behnke, Sven
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 162 - 172