RGB-D object detection and semantic segmentation for autonomous manipulation in clutter

Cited by: 120
Authors
Schwarz, Max [1]
Milan, Anton [2]
Periyasamy, Arul Selvam [1]
Behnke, Sven [1]
Affiliations
[1] Univ Bonn, Bonn, Germany
[2] Univ Adelaide, Adelaide, SA, Australia
Source
Funding
EU Horizon 2020
Keywords
Deep learning; object perception; RGB-D camera; transfer learning; object detection; semantic segmentation
DOI
10.1177/0278364917713117
Chinese Library Classification
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Autonomous robotic manipulation in clutter is challenging. A large variety of objects must be perceived in complex scenes, where they are partially occluded and embedded among many distractors, often in restricted spaces. To tackle these challenges, we developed a deep-learning approach that combines object detection and semantic segmentation. The manipulation scenes are captured with RGB-D cameras, for which we developed a depth fusion method. Employing pretrained features makes learning from small annotated robotic datasets possible. We evaluate our approach on two challenging datasets: one captured for the Amazon Picking Challenge 2016, where our team NimbRo came in second in the Stowing and third in the Picking task; and one captured in disaster-response scenarios. The experiments show that object detection and semantic segmentation complement each other and can be combined to yield reliable object perception.
Pages: 437-451
Page count: 15