Multi-view Fusion for Multi-level Robotic Scene Understanding

被引:11
|
作者
Lin, Yunzhi [1 ,2 ]
Tremblay, Jonathan [1 ]
Tyree, Stephen [1 ]
Vela, Patricio A. [2 ]
Birchfield, Stan [1 ]
机构
[1] NVIDIA, Berkeley, CA 94720 USA
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
D O I
10.1109/IROS51168.2021.9635994
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present a system for multi-level scene awareness for robotic manipulation. Given a sequence of camera-in-hand RGB images, the system calculates three types of information: 1) a point cloud representation of all the surfaces in the scene, for the purpose of obstacle avoidance. 2) the rough pose of unknown objects from categories corresponding to primitive shapes (e.g., cuboids and cylinders), and 3) full 6-DoF pose of known objects. By developing and fusing recent techniques in these domains, we provide a rich scene representation for robot awareness. We demonstrate the importance of each of these modules, their complementary nature, and the potential benefits of the system in the context of robotic manipulation.
引用
收藏
页码:6817 / 6824
页数:8
相关论文
共 50 条
  • [1] OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images
    Li, Weijia
    Lai, Yawen
    Xu, Linning
    Xiangli, Yuanbo
    Yu, Jinhua
    He, Conghui
    Xia, Gui-Song
    Lin, Dahua
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17397 - 17407
  • [2] MCoCo: Multi-level Consistency Collaborative multi-view clustering
    Zhou, Yiyang
    Zheng, Qinghai
    Wang, Yifei
    Yan, Wenbiao
    Shi, Pengcheng
    Zhu, Jihua
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [3] Towards the Integration of Multi-Level and Multi-View Modelling for Interoperability
    Fu, Yuhong
    Grossmann, Georg
    Kaur, Karamjit
    Selway, Matt
    Stumptner, Markus
    [J]. 2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 679 - 688
  • [4] Multi-level photo quality assessment with multi-view features
    Dong, Zhe
    Tian, Xinmei
    [J]. NEUROCOMPUTING, 2015, 168 : 308 - 319
  • [5] Multi-level Feature Learning for Contrastive Multi-view Clustering
    Xu, Jie
    Tang, Huayi
    Ren, Yazhou
    Peng, Liang
    Zhu, Xiaofeng
    He, Lifang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16030 - 16039
  • [6] Multi-level correlation learning for multi-view unsupervised feature selection
    Wu, Jian-Sheng
    Gong, Jun-Xiao
    Liu, Jing-Xin
    Min, Weidong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 281
  • [7] Multi-view PointNet for 3D Scene Understanding
    Jaritz, Maximilian
    Gu, Jiayuan
    Su, Hao
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3995 - 4003
  • [8] MULTI-LEVEL SCENE UNDERSTANDING VIA HIERARCHICAL CLASSIFICATION
    Clouse, Hamilton Scott
    Bian, Xiao
    Gentimis, Thanos
    Krim, Hamid
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 966 - 970
  • [9] Multi-view and Multi-level network for fault diagnosis accommodating feature transferability
    Lu, Na
    Cui, Zhiyan
    Hu, Huiyang
    Yin, Tao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [10] Semantic Scene Completion through Multi-Level Feature Fusion
    Fu, Ruochong
    Wu, Hang
    Hao, Mengxiang
    Miao, Yubin
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8399 - 8406