3D Attention-Driven Depth Acquisition for Object Identification

被引:31
|
作者
Xu, Kai [1 ,2 ,3 ]
Shi, Yifei [1 ]
Zheng, Lintao [1 ]
Zhang, Junyu
Liu, Min [1 ]
Huang, Hui [3 ,4 ]
Su, Hao [5 ]
Cohen-Or, Daniel [6 ]
Chen, Baoquan [2 ]
机构
[1] Natl Univ Def Technol, HPCL, Changsha, Hunan, Peoples R China
[2] Shandong Univ, Jinan, Shandong, Peoples R China
[3] Shenzhen Univ, Shenzhen, Guangdong, Peoples R China
[4] SIAT, Shenzhen, Peoples R China
[5] Stanford Univ, Stanford, CA 94305 USA
[6] Tel Aviv Univ, IL-69978 Tel Aviv, Israel
来源
ACM TRANSACTIONS ON GRAPHICS | 2016年 / 35卷 / 06期
基金
中国国家自然科学基金;
关键词
3D acquisition; depth camera; next-best-view; object identification; attention-based model; shape classification; IMAGE;
D O I
10.1145/2980179.2980224
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We address the problem of autonomously exploring unknown objects in a scene by consecutive depth acquisitions. The goal is to reconstruct the scene while online identifying the objects from among a large collection of 3D shapes. Fine-grained shape identification demands a meticulous series of observations attending to varying views and parts of the object of interest. Inspired by the recent success of attention-based models for 2D recognition, we develop a 3D Attention Model that selects the best views to scan from, as well as the most informative regions in each view to focus on, to achieve efficient object recognition. The region-level attention leads to focus-driven features which are quite robust against object occlusion. The attention model, trained with the 3D shape collection, encodes the temporal dependencies among consecutive views with deep recurrent networks. This facilitates order-aware view planning accounting for robot movement cost. In achieving instance identification, the shape collection is organized into a hierarchy, associated with pre-trained hierarchical classifiers. The effectiveness of our method is demonstrated on an autonomous robot (PR) that explores a scene and identifies the objects to construct a 3D scene model.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Attention-driven Segmentation of Cluttered 3D Scenes
    Potapova, Ekaterina
    Zillich, Michael
    Vincze, Markus
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3610 - 3613
  • [2] Incremental Attention-driven Object Segmentation
    Potapova, Ekaterina
    Richtsfeld, Andreas
    Zillich, Michael
    Vincze, Markus
    [J]. 2014 14TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2014, : 252 - 258
  • [3] Attention-driven parts-based object detection
    Autio, I
    Lindgren, JT
    [J]. ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 917 - 921
  • [4] Look Deeper into Depth: Monocular Depth Estimation with Semantic Booster and Attention-Driven Loss
    Jiao, Jianbo
    Cao, Ying
    Song, Yibing
    Lau, Rynson
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 55 - 71
  • [5] Attention-driven conscious bi-stable stereoscopic depth perception
    van Ee, R
    [J]. PERCEPTION, 2003, 32 : 45 - 45
  • [6] BEVDepth: Acquisition of Reliable Depth for Multi-View 3D Object Detection
    Li, Yinhao
    Ge, Zheng
    Yu, Guanyi
    Yang, Jinrong
    Wang, Zengran
    Shi, Yukang
    Sun, Jianjian
    Li, Zeming
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1477 - 1485
  • [7] Attention-driven Object Detection and Segmentation of Cluttered Table Scenes using 2.5D Symmetry
    Potapova, Ekaterina
    Varadarajan, Karthik M.
    Richtsfeld, Andreas
    Zillich, Michael
    Vincze, Markus
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 4946 - 4952
  • [8] Can't Take my Eye off You: Attention-Driven Monocular Obstacle Detection and 3D Mapping
    Einhorn, E.
    Schroeter, Ch
    Gross, H. M.
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 816 - 821
  • [9] Spectral Attention-Driven Intelligent Target Signal Identification on a Wideband Spectrum
    Mendis, Gihan J.
    Wei, Jin
    Madanayake, Arjuna
    Mandal, Soumyajit
    [J]. 2019 IEEE COGNITIVE COMMUNICATIONS FOR AEROSPACE APPLICATIONS WORKSHOP (CCAAW), 2019,
  • [10] 3D Depth of Field Acquisition Based on OpenCV
    Yu, Feng
    Shuai, Junfeng
    Yin, Baolin
    Feng, Xiaoming
    [J]. PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 920 - 923