A computational framework for attentional object discovery in RGB-D videos

被引:0
|
作者
Garcia, German Martin [1 ]
Pavel, Mircea [1 ]
Frintrop, Simone [2 ]
机构
[1] Univ Bonn, Inst Comp Sci 6, Bonn, Germany
[2] Univ Hamburg, Dept Informat, Comp Vis Grp, Hamburg, Germany
关键词
RGB-D object discovery; Computational visual attention; 3D inhibition of return; INHIBITION; RETURN; MODEL;
D O I
10.1007/s10339-017-0791-z
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
We present a computational framework for attention-guided visual scene exploration in sequences of RGB-D data. For this, we propose a visual object candidate generation method to produce object hypotheses about the objects in the scene. An attention system is used to prioritise the processing of visual information by (1) localising candidate objects, and (2) integrating an inhibition of return (IOR) mechanism grounded in spatial coordinates. This spatial IOR mechanism naturally copes with camera motions and inhibits objects that have already been the target of attention. Our approach provides object candidates which can be processed by higher cognitive modules such as object recognition. Since objects are basic elements for many higher level tasks, our architecture can be used as a first layer in any cognitive system that aims at interpreting a stream of images. We show in the evaluation how our framework finds most of the objects in challenging real-world scenes.
引用
收藏
页码:169 / 182
页数:14
相关论文
共 50 条
  • [1] A computational framework for attentional object discovery in RGB-D videos
    Germán Martín García
    Mircea Pavel
    Simone Frintrop
    [J]. Cognitive Processing, 2017, 18 : 169 - 182
  • [2] Attentional Scene-Exploration and Object Discovery in Image and RGB-D Data
    Garcia, German Martin
    Werner, Thomas
    Frintrop, Simone
    [J]. KUNSTLICHE INTELLIGENZ, 2015, 29 (01): : 75 - 81
  • [3] Joint Object Affordance Reasoning and Segmentation in RGB-D Videos
    Thermos, Spyridon
    Potamianos, Gerasimos
    Daras, Petros
    [J]. IEEE ACCESS, 2021, 9 : 89699 - 89713
  • [4] Object Discovery on RGB-D Data via Salient Object Proposals
    Li, Wanyi
    Wang, Peng
    Qiao, Hong
    Fan, Naiji
    Zhou, Hai
    Jing, Feng
    [J]. 2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 737 - 739
  • [5] Bidirectional Attentional Interaction Networks for RGB-D salient object detection
    Wei, Weiyi
    Xu, Mengyu
    Wang, Jian
    Luo, Xuzhe
    [J]. IMAGE AND VISION COMPUTING, 2023, 138
  • [6] Learning human activities and object affordances from RGB-D videos
    Koppula, Hema Swetha
    Gupta, Rudhir
    Saxena, Ashutosh
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (08): : 951 - 970
  • [7] Cross-Modal Attentional Context Learning for RGB-D Object Detection
    Li, Guanbin
    Gan, Yukang
    Wu, Hejun
    Xiao, Nong
    Lin, Liang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1591 - 1601
  • [8] RGB-D Object Discovery via Multi-Scene Analysis
    Herbst, Evan
    Ren, Xiaofeng
    Fox, Dieter
    [J]. 2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011,
  • [9] ACTION RECOGNITION IN RGB-D EGOCENTRIC VIDEOS
    Tang, Yansong
    Tian, Yi
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3410 - 3414
  • [10] Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos
    Lu, Shiyang
    Deng, Yunfu
    Boularias, Abdeslam
    Bekris, Kostas
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 7017 - 7023