A computational framework for attentional object discovery in RGB-D videos

被引:0
|
作者
Germán Martín García
Mircea Pavel
Simone Frintrop
机构
[1] University of Bonn,Institute of Computer Science VI
[2] University of Hamburg,Computer Vision Group, Department of Informatics
来源
Cognitive Processing | 2017年 / 18卷
关键词
RGB-D object discovery; Computational visual attention; 3D inhibition of return;
D O I
暂无
中图分类号
学科分类号
摘要
We present a computational framework for attention-guided visual scene exploration in sequences of RGB-D data. For this, we propose a visual object candidate generation method to produce object hypotheses about the objects in the scene. An attention system is used to prioritise the processing of visual information by (1) localising candidate objects, and (2) integrating an inhibition of return (IOR) mechanism grounded in spatial coordinates. This spatial IOR mechanism naturally copes with camera motions and inhibits objects that have already been the target of attention. Our approach provides object candidates which can be processed by higher cognitive modules such as object recognition. Since objects are basic elements for many higher level tasks, our architecture can be used as a first layer in any cognitive system that aims at interpreting a stream of images. We show in the evaluation how our framework finds most of the objects in challenging real-world scenes.
引用
收藏
页码:169 / 182
页数:13
相关论文
共 50 条
  • [1] A computational framework for attentional object discovery in RGB-D videos
    Garcia, German Martin
    Pavel, Mircea
    Frintrop, Simone
    COGNITIVE PROCESSING, 2017, 18 (02) : 169 - 182
  • [2] Salient Object Detection in RGB-D Videos
    Mou, Ao
    Lu, Yukang
    He, Jiahao
    Min, Dingyao
    Fu, Keren
    Zhao, Qijun
    IEEE Transactions on Image Processing, 2024, 33 : 6660 - 6675
  • [3] Attentional Scene-Exploration and Object Discovery in Image and RGB-D Data
    Garcia, German Martin
    Werner, Thomas
    Frintrop, Simone
    KUNSTLICHE INTELLIGENZ, 2015, 29 (01): : 75 - 81
  • [4] Joint Object Affordance Reasoning and Segmentation in RGB-D Videos
    Thermos, Spyridon
    Potamianos, Gerasimos
    Daras, Petros
    IEEE ACCESS, 2021, 9 : 89699 - 89713
  • [5] Object Discovery on RGB-D Data via Salient Object Proposals
    Li, Wanyi
    Wang, Peng
    Qiao, Hong
    Fan, Naiji
    Zhou, Hai
    Jing, Feng
    2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 737 - 739
  • [6] Bidirectional Attentional Interaction Networks for RGB-D salient object detection
    Wei, Weiyi
    Xu, Mengyu
    Wang, Jian
    Luo, Xuzhe
    IMAGE AND VISION COMPUTING, 2023, 138
  • [7] Learning human activities and object affordances from RGB-D videos
    Koppula, Hema Swetha
    Gupta, Rudhir
    Saxena, Ashutosh
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (08): : 951 - 970
  • [8] Cross-Modal Attentional Context Learning for RGB-D Object Detection
    Li, Guanbin
    Gan, Yukang
    Wu, Hejun
    Xiao, Nong
    Lin, Liang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1591 - 1601
  • [9] RGB-D Object Discovery via Multi-Scene Analysis
    Herbst, Evan
    Ren, Xiaofeng
    Fox, Dieter
    2011 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2011,
  • [10] ACTION RECOGNITION IN RGB-D EGOCENTRIC VIDEOS
    Tang, Yansong
    Tian, Yi
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3410 - 3414