A computational framework for attentional object discovery in RGB-D videos

Cited by: 0

Authors:

Germán Martín García

Mircea Pavel

Simone Frintrop

Affiliations:

[1] University of Bonn, Institute of Computer Science VI

[2] University of Hamburg, Computer Vision Group, Department of Informatics

Source:

Cognitive Processing | 2017 / Vol. 18, No. 2

Keywords:

RGB-D object discovery; Computational visual attention; 3D inhibition of return

DOI:

Not available

Abstract:

We present a computational framework for attention-guided visual scene exploration in sequences of RGB-D data. For this, we propose a visual object candidate generation method to produce object hypotheses about the objects in the scene. An attention system is used to prioritise the processing of visual information by (1) localising candidate objects, and (2) integrating an inhibition of return (IOR) mechanism grounded in spatial coordinates. This spatial IOR mechanism naturally copes with camera motions and inhibits objects that have already been the target of attention. Our approach provides object candidates which can be processed by higher cognitive modules such as object recognition. Since objects are basic elements for many higher level tasks, our architecture can be used as a first layer in any cognitive system that aims at interpreting a stream of images. We show in the evaluation how our framework finds most of the objects in challenging real-world scenes.
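The spatially grounded inhibition of return described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class `SpatialIOR`, the helper `to_world`, the fixed inhibition radius, and the candidate format (a saliency score plus a 3D centroid in camera coordinates) are all assumptions made for illustration. The idea shown is only that attended locations are stored in world coordinates, so an object stays inhibited after the camera moves.

```python
import math


def to_world(point_cam, rotation, translation):
    """Map a camera-frame 3D point into the world frame: R * p + t."""
    return tuple(
        sum(rotation[i][j] * point_cam[j] for j in range(3)) + translation[i]
        for i in range(3)
    )


class SpatialIOR:
    """Hypothetical 3D inhibition-of-return memory (illustrative only)."""

    def __init__(self, radius=0.15):
        self.radius = radius  # assumed inhibition radius, in metres
        self.attended = []    # world-frame points already attended

    def is_inhibited(self, point_world):
        # A location is inhibited if it lies near any previously attended point.
        return any(math.dist(point_world, p) < self.radius for p in self.attended)

    def select(self, candidates, rotation, translation):
        """Pick the most salient non-inhibited candidate.

        candidates: list of (saliency, centroid_in_camera_frame).
        Returns the chosen world-frame point, or None if all are inhibited.
        """
        best = None
        for saliency, centroid_cam in candidates:
            point_world = to_world(centroid_cam, rotation, translation)
            if self.is_inhibited(point_world):
                continue
            if best is None or saliency > best[0]:
                best = (saliency, point_world)
        if best is None:
            return None
        self.attended.append(best[1])  # inhibit this location from now on
        return best[1]
```

Calling `select` once per frame with the current camera pose returns a new target each time until every visible candidate has been attended. A real system would obtain the pose from camera tracking and would likely inhibit whole object volumes rather than single points.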

Pages: 169-182

Page count: 13
