Multimodal Saliency-based Attention for Object-based Scene Analysis

被引:0
|
作者
Schauerte, Boris [1 ]
Kuehn, Benjamin [1 ]
Kroschel, Kristian [2 ]
Stiefelhagen, Rainer [1 ,2 ]
机构
[1] KIT, Inst Anthropomat, Adenauerring 2, D-76131 Karlsruhe, Germany
[2] Syst Technol & Image Exploitat, Fraunhofer Inst Optron, D-76131 Karlsruhe, Germany
关键词
audio-visual saliency; auditory surprise; isophote-based visual proto-objects; parametric 3-D saliency model; object-based inhibition of return; multimodal attention; scene exploration; hierarchical object analysis; overt attention; active perception; VISUAL-ATTENTION; AUDITORY ATTENTION; MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal attention is a key requirement for humanoid robots in order to navigate in complex environments and act as social, cognitive human partners. To this end, robots have to incorporate attention mechanisms that focus the processing on the potentially most relevant stimuli while controlling the sensor orientation to improve the perception of these stimuli. In this paper, we present our implementation of audio-visual saliency-based attention that we integrated in a system for knowledge-driven audio-visual scene analysis and object-based world modeling. For this purpose, we introduce a novel isophote-based method for proto-object segmentation of saliency maps, a surprise-based auditory saliency definition, and a parametric 3-D model for multimodal saliency fusion. The applicability of the proposed system is demonstrated in a series of experiments.
引用
收藏
页码:1173 / 1179
页数:7
相关论文
共 50 条
  • [1] A model of saliency-based visual attention for rapid scene analysis
    Itti, L
    Koch, C
    Niebur, E
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) : 1254 - 1259
  • [2] Multimodal Saliency-based Attention: A Lazy Robot's Approach
    Kuhn, Benjamin
    Schauerte, Boris
    Kroschel, Kristian
    Stiefelhagen, Rainer
    [J]. 2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 807 - 814
  • [3] A model of space and object-based attention for visual saliency
    Zhong, Jingjing
    Luo, Siwei
    [J]. ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 237 - +
  • [4] Contour Grouping and Object-Based Attention with Saliency Maps
    Zhong, Jingjing
    Luo, Siwei
    Wang, Jiao
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (12): : 2531 - 2534
  • [5] New evidences of visual saliency impact on object-based attention
    Urban, F.
    Follet, B.
    [J]. PERCEPTION, 2011, 40 : 152 - 152
  • [6] Saliency-based dual-attention network for unsupervised video object segmentation
    Guifang Zhang
    Hon-Cheng Wong
    [J]. The Journal of Supercomputing, 2024, 80 (4) : 4996 - 5010
  • [7] Saliency-based dual-attention network for unsupervised video object segmentation
    Zhang, Guifang
    Wong, Hon-Cheng
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 4996 - 5010
  • [8] Saliency-Based Image Object Indexing and Retrieval
    Lam, Yat Hong Jacky
    Yayilgan, Sule Yildirim
    [J]. IMAGE ANALYSIS AND RECOGNITION (ICIAR 2018), 2018, 10882 : 269 - 277
  • [9] Contextual uncertainty of visual scene modulates object-based attention
    Luo, Ting
    Fu, Shimin
    [J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2023, 76 (01): : 44 - 53
  • [10] Learning saliency-based visual attention: A review
    Zhao, Qi
    Koch, Christof
    [J]. SIGNAL PROCESSING, 2013, 93 (06) : 1401 - 1407