A comparison of feature combination strategies for saliency-based visual attention systems

Cited by: 77
Authors
Itti, L [1]
Koch, C [1]
Affiliations
[1] CALTECH, Computat & Neural Syst Program, Pasadena, CA 91125 USA
Source
Keywords
attention; saliency; target detection; feature integration; learning
DOI
10.1117/12.348467
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Bottom-up or saliency-based visual attention allows primates to detect non-specific conspicuous targets in cluttered scenes. A classical metaphor, derived from electrophysiological and psychophysical studies, describes attention as a rapidly shiftable "spotlight". The model described here reproduces the attentional scanpaths of this spotlight: simple multi-scale "feature maps" detect local spatial discontinuities in intensity, color, orientation or optical flow, and are combined into a unique "master" or "saliency" map. The saliency map is sequentially scanned, in order of decreasing saliency, by the focus of attention. We study the problem of combining feature maps, from different visual modalities and with unrelated dynamic ranges (such as color and motion), into a unique saliency map. Four combination strategies are compared using three databases of natural color images: (1) simple normalized summation, (2) linear combination with learned weights, (3) global nonlinear normalization followed by summation, and (4) local nonlinear competition between salient locations. Performance was measured as the number of false detections before the most salient target was found. Strategy (1) always yielded the poorest performance and strategy (2) the best, with a 3- to 8-fold improvement in the time to find a salient target. However, (2) yielded specialized systems with poor generalization. Interestingly, strategy (4) and its simplified, computationally efficient approximation (3) yielded significantly better performance than (1), with up to a 4-fold improvement, while preserving generality.
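To make the contrast between strategies (1) and (3) concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the normalization formula, so the weighting used below (each rescaled map is multiplied by the squared difference between its global maximum and the mean of its other local maxima) is an assumed form, and all function names are hypothetical.

```python
import numpy as np

def rescale(fmap):
    """Linearly rescale a feature map to [0, 1]; a flat map becomes all zeros."""
    fmap = np.asarray(fmap, dtype=float)
    span = fmap.max() - fmap.min()
    if span == 0:
        return np.zeros_like(fmap)
    return (fmap - fmap.min()) / span

def local_maxima(fmap):
    """Values of pixels strictly greater than their 4 neighbours.

    The map is padded with -inf so border pixels can also be maxima.
    Plateaus (ties between neighbours) are not handled in this sketch.
    """
    p = np.pad(fmap, 1, mode="constant", constant_values=-np.inf)
    c = p[1:-1, 1:-1]
    mask = ((c > p[:-2, 1:-1]) & (c > p[2:, 1:-1]) &
            (c > p[1:-1, :-2]) & (c > p[1:-1, 2:]))
    return c[mask]

def combine_naive(maps):
    """Strategy (1): bring every map to a common range, then sum."""
    return sum(rescale(m) for m in maps)

def combine_global_nonlinear(maps):
    """Strategy (3), ASSUMED form: weight each map by (M - m_bar)**2.

    M is the map's global maximum after rescaling and m_bar is the mean of
    its remaining local maxima. A map with one dominant peak keeps a weight
    near 1; a map with many comparable peaks is suppressed.
    """
    total = 0.0
    for m in maps:
        r = rescale(m)
        peaks = np.sort(local_maxima(r))
        others = peaks[:-1]  # drop one occurrence of the largest peak
        m_bar = others.mean() if others.size else 0.0
        total = total + r * (r.max() - m_bar) ** 2
    return total

# Toy example: one map holds a single target, the other holds uniform clutter.
target_map = np.zeros((9, 9))
target_map[4, 4] = 5.0
clutter_map = np.zeros((9, 9))
clutter_map[1::2, 1::2] = 200.0  # 16 equally strong peaks, different range

naive = combine_naive([target_map, clutter_map])
nonlinear = combine_global_nonlinear([target_map, clutter_map])
```

In this toy case the naive sum leaves the target tied with every clutter peak, whereas the assumed nonlinear weighting suppresses the cluttered map so the target is the only maximum. That is the qualitative effect the abstract attributes to strategy (3).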
Pages: 473-482
Page count: 10
Related papers
50 in total
  • [1] Feature combination strategies for saliency-based visual attention systems
    Itti, L
    Koch, C
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2001, 10 (01) : 161 - 169
  • [2] Sparse embedding feature combination strategy for saliency-based visual attention system
    Zhao, Cairong
    Liu, Chuancai
    [J]. Journal of Computational Information Systems, 2010, 6 (09) : 2831 - 2838
  • [3] A Novel Feature Fusion Technique in Saliency-Based Visual Attention
    Armanfard, Zeynab
    Bahmani, Hamed
    Nasrabadi, Ali Motie
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS, 2009, : 230 - +
  • [4] Learning saliency-based visual attention: A review
    Zhao, Qi
    Koch, Christof
    [J]. SIGNAL PROCESSING, 2013, 93 (06) : 1401 - 1407
  • [5] A Teleoperation System Utilizing Saliency-Based Visual Attention
    Teng, Wei-Chung
    Kuo, Yi-Ching
    Tara, Rayi Yanu
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 139 - 144
  • [6] Nonlinear Data Fusion in Saliency-Based Visual Attention
    Bahmani, Hamed
    Nasrabadi, Ali Motie
    Gholpayeghani, Mohammad Reza Hashemi
    [J]. 2008 4TH INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 152 - +
  • [7] Pulse Discrete Cosine Transform for Saliency-based Visual Attention
    Yu, Ying
    Wang, Bin
    Zhang, Liming
    [J]. 2009 IEEE 8TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2009, : 41 - 46
  • [8] A model of saliency-based visual attention for rapid scene analysis
    Itti, L
    Koch, C
    Niebur, E
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) : 1254 - 1259
  • [9] A saliency-based search mechanism for overt and covert shifts of visual attention
    Itti, L
    Koch, C
    [J]. VISION RESEARCH, 2000, 40 (10-12) : 1489 - 1506
  • [10] Distorted Low-Level Visual Features Affect Saliency-Based Visual Attention
    Bahmani, Hamed
    Wahl, Siegfried
    [J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2016, 10