Exploring Implicit Image Statistics for Visual Representativeness Modeling

被引:4
|
作者
Sun, Xiaoshuai [1 ]
Wang, Xin-Jing [2 ]
Yao, Hongxun [1 ]
Zhang, Lei [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR.2013.73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a computational model of visual representativeness by integrating cognitive theories of representativeness heuristics with computer vision and machine learning techniques. Unlike previous models that build their representativeness measure based on the visible data, our model takes the initial inputs as explicit positive reference and extend the measure by exploring the implicit negatives. Given a group of images that contains obvious visual concepts, we create a customized image ontology consisting of both positive and negative instances by mining the most related and confusable neighbors of the positive concept in ontological semantic knowledge bases. The representativeness of a new item is then determined by its likelihoods for both the positive and negative references. To ensure the effectiveness of probability inference as well as the cognitive plausibility, we discover the potential prototypes and treat them as an intermediate representation of semantic concepts. In the experiment, we evaluate the performance of representativeness models based on both human judgements and user-click logs of commercial image search engine. Experimental results on both ImageNet and image sets of general concepts demonstrate the superior performance of our model against the state-of-the-arts.
引用
收藏
页码:516 / 523
页数:8
相关论文
共 50 条
  • [41] Modeling Image Composition for Visual Aesthetic Assessment
    Liu, Dong
    Puri, Rohit
    Kamath, Nagendra
    Bhattacharya, Subhabrata
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 320 - 322
  • [42] Exploring visual attention and saliency modeling for task-based visual analysis
    Polatsek, Patrik
    Waldner, Manuela
    Viola, Ivan
    Kapec, Peter
    Benesova, Wanda
    COMPUTERS & GRAPHICS-UK, 2018, 72 : 26 - 38
  • [43] Fixational instability and natural image statistics: Implications for early visual representations
    Rucci, M
    Casile, A
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2005, 16 (2-3) : 121 - 138
  • [44] Early non-linearities in visual coding and natural image statistics
    Brady, N
    Field, DJ
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1997, 38 (04) : 2958 - 2958
  • [45] Image statistics determine the integration of visual cues to motion-in-depth
    Ross Goutcher
    Lauren Murray
    Brooke Benz
    Scientific Reports, 12
  • [46] Low in the forest, high in the city: Visual selection and natural image statistics
    Ossandon, J.
    Acik, A.
    Koenig, P.
    PERCEPTION, 2011, 40 : 95 - 95
  • [47] Image statistics determine the integration of visual cues to motion-in-depth
    Goutcher, Ross
    Murray, Lauren
    Benz, Brooke
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [48] Beyond Employment Rates: Exploring Labor Force Statistics for People With Visual Impairments
    McDonnall, Michele C.
    JOURNAL OF VISUAL IMPAIRMENT & BLINDNESS, 2022, 116 (01) : 5 - 6
  • [49] A comparison of visual statistics for the image enhancement of FORESITE aerial images with those of major image classes
    Jobson, Daniel J.
    Rahman, Zia-ur
    Woodell, Glenn A.
    Hines, Glenn D.
    VISUAL INFORMATION PROCESSING XV, 2006, 6246
  • [50] Social Image Captioning: Exploring Visual Attention and User Attention
    Wang, Leiquan
    Chu, Xiaoliang
    Zhang, Weishan
    Wei, Yiwei
    Sun, Weichen
    Wu, Chunlei
    SENSORS, 2018, 18 (02)