Visual synset: Towards a higher-level visual representation

Authors
Zheng, Yan-Tao [1]
Zhao, Ming [2]
Neo, Shi-Yong [1]
Chua, Tat-Seng [1]
Tian, Qi [3]
Affiliations
[1] Natl Univ Singapore, Singapore 117548, Singapore
[2] Google Inc, Mountain View, CA 94043 USA
[3] Inst Infocomm Res, Singapore, Singapore
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present a higher-level visual representation, the visual synset, for object categorization. The visual synset improves on the traditional bag-of-words representation with better discrimination and invariance power. First, the approach strengthens inter-class discrimination by constructing an intermediate visual descriptor, the delta visual phrase, from frequently co-occurring visual word-sets with similar spatial context. Second, the approach achieves better intra-class invariance by clustering delta visual phrases into visual synsets based on their probabilistic 'semantics', i.e. their class probability distributions. Hence, the resulting visual synsets can partially bridge the visual differences between images of the same class. Tests on the Caltech-101 and Pascal-VOC 05 datasets demonstrate that the proposed image representation can achieve good accuracy.
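The second step in the abstract, grouping delta visual phrases by their class probability distributions, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the greedy grouping rule, the Jensen-Shannon divergence as the similarity measure, and the threshold are all assumptions chosen for illustration, since the abstract does not say which clustering method is used.

```python
import math
from collections import Counter, defaultdict


def class_distributions(occurrences, classes):
    """Estimate P(class | phrase) from (phrase_id, class_label) pairs.

    Hypothetical helper: the abstract only says that phrases are described
    by a class probability distribution, not how it is estimated.
    """
    counts = defaultdict(Counter)
    for phrase, label in occurrences:
        counts[phrase][label] += 1
    dists = {}
    for phrase, counter in counts.items():
        total = sum(counter.values())
        dists[phrase] = [counter[c] / total for c in classes]
    return dists


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two distributions."""
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        # Terms with zero probability in x contribute nothing.
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def group_into_synsets(dists, threshold=0.25):
    """Greedy grouping (an assumed stand-in for the paper's clustering).

    Each phrase joins the first group whose seed distribution lies within
    `threshold`; otherwise it starts a new group.
    """
    synsets = []  # list of (seed_distribution, member_phrase_ids)
    for phrase in sorted(dists):
        for seed, members in synsets:
            if js_divergence(dists[phrase], seed) <= threshold:
                members.append(phrase)
                break
        else:
            synsets.append((dists[phrase], [phrase]))
    return [members for _, members in synsets]
```

Calling `group_into_synsets(class_distributions(pairs, classes))` on a list of (phrase, class) pairs returns lists of phrase ids. Phrases that look different visually but occur in the same classes end up in one group, which is the sense in which a synset can bridge intra-class visual differences.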
Pages: 2094 / +
Page count: 2