Discovery of collocation patterns: from visual words to visual phrases

被引:82
|
作者
Yuan, Junsong [1 ]
Wu, Ying [1 ]
Yang, Ming [1 ]
机构
[1] Northwestern Univ, Dept EECS, 2145 Sheridan Rd, Evanston, IL 60208 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2007.383222
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
引用
收藏
页码:1930 / +
页数:2
相关论文
共 50 条
  • [41] Visual Recognition of Permuted Words
    Rashid, Sheikh Faisal
    Shafait, Faisal
    Breuel, Thomas M.
    HUMAN VISION AND ELECTRONIC IMAGING XV, 2010, 7527
  • [42] From dynamical emerging patterns to patterns in visual art
    Bucolo, M.
    Buscarino, A.
    Fortuna, L.
    Frasca, M.
    Xibilia, M. G.
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2008, 18 (01): : 51 - 81
  • [43] PICTURES AND WORDS IN VISUAL SEARCH
    PAIVIO, A
    BEGG, I
    MEMORY & COGNITION, 1974, 2 (03) : 515 - 521
  • [44] Visual recognition of objects and words
    Engelkamp, J
    SPRACHE & KOGNITION, 1995, 14 (04): : 174 - 192
  • [45] NOTE ON VISUAL RECOGNITION OF WORDS
    SMITH, OW
    LANDY, F
    PERCEPTUAL AND MOTOR SKILLS, 1969, 29 (01) : 83 - &
  • [46] Visual Navigation using Place Recognition with Visual Line Words
    Kim, Yong Nyeon
    Kol, Dong Wook
    Suh, Il Hong
    2014 11TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2014, : 676 - 676
  • [47] Visual mismatch negativity elicited by semantic violations in visual words
    Hu, Axu
    Gu, Feng
    Wong, Lena L. N.
    Tong, Xiuli
    Zhang, Xiaochu
    BRAIN RESEARCH, 2020, 1746
  • [48] Visual content representation using semantically similar visual words
    Kesorn, Kraisak
    Chimlek, Sutasinee
    Poslad, Stefan
    Piamsa-nga, Punpiti
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) : 11472 - 11481
  • [49] A quantitative evaluation of the conceptual consistency of visual words and visual vocabularies
    Stommel, M.
    Herzog, O.
    Xu, W. L.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 28 : 120 - 129
  • [50] Visual-Effect Dictionary for Converting Words into Visual Images
    Hirai, Shogo
    Sumi, Kaoru
    ENTERTAINMENT COMPUTING - ICEC 2017, 2018, 10507 : 177 - 182