Unsupervised Discovery of Co-occurrence in Sparse High Dimensional Data

被引:23
|
作者
Chum, Ondrej [1 ]
Matas, Jiri [1 ]
机构
[1] Czech Tech Univ, Fac Elec Eng, Dept Cybernet, CMP, CR-16635 Prague, Czech Republic
来源
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年
关键词
D O I
10.1109/CVPR.2010.5539997
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An efficient min-Hash based algorithm for discovery of dependencies in sparse high-dimensional data is presented. The dependencies are represented by sets of features co-occurring with high probability and are called co-ocsets. Sparse high dimensional descriptors, such as bag of words, have been proven very effective in the domain of image retrieval. To maintain high efficiency even for very large data collection, features are assumed independent. We show experimentally that co-ocsets are not rare, i.e. the independence assumption is often violated, and that they may ruin retrieval performance if present in the query image. Two methods for managing co-ocsets in such cases are proposed. Both methods significantly outperform the state-of-the-art in image retrieval, one is also significantly faster.
引用
收藏
页码:3416 / 3423
页数:8
相关论文
共 50 条
  • [31] The Trajectory of Scientific Discovery: Concept Co-Occurrence and Converging Semantic Distance
    Cohen, Trevor
    Schvaneveldt, Roger W.
    MEDINFO 2010, PTS I AND II, 2010, 160 : 661 - 665
  • [32] Heterogeneous Transfer Clustering for Partial Co-occurrence Data
    Ye, Xiangyang
    Yang, Liu
    Hu, Qinghua
    Shen, Chenyang
    Jing, Liping
    Du, Zhibin
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1042 - 1049
  • [33] A Monte Carlo permutation test for co-occurrence data
    Balázs Kovács
    Quality & Quantity, 2014, 48 : 955 - 960
  • [34] Using co-occurrence data to determine a thesaurus structure
    Andrews, JE
    Patrick, TB
    Moxley, DE
    Meyer, CM
    Popescu, M
    Sievert, MC
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1998, : 968 - 968
  • [35] Frequent pattern discovery based on co-occurrence frequent item tree
    Hemalatha, R
    Krishnan, A
    Senthamarai, C
    Hemamalini, R
    2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 348 - 354
  • [36] Co-occurrence analysis for discovery of novel breast cancer pathology patterns
    Maskery, Susan M.
    Zhang, Yonghong
    Jordan, Rick M.
    Hu, Hai
    Hooke, Jeffrey A.
    Shriver, Craig D.
    Liebman, Michael N.
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2006, 10 (03): : 497 - 503
  • [37] A Monte Carlo permutation test for co-occurrence data
    Kovacs, Balazs
    QUALITY & QUANTITY, 2014, 48 (02) : 955 - 960
  • [38] Mining a chemical database for fragment co-occurrence:: Discovery of "chemical cliches"
    Lameijer, EW
    Kok, JN
    Bäck, T
    Ijzerman, AP
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) : 553 - 562
  • [39] Discovery of online game user relationship based on co-occurrence of words
    Thawonmas, Ruck
    Konno, Yuki
    Tsuda, Kohei
    ENTERTAINMENT COMPUTING - ICEC 2006, 2006, 4161 : 286 - +
  • [40] The Co-Occurrence of Diseases
    Ohrbach, Richard
    JOURNAL OF ORAL & FACIAL PAIN AND HEADACHE, 2021, 35 (02) : 89 - 91