Unsupervised Discovery of Co-occurrence in Sparse High Dimensional Data

被引:23
|
作者
Chum, Ondrej [1 ]
Matas, Jiri [1 ]
机构
[1] Czech Tech Univ, Fac Elec Eng, Dept Cybernet, CMP, CR-16635 Prague, Czech Republic
来源
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年
关键词
D O I
10.1109/CVPR.2010.5539997
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An efficient min-Hash based algorithm for discovery of dependencies in sparse high-dimensional data is presented. The dependencies are represented by sets of features co-occurring with high probability and are called co-ocsets. Sparse high dimensional descriptors, such as bag of words, have been proven very effective in the domain of image retrieval. To maintain high efficiency even for very large data collection, features are assumed independent. We show experimentally that co-ocsets are not rare, i.e. the independence assumption is often violated, and that they may ruin retrieval performance if present in the query image. Two methods for managing co-ocsets in such cases are proposed. Both methods significantly outperform the state-of-the-art in image retrieval, one is also significantly faster.
引用
收藏
页码:3416 / 3423
页数:8
相关论文
共 50 条
  • [41] Co-Occurrence Filter
    Jevnisek, Roy J.
    Avidan, Shai
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3816 - 3824
  • [42] Chlamydia and gonorrhea co-occurrence in a high school population
    Nsuami, M
    Cammarata, CL
    Brooks, BN
    Taylor, SN
    Martin, DH
    SEXUALLY TRANSMITTED DISEASES, 2004, 31 (07) : 424 - 427
  • [43] Unsupervised Measure of Word Similarity: How to Outperform Co-Occurrence and Vector Cosine in VSMs
    Santus, Enrico
    Chiu, Tin-Shing
    Lu, Qin
    Lenci, Alessandro
    Huang, Chu-Ren
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4260 - 4261
  • [44] Context Co-occurrence Based Relationship Prediction in Spatiotemporal Data
    Xu, Caixu
    Yan, Jianfeng
    Yang, Lu
    Xu, Guanggen
    Shi, Hongbin
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTER MODELING, SIMULATION AND ALGORITHM (CMSA 2018), 2018, 151 : 281 - 287
  • [45] Associating expression and genomic data using co-occurrence measures
    Maarten Larmuseau
    Lieven P. C. Verbeke
    Kathleen Marchal
    Biology Direct, 14
  • [46] On Co-occurrence Pattern Discovery from Spatio-temporal Event Stream
    Huo, Jiangtao
    Zhang, Jinzeng
    Meng, Xiaofeng
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT II, 2013, 8181 : 385 - 395
  • [47] Associating expression and genomic data using co-occurrence measures
    Larmuseau, Maarten
    Verbeke, Lieven P. C.
    Marchal, Kathleen
    BIOLOGY DIRECT, 2019, 14 (1)
  • [48] A text semantic topic discovery method based on the conditional co-occurrence degree
    Wei, Wei
    Guo, Chonghui
    NEUROCOMPUTING, 2019, 368 : 11 - 24
  • [49] The textural analysis of gravity data using co-occurrence matrices
    Cooper, GRJ
    COMPUTERS & GEOSCIENCES, 2004, 30 (01) : 107 - 115
  • [50] Hierarchical Topic Model Inference by Community Discovery on Word Co-occurrence Networks
    Austin, Eric
    Trabelsi, Amine
    Largeron, Christine
    Zaiane, Osmar R.
    DATA MINING, AUSDM 2022, 2022, 1741 : 148 - 162