Unsupervised Discovery of Co-occurrence in Sparse High Dimensional Data

被引：23

作者：

Chum, Ondrej ^{[1
]}

Matas, Jiri ^{[1
]}

机构：

[1] Czech Tech Univ, Fac Elec Eng, Dept Cybernet, CMP, CR-16635 Prague, Czech Republic

来源：

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年

关键词：

D O I：

10.1109/CVPR.2010.5539997

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An efficient min-Hash based algorithm for discovery of dependencies in sparse high-dimensional data is presented. The dependencies are represented by sets of features co-occurring with high probability and are called co-ocsets. Sparse high dimensional descriptors, such as bag of words, have been proven very effective in the domain of image retrieval. To maintain high efficiency even for very large data collection, features are assumed independent. We show experimentally that co-ocsets are not rare, i.e. the independence assumption is often violated, and that they may ruin retrieval performance if present in the query image. Two methods for managing co-ocsets in such cases are proposed. Both methods significantly outperform the state-of-the-art in image retrieval, one is also significantly faster.

引用

页码：3416 / 3423

页数：8

共 50 条

[31] The Trajectory of Scientific Discovery: Concept Co-Occurrence and Converging Semantic Distance
Cohen, Trevor
Schvaneveldt, Roger W.
MEDINFO 2010, PTS I AND II, 2010, 160 : 661 - 665
[32] Heterogeneous Transfer Clustering for Partial Co-occurrence Data
Ye, Xiangyang
Yang, Liu
Hu, Qinghua
Shen, Chenyang
Jing, Liping
Du, Zhibin
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1042 - 1049
[33] A Monte Carlo permutation test for co-occurrence data
Balázs Kovács
Quality & Quantity, 2014, 48 : 955 - 960
[34] Using co-occurrence data to determine a thesaurus structure
Andrews, JE
Patrick, TB
Moxley, DE
Meyer, CM
Popescu, M
Sievert, MC
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1998, : 968 - 968
[35] Frequent pattern discovery based on co-occurrence frequent item tree
Hemalatha, R
Krishnan, A
Senthamarai, C
Hemamalini, R
2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 348 - 354
[36] Co-occurrence analysis for discovery of novel breast cancer pathology patterns
Maskery, Susan M.
Zhang, Yonghong
Jordan, Rick M.
Hu, Hai
Hooke, Jeffrey A.
Shriver, Craig D.
Liebman, Michael N.
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2006, 10 (03): : 497 - 503
[37] A Monte Carlo permutation test for co-occurrence data
Kovacs, Balazs
QUALITY & QUANTITY, 2014, 48 (02) : 955 - 960
[38] Mining a chemical database for fragment co-occurrence:: Discovery of "chemical cliches"
Lameijer, EW
Kok, JN
Bäck, T
Ijzerman, AP
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) : 553 - 562
[39] Discovery of online game user relationship based on co-occurrence of words
Thawonmas, Ruck
Konno, Yuki
Tsuda, Kohei
ENTERTAINMENT COMPUTING - ICEC 2006, 2006, 4161 : 286 - +
[40] The Co-Occurrence of Diseases
Ohrbach, Richard
JOURNAL OF ORAL & FACIAL PAIN AND HEADACHE, 2021, 35 (02) : 89 - 91

← 1 2 3 4 5 →