Adaptive document clustering based on query-based similarity

Cited by: 6
Authors
Na, Seung-Hoon [1]
Kang, In-Su [1]
Lee, Jong-Hyeok [1]
Affiliation
[1] Pohang Univ Sci & Technol, Div Elect & Comp Engn, Pohang 790784, South Korea
Keywords
adaptive document clustering; query-based similarity; cluster-based retrieval; language modeling approach
DOI
10.1016/j.ipm.2006.08.008
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In information retrieval, cluster-based retrieval is a well-known approach to resolving the problem of term mismatch. Clustering requires similarity information between documents, which is difficult to compute in a feasible amount of time. The adaptive document clustering scheme has been investigated by researchers to resolve this problem. However, its theoretical basis has not been fully explored. In this regard, we provide a conceptual view of adaptive document clustering based on query-based similarities, by regarding the user's query as a concept. As a result, the adaptive document clustering scheme can be viewed as an approximation of this similarity. Based on this idea, we derive three new query-based similarity measures in the language modeling framework and evaluate them in the context of cluster-based retrieval, comparing them with K-means clustering and full document expansion. The evaluation results show that retrieval based on query-based similarities significantly improves on the baseline, while being comparable to the other methods. This implies that the newly developed query-based similarities are feasible criteria for adaptive document clustering. (c) 2006 Elsevier Ltd. All rights reserved.
Pages: 887-901
Page count: 15