The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters

Cited by: 13
Authors
Kurland, Oren [1]
Krikon, Eyal [1]
Affiliations
[1] Technion Israel Inst Technol, Fac Ind Engn & Management, IL-32000 Haifa, Israel
Funding
US National Science Foundation; Israel Science Foundation
Keywords
INFORMATION
DOI
10.1613/jair.3327
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Exploiting information induced from (query-specific) clustering of top-retrieved documents has long been proposed as a means of improving precision at the very top ranks of the returned results. We present a novel language model approach to ranking query-specific clusters by the presumed percentage of relevant documents that they contain. While most previous cluster ranking approaches focus on the cluster as a whole, our model also utilizes information induced from documents associated with the cluster. Our model substantially outperforms previous approaches for identifying clusters containing a high percentage of relevant documents. Furthermore, using the model to produce a document ranking yields precision-at-top-ranks performance that is consistently better than that of the initial ranking upon which clustering is performed. The performance also compares favorably with that of a state-of-the-art pseudo-feedback-based retrieval method.
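The general idea of scoring a cluster using both the cluster as a whole and its member documents can be sketched as follows. This is only an illustrative toy, not the model from the paper: the function names `query_log_likelihood` and `rank_clusters`, the Jelinek-Mercer smoothing, the add-one collection estimate, and the interpolation weights `lam` and `beta` are all assumptions chosen for brevity.

```python
import math
from collections import Counter


def query_log_likelihood(query, text, collection, lam=0.5):
    """Log-likelihood of the query terms under a smoothed unigram model of `text`.

    Assumption: Jelinek-Mercer interpolation with a collection model that uses
    add-one counts so that unseen terms never yield log(0).
    """
    tf, ctf = Counter(text), Counter(collection)
    score = 0.0
    for term in query:
        p_text = tf[term] / len(text) if text else 0.0
        p_coll = (ctf[term] + 1) / (len(collection) + len(ctf))
        score += math.log((1 - lam) * p_text + lam * p_coll)
    return score


def rank_clusters(query, clusters, docs, beta=0.5, lam=0.5):
    """Rank cluster ids by a mix of cluster-level and document-level evidence.

    Hypothetical scoring rule: beta weights the query likelihood of the
    concatenated cluster text; (1 - beta) weights the mean query likelihood
    of the cluster's member documents.
    """
    collection = [t for d in docs.values() for t in d]
    scored = []
    for cid, members in clusters.items():
        whole = [t for m in members for t in docs[m]]
        cluster_score = query_log_likelihood(query, whole, collection, lam)
        doc_score = sum(
            query_log_likelihood(query, docs[m], collection, lam) for m in members
        ) / len(members)
        scored.append((beta * cluster_score + (1 - beta) * doc_score, cid))
    # Highest combined score first.
    return [cid for _, cid in sorted(scored, reverse=True)]
```

Given tokenized documents and a mapping from cluster ids to member document ids, `rank_clusters` returns the cluster ids in descending order of the combined score; setting `beta=1.0` reduces it to a whole-cluster-only ranking for comparison.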
Pages: 367-395
Page count: 29