A Comparative Study of Utilizing Topic Models for Information Retrieval

被引:0
|
作者
Yi, Xing [1 ]
Allan, James [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
关键词
Topic Model; Retrieval; Evaluation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We explore the utility of different types of topic models for retrieval purposes. Based on prior work, we describe several ways that topic models can be integrated into the retrieval process. We evaluate the effectiveness of different types of topic models within those retrieval approaches. We show that: (1) topic models are effective for document smoothing; (2) more rigorous topic models such as Latent Dirichlet Allocation provide gains over cluster-based models; (3) more elaborate topic models that capture topic dependencies provide no additional gains; (4) smoothing documents by using their similar documents is as effective as smoothing them by using topic models; (5) doing query expansion should utilize topics discovered in the feedback documents instead of coarse-grained topics from the whole corpus; (6) generally, incorporating topics in the feedback documents for building relevance models can benefit the performance more for queries that have more relevant documents.
引用
收藏
页码:29 / 41
页数:13
相关论文
共 50 条
  • [21] Utilizing temporal information in topic detection and tracking
    Makkonen, J
    Ahonen-Myka, H
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2003, 2769 : 393 - 404
  • [22] Prospecting the Effect of Topic Modeling in Information Retrieval
    Sharaff, Aakanksha
    Dewangan, Jitesh Kumar
    Sisodia, Dilip Singh
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2021, 17 (03) : 18 - 34
  • [23] Personalization Information Retrieval Based on Topic Directory
    Yu, Yangxin
    Zhang, Yizhou
    ADVANCES IN MANUFACTURING SCIENCE AND ENGINEERING, PTS 1-4, 2013, 712-715 : 2659 - +
  • [24] Using Topic Identification in Chinese Information Retrieval
    Yeh, Ching-Long
    Chen, Yi-Chun
    JOURNAL OF INTERNET TECHNOLOGY, 2009, 10 (02): : 95 - 102
  • [25] Topic Models for Comparative Summarization
    Campr, Michal
    Jezek, Karel
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 568 - 574
  • [26] Understanding the topic evolution in a scientific domain: An exploratory study for the field of information retrieval
    Chen, Baitong
    Tsutsui, Satoshi
    Ding, Ying
    Ma, Feicheng
    JOURNAL OF INFORMETRICS, 2017, 11 (04) : 1175 - 1189
  • [27] A comparative study of performance measures for information retrieval systems
    Meng, Xiannong
    Third International Conference on Information Technology: New Generations, Proceedings, 2006, : 578 - 579
  • [28] A Comparative Study of Fuzzy Topic Models and LDA in terms of Interpretability
    Rijcken, Emil
    Scheepers, Floortje
    Mosteiro, Pablo
    Zervanou, Kalliopi
    Spruit, Marco
    Kaymak, Uzay
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [29] Exploiting session context for information retrieval - A comparative study
    Pandey, Caurav
    Luxenburger, Julia
    ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 652 - 657
  • [30] Web Algorithms for Information Retrieval: A Performance Comparative Study
    Frikh, Bouchra
    Ouhbi, Brahim
    INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2014, 6 (01) : 1 - 16