A Comparative Study of Utilizing Topic Models for Information Retrieval

Cited by: 0
Authors
Yi, Xing [1]
Allan, James [1]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
Keywords
Topic Model; Retrieval; Evaluation
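DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract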
We explore the utility of different types of topic models for retrieval purposes. Based on prior work, we describe several ways that topic models can be integrated into the retrieval process. We evaluate the effectiveness of different types of topic models within those retrieval approaches. We show that: (1) topic models are effective for document smoothing; (2) more rigorous topic models such as Latent Dirichlet Allocation provide gains over cluster-based models; (3) more elaborate topic models that capture topic dependencies provide no additional gains; (4) smoothing documents with their similar documents is as effective as smoothing them with topic models; (5) query expansion should use topics discovered in the feedback documents rather than coarse-grained topics from the whole corpus; (6) in general, incorporating topics from the feedback documents when building relevance models benefits performance more for queries that have more relevant documents.
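As an illustration of finding (1), the following is a minimal sketch of one common way to smooth a document language model with a topic model: a Dirichlet-smoothed document model is linearly interpolated with a topic-based word probability, sum over z of P(w|z) P(z|D). This is an assumption about the general technique, not necessarily the exact formulation evaluated in the paper. The function name, the parameters `mu` and `lam`, and all toy probabilities are hypothetical; in practice P(w|z) and P(z|D) would come from a trained topic model such as LDA.

```python
from collections import Counter


def smoothed_word_prob(word, doc_tokens, collection_prob,
                       topic_word, doc_topic, mu=1000.0, lam=0.7):
    """Return lam * Dirichlet-smoothed P(w|D) + (1 - lam) * sum_z P(w|z) P(z|D).

    All argument names are hypothetical and chosen for illustration only.
    """
    counts = Counter(doc_tokens)
    # Dirichlet-smoothed document language model.
    dirichlet = (counts[word] + mu * collection_prob.get(word, 0.0)) / (
        len(doc_tokens) + mu
    )
    # Topic-based probability: mix the topic-word distributions by P(z|D).
    topic = sum(p_z * topic_word[z].get(word, 0.0)
                for z, p_z in doc_topic.items())
    return lam * dirichlet + (1.0 - lam) * topic


# Toy inputs (made up for illustration, not taken from the paper).
doc_tokens = ["topic", "model", "retrieval", "model"]
collection_prob = {"topic": 0.2, "model": 0.3,
                   "retrieval": 0.1, "smoothing": 0.4}
topic_word = {
    0: {"model": 0.5, "smoothing": 0.5},   # P(w | z=0)
    1: {"retrieval": 0.6, "topic": 0.4},   # P(w | z=1)
}
doc_topic = {0: 0.5, 1: 0.5}               # P(z | D)

# "smoothing" never occurs in the document, yet it receives probability
# mass from both the collection model and topic 0.
p_unseen = smoothed_word_prob("smoothing", doc_tokens, collection_prob,
                              topic_word, doc_topic, mu=4.0, lam=0.5)
```

The point of the sketch is that a word absent from the document can still receive probability through the topics the document is associated with, which is the intuition behind using topic models for document smoothing.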
Pages: 29-41
Page count: 13
Related papers
50 in total
  • [1] Assessment of the Quality of Topic Models for Information Retrieval Applications
    Yuan, Meng
    Lin, Pauline
    Rashidi, Lida
    Zobel, Justin
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 265 - 274
  • [2] A Spoken Dialogue System for Document Information Retrieval Utilizing Topic Knowledge
    Kiriyama, S., 1600, John Wiley and Sons Inc. (35):
  • [3] Comparative Study of Information Retrieval Models Used in Search Engine
    Khan, Javed Ahmad
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING AND TECHNOLOGY RESEARCH (ICAETR), 2014,
  • [4] Topic Models Ensembles for AD-HOC Information Retrieval
    Ormeno, Pablo
    Mendoza, Marcelo
    Valle, Carlos
    INFORMATION, 2021, 12 (09)
  • [5] Topic based language models for ad hoc information retrieval
    Azzopardi, L
    Girolami, M
    van Rijsbergen, CJ
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 3281 - 3286
  • [6] Local or Global? A Comparative Study on Applications of Embedding Models for Information Retrieval
    Roy, Dwaipayan
    Mitra, Mandar
    Mayr, Philipp
    Chowdhury, Amritap
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 115 - 119
  • [7] Topic Structure for Information Retrieval
    He, Jiyin
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 851 - 851
  • [8] Information retrieval approaches: A comparative study
    Moutaoukkil, Assmaa
    Idarrou, Ali
    Belahyane, Imane
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2022, 13 (10) : 961 - 970
  • [9] COMPARATIVE ESTIMATION OF MODELS OF THE INFORMATION-RETRIEVAL LANGUAGE
    GULNITSKII, LL
    NAUCHNO-TEKHNICHESKAYA INFORMATSIYA SERIYA 2-INFORMATSIONNYE PROTSESSY I SISTEMY, 1982, (06): : 15 - 20
  • [10] A Comparative Study of Topic Models for Topic Clustering of Chinese Web News
    Wu, Yonghui
    Ding, Yuxin
    Wang, Xiaolong
    Xu, Jun
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 5, 2010, : 236 - 240