Conditional variational autoencoder for query expansion in ad-hoc information retrieval

被引:1
|
作者
Ou, Wei [1 ]
Huynh, Van-Nam [2 ]
机构
[1] Zhejiang Gongshang Univ, Sch Tourism & Urban Rural Planning, Hangzhou, Zhejiang, Peoples R China
[2] Japan Adv Inst Sci & Technol, Sch Knowledge Sci, Nomi, Ishikawa, Japan
关键词
Information retrieval; Query expansion; Conditional variational autoencoder; Relevance language model;
D O I
10.1016/j.ins.2023.119764
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query expansion (QE) is commonly used to improve the performance of traditional information retrieval (IR) models. With the adoption of deep learning in IR research, neural QE models have emerged in recent years. Many of these models focus on learning embeddings by leveraging query document relevance. These embedding models allow computing semantic similarities between queries and documents to generate expansion terms. However, existing models often ignore query-document interactions. This research aims to address that gap by proposing a QE model using a conditional variational autoencoder. It first maps a query-document pair into a latent space based on their interaction, then estimates an expansion model from that latent space. The proposed model is trained on relevance feedback data and generates expansions using pseudo relevance feedback at test time. The proposed model is evaluated on three standard TREC collections for document ranking: AP and Robust 04 and GOV02, and the MS MARCO dataset for passage ranking. Results show the model outperforms state-of-the-art traditional and neural QE models. It also demonstrates higher additivity with neural matching than baselines.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] A Hybrid Model for Ad-hoc Information Retrieval
    Ye, Zheng
    Huang, Jimmy Xiangji
    Miao, Jun
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1025 - 1026
  • [2] Topic Models Ensembles for AD-HOC Information Retrieval
    Ormeno, Pablo
    Mendoza, Marcelo
    Valle, Carlos
    [J]. INFORMATION, 2021, 12 (09)
  • [3] Document Expansion, Query Translation and Language Modeling for Ad-Hoc IR
    Leveling, Johannes
    Zhou, Dong
    Jones, Gareth J. F.
    Wade, Vincent
    [J]. MULTILINGUAL INFORMATION ACCESS EVALUATION I: TEXT RETRIEVAL EXPERIMENTS, 2010, 6241 : 58 - +
  • [4] LEVERAGING NEURAL NETWORK PHRASE EMBEDDING MODEL FOR QUERY REFORMULATION IN AD-HOC BIOMEDICAL INFORMATION RETRIEVAL
    P., Amol
    Tiwari, Ashish
    [J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2021, 34 (02) : 151 - 170
  • [5] Neural Ad-Hoc Retrieval Meets Open Information Extraction
    Vo, Duc-Thuan
    Zarrinkalam, Fattane
    Pham, Ba
    Arabzadeh, Negar
    Salamat, Sara
    Bagheri, Ebrahim
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II, 2023, 13981 : 655 - 663
  • [6] A Simple Enhancement for Ad-hoc Information Retrieval via Topic Modelling
    Jian, Fanghong
    Huang, Jimmy Xiangji
    Zhao, Jiashu
    He, Tingting
    Hu, Po
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 733 - 736
  • [7] A study on the use of stemming for monolingual ad-hoc Portuguese information retrieval
    Orengo, Viviane Moreira
    Buriol, Luciana S.
    Coelho, Alexandre Ramos
    [J]. EVALUATION OF MULTILINGUAL AND MULTI-MODAL INFORMATION RETRIEVAL, 2007, 4730 : 91 - +
  • [8] MIRACLE Progress in Monolingual Information Retrieval at Ad-Hoc CLEF 2007
    Gonzalez-Cristobal, Jose-Carlos
    Goni-Menoyo, Jose Miguel
    Villena-Roman, Julio
    Lana-Serrano, Sara
    [J]. ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 156 - +
  • [9] Metallogenic-Factor Variational Autoencoder for Geochemical Anomaly Detection by Ad-Hoc and Post-Hoc Interpretability Algorithms
    Luo, Zijing
    Zuo, Renguang
    Xiong, Yihui
    Zhou, Bao
    [J]. NATURAL RESOURCES RESEARCH, 2023, 32 (03) : 835 - 853
  • [10] Metallogenic-Factor Variational Autoencoder for Geochemical Anomaly Detection by Ad-Hoc and Post-Hoc Interpretability Algorithms
    Zijing Luo
    Renguang Zuo
    Yihui Xiong
    Bao Zhou
    [J]. Natural Resources Research, 2023, 32 : 835 - 853