Contrastive Refinement for Dense Retrieval Inference in the Open-Domain Question Answering Task

被引:1
|
作者
Zhai, Qiuhong [1 ]
Zhu, Wenhao [1 ]
Zhang, Xiaoyu [1 ]
Liu, Chenyun [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Municipal Big Data Ctr, Shanghai 200444, Peoples R China
来源
FUTURE INTERNET | 2023年 / 15卷 / 04期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
dense retrieval; pseudo-reference feedback; pseudo-labels; semi-supervised learning;
D O I
10.3390/fi15040137
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, dense retrieval has emerged as the primary method for open-domain question-answering (OpenQA). However, previous research often focused on the query side, neglecting the importance of the passage side. We believe that both the query and passage sides are equally important and should be considered for improved OpenQA performance. In this paper, we propose a contrastive pseudo-labeled data constructed around passages and queries separately. We employ an improved pseudo-relevance feedback (PRF) algorithm with a knowledge-filtering strategy to enrich the semantic information in dense representations. Additionally, we proposed an Auto Text Representation Optimization Model (AOpt) to iteratively update the dense representations. Experimental results demonstrate that our methods effectively optimize dense representations, making them more distinguishable in dense retrieval, thus improving the OpenQA system's overall performance.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Adaptive Information Seeking for Open-Domain Question Answering
    Zhu, Yunchang
    Pang, Liang
    Lan, Yanyan
    Shen, Huawei
    Cheng, Xueqi
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3615 - 3626
  • [22] SPARTA: Efficient Open-Domain Question Answering via Sparse Transformer Matching Retrieval
    Zhao, Tiancheng
    Lu, Xiaopeng
    Lee, Kyusong
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 565 - 575
  • [23] RRQA: reconfirmed reader for open-domain question answering
    Li, Shi
    Zhang, Wenqian
    APPLIED INTELLIGENCE, 2023, 53 (15) : 18420 - 18430
  • [24] ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
    Lee, Chia-Hsuan
    Wang, Shang-Ming
    Chang, Huan-Cheng
    Lee, Hung-Yi
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 949 - 956
  • [25] Query Context Expansion for Open-Domain Question Answering
    Zhu, Wenhao
    Zhang, Xiaoyu
    Ye, Liang
    Zhai, Qiuhong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (08)
  • [26] Adaptive Batch Scheduling for Open-Domain Question Answering
    Choi, Donghyun
    Shin, Myeongcheol
    Kim, Eunggyun
    Shin, Dong Ryeol
    IEEE ACCESS, 2021, 9 : 112097 - 112103
  • [27] RRQA: reconfirmed reader for open-domain question answering
    Shi Li
    Wenqian Zhang
    Applied Intelligence, 2023, 53 : 18420 - 18430
  • [28] Designing an interactive open-domain question answering system
    Quarteroni, S.
    Manandhar, S.
    NATURAL LANGUAGE ENGINEERING, 2009, 15 : 73 - 95
  • [29] CREPE: Open-Domain Question Answering with False Presuppositions
    Yu, Xinyan Velocity
    Min, Sewon
    Zettlemoyer, Luke
    Hajishirzi, Hannaneh
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10457 - 10480
  • [30] Denoising Distantly Supervised Open-Domain Question Answering
    Lin, Yankai
    Ji, Haozhe
    Liu, Zhiyuan
    Sun, Maosong
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1736 - 1745