Contrastive Refinement for Dense Retrieval Inference in the Open-Domain Question Answering Task

被引:1
|
作者
Zhai, Qiuhong [1 ]
Zhu, Wenhao [1 ]
Zhang, Xiaoyu [1 ]
Liu, Chenyun [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Shanghai Municipal Big Data Ctr, Shanghai 200444, Peoples R China
来源
FUTURE INTERNET | 2023年 / 15卷 / 04期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
dense retrieval; pseudo-reference feedback; pseudo-labels; semi-supervised learning;
D O I
10.3390/fi15040137
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, dense retrieval has emerged as the primary method for open-domain question-answering (OpenQA). However, previous research often focused on the query side, neglecting the importance of the passage side. We believe that both the query and passage sides are equally important and should be considered for improved OpenQA performance. In this paper, we propose a contrastive pseudo-labeled data constructed around passages and queries separately. We employ an improved pseudo-relevance feedback (PRF) algorithm with a knowledge-filtering strategy to enrich the semantic information in dense representations. Additionally, we proposed an Auto Text Representation Optimization Model (AOpt) to iteratively update the dense representations. Experimental results demonstrate that our methods effectively optimize dense representations, making them more distinguishable in dense retrieval, thus improving the OpenQA system's overall performance.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Dense Hierarchical Retrieval for Open-Domain Question Answering
    Liu, Ye
    Hashimoto, Kazuma
    Zhou, Yingbo
    Yavuz, Semih
    Xiong, Caiming
    Yu, Philip S.
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 188 - 200
  • [2] Dense Passage Retrieval for Open-Domain Question Answering
    Karpukhin, Vladimir
    Oguz, Barlas
    Min, Sewon
    Lewis, Patrick
    Wu, Ledell
    Edunov, Sergey
    Chen, Danqi
    Yih, Wen Tau
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6769 - 6781
  • [3] Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering
    Cheng, Hao
    Fang, Hao
    Liu, Xiaodong
    Gao, Jianfeng
    [J]. 61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1864 - 1875
  • [4] Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering
    Li, Minghan
    Li, Ming
    Xiong, Kun
    Lin, Jimmy
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 274 - 287
  • [5] RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
    Qu, Yingqi
    Ding, Yuchen
    Liu, Jing
    Liu, Kai
    Ren, Ruiyang
    Zhao, Wayne Xin
    Dong, Daxiang
    Wu, Hua
    Wang, Haifeng
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5835 - 5847
  • [6] Efficient Passage Retrieval with Hashing for Open-domain Question Answering
    Yamada, Ikuya
    Asai, Akari
    Hajishirzi, Hannaneh
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 979 - 986
  • [7] Generation-Augmented Retrieval for Open-Domain Question Answering
    Mao, Yuning
    He, Pengcheng
    Liu, Xiaodong
    Shen, Yelong
    Gao, Jianfeng
    Han, Jiawei
    Chen, Weizhu
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4089 - 4100
  • [8] Advances in open-domain question answering
    Zhang, Zhi-Chang
    Zhang, Yu
    Liu, Ting
    Li, Sheng
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (05): : 1058 - 1069
  • [9] Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering
    Xiong, Wenhan
    Wang, Hong
    Wang, William Yang
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2803 - 2815
  • [10] Multi-Hop Paragraph Retrieval for Open-Domain Question Answering
    Feldman, Yair
    El-Yaniv, Ran
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2296 - 2309