UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models

Cited by: 0
Authors
Li, Xiaoxi [1]
Zhou, Yujia
Dou, Zhicheng
Affiliation
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
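Funding
National Natural Science Foundation of China;
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes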
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Generative information retrieval, encompassing two major tasks of Generative Document Retrieval (GDR) and Grounded Answer Generation (GAR), has gained significant attention in the area of information retrieval and natural language processing. Existing methods for GDR and GAR rely on separate retrieval and reader modules, which hinder simultaneous optimization. To overcome this, we present UniGen, a Unified Generative framework for retrieval and question answering that integrates both tasks into a single generative model leveraging the capabilities of large language models. UniGen employs a shared encoder and two distinct decoders for generative retrieval and question answering. To facilitate the learning of both tasks, we introduce connectors, generated by large language models, to bridge the gaps between query inputs and generation targets, as well as between document identifiers and answers. Furthermore, we propose an iterative enhancement strategy that leverages generated answers and retrieved documents to iteratively improve both tasks. Through extensive experiments on the MS MARCO and NQ datasets, we demonstrate the effectiveness of UniGen, showcasing its superior performance in both the retrieval and the question answering tasks.
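The iterative enhancement strategy mentioned in the abstract can be pictured with a small toy loop. This is only a sketch under stated assumptions, not the authors' implementation: the abstract does not say how generated answers and retrieved documents are fed back, so the `[answer]` and `[docs]` concatenation format, the function names, the toy corpus, and the word-overlap retriever are all hypothetical stand-ins for the single generative model with a shared encoder and two decoders.

```python
from typing import Callable, List, Tuple

# Hypothetical toy corpus: document identifier -> document text.
CORPUS = {
    "d1": "paris is the capital of france",
    "d2": "berlin is the capital of germany",
}


def toy_retrieve(text: str) -> List[str]:
    """Stand-in for the retrieval decoder: rank docids by word overlap."""
    words = set(text.lower().split())
    return sorted(
        CORPUS,
        key=lambda d: (-len(words & set(CORPUS[d].split())), d),
    )


def toy_read(text: str) -> str:
    """Stand-in for the QA decoder: answer from the first docid in the input."""
    for token in text.split():
        if token in CORPUS:
            return CORPUS[token].split()[0]
    return "unknown"


def iterative_enhancement(
    query: str,
    retrieve: Callable[[str], List[str]],
    read: Callable[[str], str],
    rounds: int = 2,
) -> Tuple[List[str], str]:
    """Alternate the two tasks, feeding each one the other's previous output."""
    docids = retrieve(query)
    answer = read(query)
    for _ in range(rounds):
        # Assumed input format: both inputs are built from the previous round.
        retrieval_input = f"{query} [answer] {answer}"
        qa_input = f"{query} [docs] {' '.join(docids)}"
        docids = retrieve(retrieval_input)
        answer = read(qa_input)
    return docids, answer


if __name__ == "__main__":
    print(iterative_enhancement("capital of france", toy_retrieve, toy_read))
```

With zero rounds the toy reader has no document to draw on and returns a placeholder; after one round it can use the retrieved identifiers, which is the kind of mutual improvement the abstract describes at a high level.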
Pages: 8688-8696
Page count: 9