UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models

被引:0
|
作者
Li, Xiaoxi [1 ]
Zhou, Yujia
Dou, Zhicheng
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative information retrieval, encompassing two major tasks of Generative Document Retrieval (GDR) and Grounded Answer Generation (GAR), has gained significant attention in the area of information retrieval and natural language processing. Existing methods for GDR and GAR rely on separate retrieval and reader modules, which hinder simultaneous optimization. To overcome this, we present UniGen, a Unified Generative framework for retrieval and question answering that integrates both tasks into a single generative model leveraging the capabilities of large language models. UniGen employs a shared encoder and two distinct decoders for generative retrieval and question answering. To facilitate the learning of both tasks, we introduce connectors, generated by large language models, to bridge the gaps between query inputs and generation targets, as well as between document identifiers and answers. Furthermore, we propose an iterative enhancement strategy that leverages generated answers and retrieved documents to iteratively improve both tasks. Through extensive experiments on the MS MARCO and NQ datasets, we demonstrate the effectiveness of UniGen, showcasing its superior performance in both the retrieval and the question answering tasks.
引用
收藏
页码:8688 / 8696
页数:9
相关论文
共 50 条
  • [21] Large Language Models for Scientific Question Answering: An Extensive Analysis of the SciQA Benchmark
    Lehmann, Jens
    Meloni, Antonello
    Motta, Enrico
    Osborne, Francesco
    Recupero, Diego Reforgiato
    Salatino, Angelo Antonio
    Vandati, Sahar
    SEMANTIC WEB, PT I, ESWC 2024, 2024, 14664 : 199 - 217
  • [22] Finetuning Language Models for Multimodal Question Answering
    Zhang, Xin
    Xie, Wen
    Dai, Ziqi
    Rao, Jun
    Wen, Haokun
    Luo, Xuan
    Zhang, Meishan
    Zhang, Min
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9420 - 9424
  • [23] Retrieval-Augmented Generation Approach: Document Question Answering using Large Language Model
    Muludi, Kurnia
    Fitria, Kaira Milani
    Triloka, Joko
    Sutedi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 776 - 785
  • [24] UniRaG: Unification, Retrieval, and Generation for Multimodal Question Answering With Pre-Trained Language Models
    Lim, Qi Zhi
    Lee, Chin Poo
    Lim, Kian Ming
    Samingan, Ahmad Kamsani
    IEEE ACCESS, 2024, 12 : 71505 - 71519
  • [25] Advancing Faithfulness of Large Language Models in Goal-Oriented Dialogue Question Answering
    Sticha, Abigail
    Braunschweiler, Norbert
    Doddipatla, Rama
    Knill, Kate
    PROCEEDINGS OF THE 6TH CONFERENCE ON ACM CONVERSATIONAL USER INTERFACES, CUI 2024, 2024,
  • [26] Review of Research Progress on Question-Answering Techniques Based on Large Language Models
    Wen, Sen
    Qian, Li
    Hu, Maodi
    Chang, Zhijun
    Data Analysis and Knowledge Discovery, 2024, 8 (06) : 16 - 29
  • [27] Integrating Video Retrieval and Moment Detection in a Unified Corpus for Video Question Answering
    Luo, Hongyin
    Mohtarami, Mitra
    Glass, James
    Krishnanzurthy, Karthik
    Richardson, Brigitte
    INTERSPEECH 2019, 2019, : 599 - 603
  • [28] A Unified Framework for Multilingual and Code-Mixed Visual Question Answering
    Gupta, Deepak
    Lenka, Pabitra
    Ekbal, Asif
    Bhattacharyya, Pushpak
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 900 - 913
  • [29] Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts
    Lan, Yunshi
    Li, Xiang
    Liu, Xin
    Li, Yang
    Qin, Wei
    Qian, Weining
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4389 - 4400
  • [30] UniMEL: A Unified Framework for Multimodal Entity Linking with Large Language Models
    Liu, Qi
    He, Yongyi
    Xu, Tong
    Lian, Defu
    Liu, Che
    Zheng, Zhi
    Chen, Enhong
    International Conference on Information and Knowledge Management, Proceedings, : 1909 - 1919