UniGen: A Unified Generative Framework for Retrieval and Question Answering with Large Language Models

被引:0
|
作者
Li, Xiaoxi [1 ]
Zhou, Yujia
Dou, Zhicheng
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative information retrieval, encompassing two major tasks of Generative Document Retrieval (GDR) and Grounded Answer Generation (GAR), has gained significant attention in the area of information retrieval and natural language processing. Existing methods for GDR and GAR rely on separate retrieval and reader modules, which hinder simultaneous optimization. To overcome this, we present UniGen, a Unified Generative framework for retrieval and question answering that integrates both tasks into a single generative model leveraging the capabilities of large language models. UniGen employs a shared encoder and two distinct decoders for generative retrieval and question answering. To facilitate the learning of both tasks, we introduce connectors, generated by large language models, to bridge the gaps between query inputs and generation targets, as well as between document identifiers and answers. Furthermore, we propose an iterative enhancement strategy that leverages generated answers and retrieved documents to iteratively improve both tasks. Through extensive experiments on the MS MARCO and NQ datasets, we demonstrate the effectiveness of UniGen, showcasing its superior performance in both the retrieval and the question answering tasks.
引用
收藏
页码:8688 / 8696
页数:9
相关论文
共 50 条
  • [41] Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
    Hu, Zhongjian
    Yang, Peng
    Liu, Fengyuan
    Meng, Yuan
    Liu, Xingyu
    BIG DATA MINING AND ANALYTICS, 2024, 7 (03): : 843 - 857
  • [42] On the Question of Authorship in Large Language Models
    Soos, Carlin
    Haroutunian, Levon
    KNOWLEDGE ORGANIZATION, 2024, 51 (02): : 83 - 95
  • [43] Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models
    Pan, Junting
    Lin, Ziyi
    Ge, Yuying
    Zhu, Xiatian
    Zhang, Renrui
    Wang, Yi
    Qiao, Yu
    Li, Hongsheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 272 - 283
  • [44] Journal policy on large language generative models
    Sessler, Daniel I.
    Turan, Alparslan
    JOURNAL OF CLINICAL ANESTHESIA, 2024, 96
  • [45] Generative Relevance Feedback with Large Language Models
    Mackie, Iain
    Chatterjee, Shubham
    Dalton, Jeffrey
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2026 - 2031
  • [46] IRQAS: Information retrieval and question answering system based on a unified logical-linguistic model
    Sembok, Tengku M. T.
    Zaman, Halimah Badioze
    Kadir, Rabiah Abdul
    ADVANCES ON ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING AND DATA BASES, PROCEEDINGS, 2008, : 460 - +
  • [47] UNIQORN: Unified question answering over RDF knowledge graphs and natural language text
    Pramanik, Soumajit
    Alabi, Jesujoba
    Roy, Rishiraj Saha
    Weikum, Gerhard
    JOURNAL OF WEB SEMANTICS, 2024, 83
  • [49] How Can We Know When Language Models Know? On the Calibration of Language Models for Question Answering
    Jiang, Zhengbao
    Araki, Jun
    Ding, Haibo
    Neubig, Graham
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 (09) : 962 - 977
  • [50] QUESTION ANSWERING SYSTEM ON MATHEMATICAL-MODELS (QAS) - DESCRIPTION OF LANGUAGE
    KONOPASEK, M
    PAPACONSTADOPOULOS, C
    COMPUTER LANGUAGES, 1978, 3 (03): : 145 - 155