Visconde: Multi-document QA with GPT-3 and Neural Reranking

被引:8
|
作者
Pereira, Jayr [1 ,2 ]
Fidalgo, Robson [2 ]
Lotufo, Roberto [1 ]
Nogueira, Rodrigo [1 ]
机构
[1] NeuralMind, Campinas, Brazil
[2] Univ Fed Pernambuco, Ctr Informat, Recife, Brazil
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1007/978-3-031-28238-6_44
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a question-answering system that can answer questions whose supporting evidence is spread over multiple (potentially long) documents. The system, called Visconde, uses a three-step pipeline to perform the task: decompose, retrieve, and aggregate. The first step decomposes the question into simpler questions using a few-shot large language model (LLM). Then, a state-of-the-art search engine is used to retrieve candidate passages from a large collection for each decomposed question. In the final step, we use the LLM in a few-shot setting to aggregate the contents of the passages into the final answer. The system is evaluated on three datasets: IIRC, Qasper, and StrategyQA. Results suggest that current retrievers are the main bottleneck and that readers are already performing at the human level as long as relevant passages are provided. The system is also shown to be more effective when the model is induced to give explanations before answering a question. Code is available at https://github.com/neuralmind- ai/visconde.
引用
收藏
页码:534 / 543
页数:10
相关论文
共 14 条
  • [1] MeanSum : A Neural Model for Unsupervised Multi-Document Abstractive Summarization
    Chu, Eric
    Liu, Peter J.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [2] Multi-Task Deep Neural Networks for Multi-Document Reading Comprehension
    Liu, Chang
    Liu, Zhuang
    Lin, Wayne
    Zhao, Jun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] BioinQA: metadata-based multi-document QA system for addressing the issues in biomedical domain
    Mittal, Sparsh
    Gupta, Saket
    Mittal, Ankush
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2013, 5 (01) : 37 - 56
  • [4] Ranking with Recursive Neural Networks and Its Application to Multi-Document Summarization
    Cao, Ziqiang
    Wei, Furu
    Dong, Li
    Li, Sujian
    Zhou, Ming
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2153 - 2159
  • [5] Neural sentence fusion for diversity driven abstractive multi-document summarization
    Fuad, Tanvir Ahmed
    Nayeem, Mir Tafseer
    Mahmud, Asif
    Chali, Yllias
    COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 216 - 230
  • [6] Applying Siamese Hierarchical Attention Neural Networks for multi-document summarization
    Angel Gonzalez, Jose
    Delonca, Julien
    Sanchis, Emilio
    Garcia-Granada, Fernando
    Segarra, Encarna
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2019, (63): : 111 - 118
  • [7] Discriminative Marginalized Probabilistic Neural Method for Multi-Document Summarization of Medical Literature
    Moro, Gianluca
    Ragazzi, Luca
    Valgimigli, Lorenzo
    Freddi, Davide
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 180 - 189
  • [8] Adapting the Neural Encoder-Decoder Framework from Single to Multi-Document Summarization
    Lebanoff, Logan
    Song, Kaiqiang
    Liu, Fei
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4131 - 4141
  • [9] Multi-Document Abstractive Summarization using Chunk-graph and Recurrent Neural Network
    Niu, Jianwei
    Chen, Huan
    Zhao, Qingjuan
    Sun, Limin
    Atiquzzaman, Mohammed
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [10] CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-document Summarization
    Kulkarni, Sayali
    Chammas, Sheide
    Zhu, Wan
    Sha, Fei
    Ie, Eugene
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 84 - 98