SESAME - self-supervised framework for extractive question answering over document collections

被引:0
|
作者
Batista, Vitor A. [1 ,2 ]
Gomes, Diogo S. M. [2 ]
Evsukoff, Alexandre [1 ]
机构
[1] Fed Univ Rio Janeiro, PEC Coppe, POB 68506, BR-21941972 Rio De Janeiro, RJ, Brazil
[2] PETROBRAS SA, Rua Gen Canabarro, 500, Rio De Janeiro, RJ, Brazil
关键词
Question answering; NLP; Neural networks; Transformers; LLM;
D O I
10.1007/s10844-024-00869-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question Answering is one of the most relevant areas in the field of Natural Language Processing, rapidly evolving with promising results due to the increasing availability of suitable datasets and the advent of new technologies, such as Generative Models. This article introduces SESAME, a Self-supervised framework for Extractive queStion Answering over docuMent collEctions. SESAME aims to enhance open-domain question answering systems (ODQA) by leveraging domain adaptation with synthetic datasets, enabling efficient question answering over private document collections with low resource usage. The framework incorporates recent advances with large language models, and an efficient hybrid method for context retrieval. We conducted several sets of experiments with the Machine Reading for Question Answering (MRQA) 2019 Shared Task datasets, FAQuAD - a Brazilian Portuguese reading comprehension dataset, Wikipedia, and Retrieval-Augmented Generation Benchmark, to demonstrate SESAME's effectiveness. The results indicate that SESAME's domain adaptation using synthetic data significantly improves QA performance, generalizes across different domains and languages, and competes with or surpasses state-of-the-art systems in ODQA. Finally, SESAME is an open-source tool, and all code, datasets and experimental data are available for public use in our repository.
引用
收藏
页码:1725 / 1747
页数:23
相关论文
共 50 条
  • [31] Textbook Question Answering with Multi-modal Context Graph Understanding and Self-supervised Open-set Comprehension
    Kim, Daesik
    Kim, Seonhoon
    Kwak, Nojun
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3568 - 3584
  • [32] Self-supervised Pre-training and Semi-supervised Learning for Extractive Dialog Summarization
    Zhuang, Yingying
    Song, Jiecheng
    Sadagopan, Narayanan
    Beniwal, Anurag
    [J]. COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 1069 - 1076
  • [33] Self-Supervised and Controlled Multi-Document Opinion Summarization
    Elsahar, Hady
    Coavoux, Maximin
    Galle, Matthias
    Rozen, Jos
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1646 - 1662
  • [34] Self-supervised Segment Contrastive Learning for Medical Document Representation
    Abro, Waheed Ahmed
    Kteich, Hanane
    Bouraoui, Zied
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PT I, AIME 2024, 2024, 14844 : 312 - 321
  • [35] Self-supervised scientific document recommendation based on contrastive learning
    Shicheng Tan
    Tao Zhang
    Shu Zhao
    Yanping Zhang
    [J]. Scientometrics, 2023, 128 : 5027 - 5049
  • [36] Self-supervised scientific document recommendation based on contrastive learning
    Tan, Shicheng
    Zhang, Tao
    Zhao, Shu
    Zhang, Yanping
    [J]. SCIENTOMETRICS, 2023, 128 (09) : 5027 - 5049
  • [37] An Improved Self-Supervised Framework for Feature Point Detection
    Wu, Yunhui
    Li, Jun
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [38] A Self-Supervised Decision Fusion Framework for Building Detection
    Senaras, Caglar
    Vural, Fatos T. Yarman
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (05) : 1780 - 1791
  • [39] Cross-Modal self-supervised vision language pre-training with multiple objectives for medical visual question answering
    Liu, Gang
    He, Jinlong
    Li, Pengfei
    Zhao, Zixu
    Zhong, Shenjun
    [J]. Journal of Biomedical Informatics, 2024, 160
  • [40] A weakly-supervised extractive framework for sentiment-preserving document summarization
    Yun Ma
    Qing Li
    [J]. World Wide Web, 2019, 22 : 1401 - 1425