SESAME - self-supervised framework for extractive question answering over document collections

被引：0

作者：

Batista, Vitor A. ^{[1
,2
]}

Gomes, Diogo S. M. ^{[2
]}

Evsukoff, Alexandre ^{[1
]}

机构：

[1] Fed Univ Rio Janeiro, PEC Coppe, POB 68506, BR-21941972 Rio De Janeiro, RJ, Brazil

[2] PETROBRAS SA, Rua Gen Canabarro, 500, Rio De Janeiro, RJ, Brazil

来源：

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS | 2024年

关键词：

Question answering; NLP; Neural networks; Transformers; LLM;

D O I：

10.1007/s10844-024-00869-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Question Answering is one of the most relevant areas in the field of Natural Language Processing, rapidly evolving with promising results due to the increasing availability of suitable datasets and the advent of new technologies, such as Generative Models. This article introduces SESAME, a Self-supervised framework for Extractive queStion Answering over docuMent collEctions. SESAME aims to enhance open-domain question answering systems (ODQA) by leveraging domain adaptation with synthetic datasets, enabling efficient question answering over private document collections with low resource usage. The framework incorporates recent advances with large language models, and an efficient hybrid method for context retrieval. We conducted several sets of experiments with the Machine Reading for Question Answering (MRQA) 2019 Shared Task datasets, FAQuAD - a Brazilian Portuguese reading comprehension dataset, Wikipedia, and Retrieval-Augmented Generation Benchmark, to demonstrate SESAME's effectiveness. The results indicate that SESAME's domain adaptation using synthetic data significantly improves QA performance, generalizes across different domains and languages, and competes with or surpasses state-of-the-art systems in ODQA. Finally, SESAME is an open-source tool, and all code, datasets and experimental data are available for public use in our repository.

引用

页码：1725 / 1747

页数：23

共 50 条

[31] Textbook Question Answering with Multi-modal Context Graph Understanding and Self-supervised Open-set Comprehension
Kim, Daesik
Kim, Seonhoon
Kwak, Nojun
[J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3568 - 3584
[32] Self-supervised Pre-training and Semi-supervised Learning for Extractive Dialog Summarization
Zhuang, Yingying
Song, Jiecheng
Sadagopan, Narayanan
Beniwal, Anurag
[J]. COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 1069 - 1076
[33] Self-Supervised and Controlled Multi-Document Opinion Summarization
Elsahar, Hady
Coavoux, Maximin
Galle, Matthias
Rozen, Jos
[J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1646 - 1662
[34] Self-supervised Segment Contrastive Learning for Medical Document Representation
Abro, Waheed Ahmed
Kteich, Hanane
Bouraoui, Zied
[J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PT I, AIME 2024, 2024, 14844 : 312 - 321
[35] Self-supervised scientific document recommendation based on contrastive learning
Shicheng Tan
Tao Zhang
Shu Zhao
Yanping Zhang
[J]. Scientometrics, 2023, 128 : 5027 - 5049
[36] Self-supervised scientific document recommendation based on contrastive learning
Tan, Shicheng
Zhang, Tao
Zhao, Shu
Zhang, Yanping
[J]. SCIENTOMETRICS, 2023, 128 (09) : 5027 - 5049
[37] An Improved Self-Supervised Framework for Feature Point Detection
Wu, Yunhui
Li, Jun
[J]. ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
[38] A Self-Supervised Decision Fusion Framework for Building Detection
Senaras, Caglar
Vural, Fatos T. Yarman
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (05) : 1780 - 1791
[39] Cross-Modal self-supervised vision language pre-training with multiple objectives for medical visual question answering
Liu, Gang
He, Jinlong
Li, Pengfei
Zhao, Zixu
Zhong, Shenjun
[J]. Journal of Biomedical Informatics, 2024, 160
[40] A weakly-supervised extractive framework for sentiment-preserving document summarization
Yun Ma
Qing Li
[J]. World Wide Web, 2019, 22 : 1401 - 1425

← 1 2 3 4 5 →