Training Question Answering Models From Synthetic Data

Cited by: 0
Authors
Puri, Raul [1]
Spring, Ryan [2]
Shoeybi, Mohammad [1]
Patwary, Mostofa [1]
Catanzaro, Bryan [1]
Affiliations
[1] NVIDIA, Santa Clara, CA 95051 USA
[2] Rice Univ, Houston, TX USA
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Question and answer generation is a data augmentation method that aims to improve question answering (QA) models when human-labeled data is limited. However, a considerable gap remains between synthetic and human-generated question-answer pairs. This work aims to narrow this gap by taking advantage of large language models, and it explores several factors: model size, quality of pretrained models, scale of synthesized data, and algorithmic choices. On the SQuAD1.1 question answering task, we achieve higher accuracy using solely synthetic questions and answers than when using the SQuAD1.1 training set questions alone. Removing access to real Wikipedia data, we synthesize questions and answers from a synthetic text corpus generated by an 8.3-billion-parameter GPT-2 model and achieve 88.4 Exact Match (EM) and 93.9 F1 score on the SQuAD1.1 dev set. We further apply our methodology to SQuAD2.0 and show a 2.8-point absolute gain in EM score compared to prior work using synthetic data.
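The Exact Match (EM) and F1 figures quoted in the abstract are the usual SQuAD-style answer metrics. As a rough illustration only, the sketch below shows how such scores are commonly computed for a single prediction against a single reference answer. It follows the widely used SQuAD v1.1 evaluation convention (lowercasing, stripping punctuation and English articles, token-overlap F1) and is not taken from the paper; the function names `normalize_answer`, `exact_match`, and `f1_score` are illustrative assumptions.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    if not pred_tokens or not ref_tokens:
        # Assumption: two empty answers count as a match, otherwise a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Dataset-level scores such as those reported above are typically obtained by taking, for each question, the maximum score over all reference answers and then averaging over questions, expressed as a percentage.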
Pages: 5811-5826
Page count: 16