ChiMed: A Chinese Medical Corpus for Question Answering

被引:0
|
作者
Tian, Yuanhe [1 ]
Ma, Weicheng [2 ]
Xia, Fei [1 ]
Song, Yan [3 ]
机构
[1] Univ Washington, Dept Linguist, Seattle, WA 98195 USA
[2] NYU, Comp Sci Dept, New York, NY 10003 USA
[3] Tencent AI Lab, Bellevue, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) is a challenging task in natural language processing (NLP), especially when it is applied to specific domains. While models trained in the general domain can be adapted to a new target domain, their performance often degrades significantly due to domain mismatch. Alternatively, one can require a large amount of domain-specific QA data, but such data are rare, especially for the medical domain. In this study, we first collect a large-scale Chinese medical QA corpus called ChiMed; second we annotate a small fraction of the corpus to check the quality of the answers; third, we extract two datasets from the corpus and use them for the relevancy prediction task and the adoption prediction task. Several benchmark models are applied to the datasets, producing good results for both tasks.
引用
收藏
页码:250 / 260
页数:11
相关论文
共 50 条
  • [1] A Chinese Question Answering System in Medical Domain
    Feng G.
    Du Z.
    Wu X.
    Journal of Shanghai Jiaotong University (Science), 2018, 23 (5) : 678 - 683
  • [2] A Chinese Question Answering System in Medical Domain
    冯郭飞
    杜智康
    武星
    Journal of Shanghai Jiaotong University(Science), 2018, 23 (05) : 678 - 683
  • [3] emrQA: A Large Corpus for Question Answering on Electronic Medical Records
    Pampari, Anusri
    Raghavan, Preethi
    Liang, Jennifer
    Peng, Jian
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2357 - 2368
  • [4] Knowledge Corpus Error in Question Answering
    Lee, Yejoon
    Oh, Philhoon
    Thorne, James
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 9183 - 9197
  • [5] A Chinese Medical Question Answering System Based on Knowledge Graph
    Zhou, Chengyang
    Guan, Renchu
    Zhao, Chuntao
    Chai, Gonglei
    Wang, Leigang
    Han, Xiaosong
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (BIGDATASE 2021), 2021, : 28 - 33
  • [6] A Corpus for Hybrid Question Answering Systems
    Grau, Brigitte
    Ligozat, Anne-Laure
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1081 - 1086
  • [7] A system for Chinese question answering
    Huang, GT
    Yao, HH
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 458 - 461
  • [8] Corpus-based question classification in question answering systems
    Tomas, David
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 155 - 156
  • [9] Applying deep matching networks to Chinese medical question answering: a study and a dataset
    Junqing He
    Mingming Fu
    Manshu Tu
    BMC Medical Informatics and Decision Making, 19
  • [10] Applying deep matching networks to Chinese medical question answering: a study and a dataset
    He, Junqing
    Fu, Mingming
    Tu, Manshu
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 2)