ChiMed: A Chinese Medical Corpus for Question Answering

被引:0
|
作者
Tian, Yuanhe [1 ]
Ma, Weicheng [2 ]
Xia, Fei [1 ]
Song, Yan [3 ]
机构
[1] Univ Washington, Dept Linguist, Seattle, WA 98195 USA
[2] NYU, Comp Sci Dept, New York, NY 10003 USA
[3] Tencent AI Lab, Bellevue, WA USA
来源
SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2019) | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) is a challenging task in natural language processing (NLP), especially when it is applied to specific domains. While models trained in the general domain can be adapted to a new target domain, their performance often degrades significantly due to domain mismatch. Alternatively, one can require a large amount of domain-specific QA data, but such data are rare, especially for the medical domain. In this study, we first collect a large-scale Chinese medical QA corpus called ChiMed; second we annotate a small fraction of the corpus to check the quality of the answers; third, we extract two datasets from the corpus and use them for the relevancy prediction task and the adoption prediction task. Several benchmark models are applied to the datasets, producing good results for both tasks.
引用
收藏
页码:250 / 260
页数:11
相关论文
共 50 条
  • [21] LiteratureQA: A Question Answering Corpus with Graph Knowledge on Academic Literature
    Wang, Haiwen
    Zhou, Le
    Zhang, Weinan
    Wang, Xinbing
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4623 - 4632
  • [22] Chinese Event Extraction Using Question Answering
    Liu, Zeyi
    Yu, Wenhua
    Hong, Zhiyong
    Ke, Guanzhou
    Tan, Rongjie
    Computer Engineering and Applications, 2024, 59 (02) : 153 - 160
  • [23] Question Recommendation in Medical Community-Based Question Answering
    Cai, Hong
    Yan, Cuiting
    Yin, Airu
    Zhao, Xuesong
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 228 - 236
  • [24] Creating the DISEQuA corpus:: A test set for multilingual question answering
    Magnini, B
    Romagnoli, S
    Vallin, A
    Herrera, J
    Peñas, A
    Peinado, V
    Verdejo, F
    de Rijke, M
    COMPARATIVE EVALUATION OF MULTILINGUAL INFORMATION ACCESS SYSTEMS, 2003, 3237 : 487 - 500
  • [25] The design and implementation of chinese question and answering system
    Meng, IH
    Yang, WP
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2003, PT 1, PROCEEDINGS, 2003, 2667 : 601 - 613
  • [26] CQuAE: A new Contextualized QUestion Answering corpus on Education domain
    Gerald, Thomas
    Tamames, Louis
    Ettayeb, Sofiane
    Le, Ha-Quang
    Paroubek, Patrick
    Vilnat, Anne
    DATA & KNOWLEDGE ENGINEERING, 2024, 151
  • [27] Information extraction supported Chinese question answering
    Yu, Jiangde
    Li, Xueyu
    Wang, Lei
    Journal of Computational Information Systems, 2008, 4 (06): : 2599 - 2606
  • [28] A Corpus for Visual Question Answering Annotated with Frame Semantic Information
    Alizadeh, Mehrdad
    Di Eugenio, Barbara
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5524 - 5531
  • [29] Improving Question Analysis for Arabic Question Answering in the Medical Domain
    Dardour, Sondes
    Fehri, Hela
    Haddar, Kais
    COMPUTACION Y SISTEMAS, 2022, 26 (03): : 1233 - 1241
  • [30] A Chinese Question Answering System for Specific Domain
    Li, Tanche
    Hao, Yu
    Zhu, Xiaoyan
    Zhang, Xian
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 590 - 601