ChiMed: A Chinese Medical Corpus for Question Answering

被引:0
|
作者
Tian, Yuanhe [1 ]
Ma, Weicheng [2 ]
Xia, Fei [1 ]
Song, Yan [3 ]
机构
[1] Univ Washington, Dept Linguist, Seattle, WA 98195 USA
[2] NYU, Comp Sci Dept, New York, NY 10003 USA
[3] Tencent AI Lab, Bellevue, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering (QA) is a challenging task in natural language processing (NLP), especially when it is applied to specific domains. While models trained in the general domain can be adapted to a new target domain, their performance often degrades significantly due to domain mismatch. Alternatively, one can require a large amount of domain-specific QA data, but such data are rare, especially for the medical domain. In this study, we first collect a large-scale Chinese medical QA corpus called ChiMed; second we annotate a small fraction of the corpus to check the quality of the answers; third, we extract two datasets from the corpus and use them for the relevancy prediction task and the adoption prediction task. Several benchmark models are applied to the datasets, producing good results for both tasks.
引用
收藏
页码:250 / 260
页数:11
相关论文
共 50 条
  • [41] An Integrated Approach for Question Classification in Chinese Cuisine Question Answering System
    Xia, Ling
    Teng, Zhi
    Ren, Fuji
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON UNIVERSAL COMMUNICATION, 2008, : 317 - +
  • [42] Question similarity calculating method towards medical question answering system
    Wan, Fucheng
    Zhang, Dongjiao
    Zhang, Lei
    Zhu, Ao
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 278 - 278
  • [43] A Question-Centric Model for Visual Question Answering in Medical Imaging
    Vu, Minh H.
    Lofstedt, Tommy
    Nyholm, Tufve
    Sznitman, Raphael
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (09) : 2856 - 2868
  • [44] MMQL: Multi-Question Learning for Medical Visual Question Answering
    Chen, Qishen
    Bian, Minjie
    Xu, Huahu
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT V, 2024, 15005 : 480 - 489
  • [45] Corpus Development for Indonesian Consumer-Health Question Answering System
    Hakim, Abid Nurul
    Mahendra, Rahmad
    Adriani, Mirna
    Ekakristi, Adrianus Saga
    2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 222 - 227
  • [46] BioASQ-QA: A manually curated corpus for Biomedical Question Answering
    Anastasia Krithara
    Anastasios Nentidis
    Konstantinos Bougiatiotis
    Georgios Paliouras
    Scientific Data, 10
  • [47] BioASQ-QA: A manually curated corpus for Biomedical Question Answering
    Krithara, Anastasia
    Nentidis, Anastasios
    Bougiatiotis, Konstantinos
    Paliouras, Georgios
    SCIENTIFIC DATA, 2023, 10 (01)
  • [48] Answer Extraction Algorithm of Chinese Question Answering System
    Tang, Zhao-xia
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ENGINEERING (ACSE 2014), 2014, : 130 - 133
  • [49] A SEMANTIC PATTERN FOR RESTRICTED DOMAIN CHINESE QUESTION ANSWERING
    Wang, Zhen-Yu
    Luo, Xiao-Sheng
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 1333 - +
  • [50] ExQuestions: An Expanded Factual Corpus for Question Answering over Knowledge Graphs
    Franco, Wellington
    Franco, Artur O. R.
    Avila, Caio Viktor
    Cabral, Lucas
    Maia, Gilvan
    Pinheiro, Vladia
    Vidal, Vania
    Machado, Javam
    16TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2022), 2022, : 235 - 242