Domain adaptation for semantic role labeling of clinical text

Cited by: 14
Authors
Zhang, Yaoyun [1]
Tang, Buzhou [1, 2]
Jiang, Min [1]
Wang, Jingqi [1]
Xu, Hua [1]
Affiliations
[1] Univ Texas Houston, Sch Biomed Informat Houston, Houston, TX 77030 USA
[2] Shenzhen Grad Sch, Harbin Inst Technol, Dept Comp Sci, Shenzhen, Guangdong, Peoples R China
Keywords
semantic role labeling; shallow semantic parsing; clinical natural language processing; domain adaptation; transfer learning; biomedical literature; annotated corpus; information; extraction; knowledge; system
DOI
10.1093/jamia/ocu048
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: Semantic role labeling (SRL), which extracts a shallow semantic relation representation from the different surface textual forms of free-text sentences, is important for understanding natural language. Few SRL studies have been conducted in the medical domain, primarily owing to the lack of annotated clinical SRL corpora, which are time-consuming and costly to build. The goal of this study is to investigate domain adaptation techniques for clinical SRL that leverage resources built from newswire and biomedical literature, in order to improve performance and save annotation costs.
Materials and Methods: Multisource Integrated Platform for Answering Clinical Questions (MiPACQ), a manually annotated clinical SRL corpus, was used as the target domain dataset. PropBank and NomBank from newswire and BioProp from biomedical literature were used as source domain datasets. Three state-of-the-art domain adaptation algorithms were employed: instance pruning, transfer self-training, and feature augmentation. SRL performance with the different domain adaptation algorithms was evaluated using 10-fold cross-validation on the MiPACQ corpus. Learning curves for the different methods were generated to assess the effect of sample size.
Results and Conclusion: When all three source domain corpora were used, the feature augmentation algorithm achieved a statistically significantly higher F-measure (83.18%) than the baseline trained on the MiPACQ dataset alone (F-measure, 81.53%), indicating that domain adaptation algorithms may improve SRL performance on clinical text. To reach performance comparable to that of the baseline method using 90% of the MiPACQ training samples, the feature augmentation algorithm required < 50% of the MiPACQ training samples, demonstrating that the annotation costs of clinical SRL can be reduced significantly by leveraging existing SRL resources from other domains.
Pages: 967-979
Number of pages: 13
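
Illustrative sketch (not part of the record): the abstract names feature augmentation as the best-performing adaptation method but does not give its formulation. One widely used formulation, assumed here, copies every feature into a shared "general" version and a domain-specific version, so that a single classifier can learn both cross-domain and domain-specific weights. The feature names, the domain labels, and the `augment` helper below are hypothetical and are not taken from the paper.

```python
from typing import Dict

GENERAL = "general"  # prefix for the copy shared by all domains (hypothetical name)


def augment(features: Dict[str, float], domain: str) -> Dict[str, float]:
    """Return a shared copy plus a domain-specific copy of every feature."""
    augmented: Dict[str, float] = {}
    for name, value in features.items():
        augmented[f"{GENERAL}::{name}"] = value   # visible to every domain
        augmented[f"{domain}::{name}"] = value    # visible to this domain only
    return augmented


# Hypothetical sparse features for one candidate argument in each domain.
newswire_example = augment({"head=patient": 1.0, "path=NP^S_VP": 1.0}, "newswire")
clinical_example = augment({"head=patient": 1.0, "voice=passive": 1.0}, "clinical")

# Only the shared copies can overlap across domains; this is where
# source-domain evidence can influence target-domain predictions.
shared_keys = set(newswire_example) & set(clinical_example)
print(sorted(shared_keys))
```

With several source corpora, as in the study's setting of three sources and one target, the same mapping would simply be applied with one domain label per corpus; a linear classifier trained on the augmented vectors can then weight the shared and domain-specific copies separately.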