Cross-Lingual Transfer Learning for Medical Named Entity Recognition

被引:2
|
作者
Ding, Pengjie [1 ,2 ]
Wang, Lei [2 ]
Liang, Yaobo [3 ]
Lu, Wei [1 ]
Li, Linfeng [2 ,4 ]
Wang, Chun [6 ]
Tang, Buzhou [5 ]
Yan, Jun [2 ]
机构
[1] Renmin Univ China, Sch Informat & DEKE, Beijing, Peoples R China
[2] Yidu Cloud Beijing Technol Co Ltd, Beijing, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
[4] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[5] Harbin Inst Technol, Shenzhen, Peoples R China
[6] China Med Univ, Dept Cardiac Surg, Hosp 1, Shenyang, Peoples R China
关键词
Transfer learning; Cross-lingual pretraining; Word embedding alignment; Medical terminology systems; Medical NER; INFORMATION EXTRACTION; SYSTEM;
D O I
10.1007/978-3-030-59410-7_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extensive technologies have been employed to explore a best way for cross-lingual transfer learning. In medical domain, Named Entity Recognition is pivotal for many downstream tasks, such as medical entity linking and clinical decision support systems. Nevertheless, the lack of annotation limits the applicability in many languages without enough labeled data. To alleviate this issue and make use of languages with sufficient annotated data, we find a new way to obtain medical parallel corpus from medical terminology systems and knowledge bases and propose a methodology which combines cross-lingual language model pretraining and bilingual word embedding alignment with the help of the parallel corpus. Moreover, our combined architecture which maintains the framework of pretrained model can not only be used for NER task but also other downstream NLP tasks. Experiments demonstrated that incorporating Chinese and English medical data can effectively improve the performance for an English medical NER dataset (i2b2).
引用
下载
收藏
页码:403 / 418
页数:16
相关论文
共 50 条
  • [1] Cross-lingual Transfer Learning for Japanese Named Entity Recognition
    Johnson, Andrew
    Karanasou, Penny
    Gaspers, Judith
    Klakow, Dietrich
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES(NAACL HLT 2019), VOL. 2 (INDUSTRY PAPERS), 2019, : 182 - 189
  • [2] Cross-lingual Named Entity Recognition
    Steinberger, Ralf
    Pouliquen, Bruno
    LINGUISTICAE INVESTIGATIONES, 2007, 30 (01): : 135 - 162
  • [3] Analyzing transfer learning impact in biomedical cross-lingual named entity recognition and normalization
    Renzo M. Rivera-Zavala
    Paloma Martínez
    BMC Bioinformatics, 22
  • [4] Data augmentation and transfer learning for cross-lingual Named Entity Recognition in the biomedical domain
    Lancheros, Brayan Stiven
    Pastor, Gloria Corpas
    Mitkov, Ruslan
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [5] Analyzing transfer learning impact in biomedical cross-lingual named entity recognition and normalization
    Rivera-Zavala, Renzo M.
    Martinez, Paloma
    BMC BIOINFORMATICS, 2021, 22 (SUPPL 1)
  • [6] WASSERSTEIN CROSS-LINGUAL ALIGNMENT FOR NAMED ENTITY RECOGNITION
    Wang, Rui
    Henao, Ricardo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8342 - 8346
  • [7] Cross-Lingual Named Entity Recognition for Heterogenous Languages
    Fu, Yingwen
    Lin, Nankai
    Chen, Boyu
    Yang, Ziyu
    Jiang, Shengyi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 371 - 382
  • [8] Neural Cross-Lingual Named Entity Recognition with Minimal Resources
    Xie, Jiateng
    Yang, Zhilin
    Neubig, Graham
    Smith, Noah A.
    Carbonell, Jaime
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 369 - 379
  • [9] Zero-Resource Cross-Lingual Named Entity Recognition
    Bari, M. Saiful
    Joty, Shafiq
    Jwalapuram, Prathyusha
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7415 - 7423
  • [10] Improving Low Resource Named Entity Recognition using Cross-lingual Knowledge Transfer
    Feng, Xiaocheng
    Feng, Xiachong
    Qin, Bing
    Feng, Zhangyin
    Liu, Ting
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4071 - 4077