Cross-Lingual Transfer Learning for Medical Named Entity Recognition

Cited by: 2
Authors
Ding, Pengjie [1, 2]
Wang, Lei [2]
Liang, Yaobo [3]
Lu, Wei [1]
Li, Linfeng [2, 4]
Wang, Chun [6]
Tang, Buzhou [5]
Yan, Jun [2]
Affiliations
[1] Renmin Univ China, Sch Informat & DEKE, Beijing, Peoples R China
[2] Yidu Cloud Beijing Technol Co Ltd, Beijing, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
[4] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[5] Harbin Inst Technol, Shenzhen, Peoples R China
[6] China Med Univ, Dept Cardiac Surg, Hosp 1, Shenyang, Peoples R China
Keywords
Transfer learning; Cross-lingual pretraining; Word embedding alignment; Medical terminology systems; Medical NER; INFORMATION EXTRACTION; SYSTEM;
DOI
10.1007/978-3-030-59410-7_28
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many techniques have been explored in search of the best approach to cross-lingual transfer learning. In the medical domain, Named Entity Recognition (NER) is pivotal for many downstream tasks, such as medical entity linking and clinical decision support systems. However, the scarcity of annotations limits its applicability in languages that lack sufficient labeled data. To alleviate this issue and make use of languages with sufficient annotated data, we find a new way to obtain a medical parallel corpus from medical terminology systems and knowledge bases, and we propose a methodology that combines cross-lingual language model pretraining with bilingual word embedding alignment, aided by the parallel corpus. Moreover, because our combined architecture maintains the framework of the pretrained model, it can be used not only for NER but also for other downstream NLP tasks. Experiments demonstrate that incorporating Chinese and English medical data can effectively improve performance on an English medical NER dataset (i2b2).
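The abstract does not say how the parallel corpus is derived from terminology systems. One plausible reading is that terms in different languages attached to the same concept are treated as translation pairs. The sketch below illustrates only that idea, under stated assumptions: the function name `build_term_pairs`, the concept identifiers (`C001` and so on), and the sample terms are all hypothetical, and this is not the authors' actual pipeline.

```python
from collections import defaultdict


def build_term_pairs(source_terms, target_terms):
    """Pair terms from two languages that share a concept identifier.

    Each input is a list of (concept_id, term) tuples. The identifiers
    used below are hypothetical placeholders, not real terminology codes.
    """
    # Index target-language terms by their concept identifier.
    by_concept = defaultdict(list)
    for concept_id, term in target_terms:
        by_concept[concept_id].append(term)

    # Emit one (source, target) pair for every shared concept.
    pairs = []
    for concept_id, term in source_terms:
        for target in by_concept.get(concept_id, []):
            pairs.append((term, target))
    return pairs


# Hypothetical sample entries for illustration only.
english_terms = [
    ("C001", "myocardial infarction"),
    ("C001", "heart attack"),
    ("C002", "hypertension"),
    ("C003", "asthma"),  # no Chinese entry, so it yields no pair
]
chinese_terms = [
    ("C001", "心肌梗死"),
    ("C002", "高血压"),
]

pairs = build_term_pairs(english_terms, chinese_terms)
for source, target in pairs:
    print(source, "<->", target)
```

Pairs of this kind could then serve as the bilingual dictionary that word embedding alignment typically requires. Synonyms on the source side simply produce several pairs for the same target term.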
Pages: 403-418
Number of pages: 16
Related papers
50 in total
  • [31] Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts
    Plank, Barbara
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1808 - 1815
  • [32] Discrepancy and Uncertainty Aware Denoising Knowledge Distillation for Zero-Shot Cross-Lingual Named Entity Recognition
    Ge, Ling
    Hu, Chunming
    Ma, Guanghui
    Liu, Jihong
    Zhang, Hong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18056 - 18064
  • [33] Zero-Shot Cross-Lingual Named Entity Recognition via Progressive Multi-Teacher Distillation
    Li, Zhuoran
    Hu, Chunming
    Zhang, Richong
    Chen, Junfan
    Guo, Xiaohui
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2024, 32 : 4617 - 4630
  • [34] Medical Crossing: a Cross-lingual Evaluation of Clinical Entity Linking
    Alekseev, Anton
    Miftahutdinov, Zulfat
    Tutubalina, Elena
    Shelmanov, Artem
    Ivanov, Vladimir
    Kokh, Vladimir
    Nesterov, Alexander
    Avetisian, Manvel
    Chertok, Andrey
    Nikolenko, Sergey
    LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4212 - 4220
  • [35] Choosing Transfer Languages for Cross-Lingual Learning
    Lin, Yu-Hsiang
    Chen, Chian-Yu
    Lee, Jean
    Li, Zirui
    Zhang, Yuyan
    Xia, Mengzhou
    Rijhwani, Shruti
    He, Junxian
    Zhang, Zhisong
    Ma, Xuezhe
    Anastasopoulos, Antonios
    Littell, Patrick
    Neubig, Graham
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3125 - 3135
  • [36] Translation Artifacts in Cross-lingual Transfer Learning
    Artetxe, Mikel
    Labaka, Gorka
    Agirre, Eneko
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7674 - 7684
  • [37] Neural Cross-Lingual Entity Linking
    Sil, Avirup
    Kundu, Gourab
    Florian, Radu
    Hamza, Wael
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5464 - 5472
  • [38] Transfer Learning for Indonesian Named Entity Recognition
    Kosasih, Joshua Aditya
    Khodra, Masayu Leylia
    2018 INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT INFORMATICS (SAIN), 2018, : 173 - 178
  • [39] Unsupervised Cross-lingual Representation Learning for Speech Recognition
    Conneau, Alexis
    Baevski, Alexei
    Collobert, Ronan
    Mohamed, Abdelrahman
    Auli, Michael
    INTERSPEECH 2021, 2021, : 2426 - 2430
  • [40] Zero-Shot Neural Transfer for Cross-Lingual Entity Linking
    Rijhwani, Shruti
    Xie, Jiateng
    Neubig, Graham
    Carbonell, Jaime
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6924 - 6931