Vocabulary Modifications for Domain-adaptive Pretraining of Clinical Language Models

被引:0
|
作者
Lamproudis, Anastasios [1 ]
Henriksson, Aron [1 ]
Dalianis, Hercules [1 ]
机构
[1] Stockholm Univ, Dept Comp & Syst Sci, Stockholm, Sweden
关键词
Natural Language Processing; Language Models; Domain-adaptive Pretraining; Clinical Text; Swedish;
D O I
10.5220/0010893800003123
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Research has shown that using generic language models - specifically, BERT models - in specialized domains may be sub-optimal due to domain differences in language use and vocabulary. There are several techniques for developing domain-specific language models that leverage the use of existing generic language models, including continued and domain-adaptive pretraining with in-domain data. Here, we investigate a strategy based on using a domain-specific vocabulary, while leveraging a generic language model for initialization. The results demonstrate that domain-adaptive pretraining, in combination with a domain-specific vocabulary - as opposed to a general-domain vocabulary - yields improvements on two downstream clinical NLP tasks for Swedish. The results highlight the value of domain-adaptive pretraining when developing specialized language models and indicate that it is beneficial to adapt the vocabulary of the language model to the target domain prior to continued, domain-adaptive pretraining of a generic language model.
引用
收藏
页码:180 / 188
页数:9
相关论文
共 50 条
  • [1] Domain-Adaptive Pretraining Methods for Dialogue Understanding
    Wu, Han
    Xu, Kun
    Song, Linfeng
    Jin, Lifeng
    Zhang, Haisong
    Song, Linqi
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 665 - 669
  • [2] Predicting Email and Article Clickthroughs with Domain-adaptive Language Models
    Jaidka, Kokil
    Goyal, Tanya
    Chhaya, Niyati
    [J]. WEBSCI'18: PROCEEDINGS OF THE 10TH ACM CONFERENCE ON WEB SCIENCE, 2018, : 177 - 184
  • [3] Improving prediction performance of general protein language model by domain-adaptive pretraining on DNA-binding protein
    Zeng, Wenwu
    Dou, Yutao
    Pan, Liangrui
    Xu, Liwen
    Peng, Shaoliang
    [J]. NATURE COMMUNICATIONS, 2024, 15 (01)
  • [4] General and Domain-adaptive Chinese Spelling Check with Error-consistent Pretraining
    Lv, Qi
    Cao, Ziqiang
    Geng, Lei
    Ai, Chunhui
    Yan, Xu
    Fu, Guohong
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (05)
  • [5] Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
    Wang, Boxin
    Ping, Wei
    Xiao, Chaowei
    Xu, Peng
    Patwary, Mostofa
    Shoeybi, Mohammad
    Li, Bo
    Anandkumar, Anima
    Catanzaro, Bryan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Generalized Domain-Adaptive Dictionaries
    Shekhar, Sumit
    Patel, Vishal M.
    Nguyen, Hien V.
    Chellappa, Rama
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 361 - 368
  • [7] Towards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken Language
    Farzana, Shahla
    Parde, Natalie
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11965 - 11978
  • [8] A Domain-adaptive Pre-training Approach for Language Bias Detection in News
    Krieger, Jan-David
    Spinde, Timo
    Ruas, Terry
    Kulshrestha, Juhi
    Gipp, Bela
    [J]. 2022 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2022,
  • [9] APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning
    Sanyal, Soumya
    Xu, Yichong
    Wang, Shuohang
    Yang, Ziyi
    Pryzant, Reid
    Yu, Wenhao
    Zhu, Chenguang
    Ren, Xiang
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 6308 - 6321
  • [10] Robust domain-adaptive discriminant analysis
    Kouw, Wouter M.
    Loog, Marco
    [J]. PATTERN RECOGNITION LETTERS, 2021, 148 : 107 - 113