Character-Level Neural Language Modelling in the Clinical Domain

被引:0
|
作者
Kreuzthaler, Markus [1 ,2 ]
Oleynik, Michel [1 ]
Schulz, Stefan [1 ]
机构
[1] Med Univ Graz, Inst Med Informat Stat & Documentat, Graz, Austria
[2] CBmed GmbH, Ctr Biomarker Res Med, Graz, Austria
来源
关键词
Neural Networks; Electronic Health Records; Natural Language Processing;
D O I
10.3233/SHTI200127
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Word embeddings have become the predominant representation scheme on a token-level for various clinical natural language processing (NLP) tasks. More recently, character-level neural language models, exploiting recurrent neural networks, have again received attention, because they achieved similar performance against various NLP benchmarks. We investigated to what extent character-based language models can be applied to the clinical domain and whether they are able to capture reasonable lexical semantics using this maximally fine-grained representation scheme. We trained a long short-term memory network on an excerpt from a table of de-identified 50-character long problem list entries in German, each of which assigned to an ICD-10 code. We modelled the task as a time series of one-hot encoded single character inputs. After the training phase we accessed the top 10 most similar character-induced word embeddings related to a clinical concept via a nearest neighbour search and evaluated the expected interconnected semantics. Results showed that traceable semantics were captured on a syntactic level above single characters, addressing the idiosyncratic nature of clinical language. The results support recent work on general language modelling that raised the question whether token-based representation schemes are still necessary for specific NLP tasks.
引用
收藏
页码:83 / 87
页数:5
相关论文
共 50 条
  • [1] CHARACTER-LEVEL LANGUAGE MODELING WITH HIERARCHICAL RECURRENT NEURAL NETWORKS
    Hwang, Kyuyeon
    Sung, Wonyong
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5720 - 5724
  • [2] Character-Level Quantum Mechanical Approach for a Neural Language Model
    Wang, Zhihao
    Ren, Min
    Tian, Xiaoyan
    Liang, Xia
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 1613 - 1621
  • [3] Character-Level Quantum Mechanical Approach for a Neural Language Model
    Zhihao Wang
    Min Ren
    Xiaoyan Tian
    Xia Liang
    [J]. International Journal of Computational Intelligence Systems, 2019, 12 : 1613 - 1621
  • [4] Application of Character-Level Language Models in the Domain of Polish Statutory Law
    Smywinski-Pohl, Aleksander
    Wrobel, Krzysztof
    Lasocki, Karol
    Jungiewicz, Michal
    [J]. LEGAL KNOWLEDGE AND INFORMATION SYSTEMS (JURIX 2019), 2019, 322 : 217 - 222
  • [5] Correcting writing errors in Turkish with a character-level neural language model
    Benligiray, Burak
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [6] Character-level Language Modeling with Gated Hierarchical Recurrent Neural Networks
    Choi, Iksoo
    Park, Jinhwan
    Sung, Wonyong
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 411 - 415
  • [7] Character Eyes: Seeing Language through Character-Level Taggers
    Pinter, Yuval
    Marone, Marc
    Eisenstein, Jacob
    [J]. BLACKBOXNLP WORKSHOP ON ANALYZING AND INTERPRETING NEURAL NETWORKS FOR NLP AT ACL 2019, 2019, : 95 - 102
  • [8] Neural Character-Level Syntactic Parsing for Chinese
    Li, Zuchao
    Zhou, Junru
    Zhao, Hai
    Zhang, Zhisong
    Li, Haonan
    Ju, Yuqi
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 73 : 461 - 509
  • [9] Neural Character-Level Syntactic Parsing for Chinese
    Li, Zuchao
    Zhou, Junru
    Zhao, Hai
    Zhang, Zhisong
    Li, Haonan
    Ju, Yuqi
    [J]. Journal of Artificial Intelligence Research, 2022, 73 : 461 - 509
  • [10] Neural Character-Level Dependency Parsing for Chinese
    Li, Haonan
    Zhang, Zhisong
    Ju, Yuqi
    Zhao, Hai
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5205 - 5212