Deep Neural Models for Medical Concept Normalization in User-Generated Texts

被引:0
|
作者
Miftahutdinov, Zulfat [1 ]
Tutubalina, Elena [1 ,2 ]
机构
[1] Kazan Fed Univ, Kazan, Russia
[2] PDMI RAS, Samsung PDMI Joint AI Ctr, St Petersburg, Russia
基金
俄罗斯科学基金会;
关键词
BIOMEDICAL TEXT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we consider the medical concept normalization problem, i.e., the problem of mapping a health-related entity mention in a free-form text to a concept in a controlled vocabulary, usually to the standard thesaurus in the Unified Medical Language System (UMLS). This is a challenging task since medical terminology is very different when coming from health care professionals or from the general public in the form of social media texts. We approach it as a sequence learning problem with powerful neural networks such as recurrent neural networks and contextualized word representation models trained to obtain semantic representations of social media expressions. Our experimental evaluation over three different benchmarks shows that neural architectures leverage the semantic meaning of the entity mention and significantly outperform an existing state of the art models.
引用
收藏
页码:393 / 399
页数:7
相关论文
共 50 条
  • [1] Medical Concept Normalization for Online User-Generated Texts
    Lee, Kathy
    Hasan, Sadid A.
    Farri, Oladimeji
    Choudhary, Alok
    Agrawal, Ankit
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, : 462 - 469
  • [2] Lexical Normalization of User-Generated Medical Forum Data
    Dirkson, Anne
    Verberne, Suzan
    Kraaij, Wessel
    [J]. SOCIAL MEDIA MINING FOR HEALTH APPLICATIONS (#SMM4H) WORKSHOP & SHARED TASK, 2019, : 11 - 20
  • [3] User-generated Content Curation with Deep Convolutional Neural Networks
    Tous, Ruben
    Wust, Otto
    Gomez, Mauro
    Poveda, Jonatan
    Elena, Marc
    Torres, Jordi
    Makni, Mouna
    Ayguade, Eduard
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2535 - 2540
  • [4] Multimodular Text Normalization of Dutch User-Generated Content
    Schulz, Sarah
    De Pauw, Guy
    De Clercq, Orphee
    Desmet, Bart
    Hoste, Veronique
    Daelemans, Walter
    Macken, Lieve
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2016, 7 (04)
  • [5] Selection of correction candidates for the normalization of Spanish user-generated content
    Melero, M.
    Costa-Jussa, M. R.
    Lambert, P.
    Quixal, M.
    [J]. NATURAL LANGUAGE ENGINEERING, 2016, 22 (01) : 135 - 161
  • [6] Deep Convolutional Neural Network Based Medical Concept Normalization
    Song, Guojie
    Long, Qingqing
    Luo, Yi
    Wang, Yiming
    Jin, Yilun
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (05) : 1195 - 1208
  • [7] Synthesizing Parallel Data of User-Generated Texts with Zero-Shot Neural Machine Translation
    Marie, Benjamin
    Fujita, Atsushi
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 710 - 725
  • [8] Entity disambiguation with context awareness in user-generated short texts
    Yang, Jiaqi
    Li, Yongjun
    Gao, Congjie
    Dong, Wei
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
  • [9] Additive Regularization for Topic Modeling in Sociological Studies of User-Generated Texts
    Apishev, Murat
    Koltcov, Sergei
    Koltsova, Olessia
    Nikolenko, Sergey
    Vorontsov, Konstantin
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2016, PT I, 2017, 10061 : 169 - 184
  • [10] User-Generated Text Corpus for Evaluating Japanese Morphological Analysis and Lexical Normalization
    Higashiyama, Shohei
    Utiyama, Masao
    Watanabe, Taro
    Sumita, Eiichiro
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5532 - 5541