DWE-Med: Dynamic Word Embeddings for Medical Domain

Cited by: 3
Authors
Jha, Kishlay [1]
Xun, Guangxu [1]
Gopalakrishnan, Vishrawas [2]
Zhang, Aidong [1]
Affiliations
[1] Univ Virginia, Dept Comp Sci, Rice Hall 230,85 Engineers Way, Charlottesville, VA 22904 USA
[2] SUNY Buffalo, Dept Comp Sci & Engn, 338 Davis Hall, Buffalo, NY 14260 USA
Funding
National Science Foundation (USA)
Keywords
Biomedical domain; word embeddings; temporal dynamics
DOI
10.1145/3310254
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recent advances in unsupervised language processing methods have created an opportunity to exploit massive text corpora for developing high-quality vector space representations of words (also known as word embeddings). To this end, practitioners have developed and applied several data-driven embedding models with considerable success. However, a drawback of these models lies in their premise of static context, wherein the meaning of a word is assumed to remain the same over time. This is limiting because the semantic meaning of a concept is known to evolve over time. While such semantic drifts are routinely observed in almost all domains, their effect is acute in domains such as biomedicine, where the semantic meaning of a concept changes relatively fast. To address this, in this study we aim to learn temporally aware vector representations of medical concepts from timestamped text data and, in doing so, provide a systematic approach to formalize the problem. More specifically, we propose a dynamic word embedding model that jointly learns the temporal characteristics of medical concepts and performs across-time alignment. Apart from capturing the evolutionary characteristics in an optimal manner, the model also factors in the implicit medical properties useful for a variety of biomedical applications. Empirical studies conducted on two important biomedical use cases validate the effectiveness of the proposed approach and suggest that the model not only learns quality embeddings but also facilitates intuitive trajectory visualizations.
Pages: 21