Utilizing Different Word Representation Methods for Twitter Data in Adverse Drug Reactions Extraction

被引:0
|
作者
Lin, Wei-San [1 ]
Dai, Hong-Jie [2 ]
Jonnagaddala, Jitendra [3 ]
Chang, Nai-Wun [4 ]
Jue, Toni Rose [5 ]
Iqbal, Usman [6 ]
Shao, Joni Yu-Hsuan [6 ]
Chiang, I-Jen [6 ]
Li, Yu-Chuan [6 ]
机构
[1] Taipei Med Univ, Grad Inst Biomed Informat, Taipei, Taiwan
[2] Natl Taitung Univ, Dept Comp Sci & Informat Engn, Taitung, Taiwan
[3] Univ New South Wales, Sch Publ Hlth & Community Med, Sydney, NSW, Australia
[4] Natl Taiwan Univ, Grad Inst Biomed Elect & Bioinformat, Taipei, Taiwan
[5] Univ New South Wales, Prince Wales Clin Sch, Sydney, NSW, Australia
[6] Taipei Med Univ, Coll Med Sci & Technol, Grad Inst Biomed Informat, Taipei, Taiwan
关键词
adverse drug reactions; named entity recognition; word embedding; social media; natural language processing; SOCIAL MEDIA; RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the advancement of technology and development of social media, patients discuss medications and other related information including adverse drug reactions (ADRs) with their friends, family or other patients. Although, there are various pros and cons of using social media for automatic ADR monitoring, information on social media provided by patients about drugs are widely considered a valuable resource for post-marketing drug surveillance. In this study, we developed a named entity recognition (NER) system based on conditional random fields to identify ADRs-related information from Twitter data. The representation of words for the input text is one of the crucial steps in supervised learning. Recently, the word vector representation is becoming popular, which uses unlabeled data to provide a generalization for reducing the data sparsity in word representation. This study examines different word representation methods for the ADR recognition task, including token normalization, and two state-of-the-art word embedding methods, namely word2vec and the global vectors (GloVe). The experimental results demonstrate that all of the studied representation scheme can improve the recall rate and overall F-measure with the cost of the reduced precision. The manual analysis of the generated clusters demonstrates that word2vec has stronger cluster trends compared to GloVe.
引用
收藏
页码:260 / 265
页数:6
相关论文
共 50 条
  • [1] Classifying adverse drug reactions from imbalanced twitter data
    Dai, Hong-Jie
    Wang, Chen-Kai
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 129 (122-132) : 122 - 132
  • [2] Adverse Drug Extraction in Twitter Data using Convolutional Neural Network
    Akhtyamova, Liliya
    Cardiff, John
    Alexandrov, Mikhail
    2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2017, : 88 - 92
  • [3] Detecting Adverse Drug Reactions on Twitter with Convolutional Neural Networks and Word Embedding Features
    Masino A.J.
    Forsyth D.
    Fiks A.G.
    Journal of Healthcare Informatics Research, 2018, 2 (1-2) : 25 - 43
  • [4] Twitter Opinion Mining for Adverse Drug Reactions
    Wu, Liang
    Moh, Teng-Sheng
    Khuri, Natalia
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1570 - 1574
  • [5] Comparison of different methods for causality assessment of adverse drug reactions
    Sapan Kumar Behera
    Saibal Das
    Alphienes Stanley Xavier
    Srinivas Velupula
    Selvarajan Sandhiya
    International Journal of Clinical Pharmacy, 2018, 40 : 903 - 910
  • [6] Comparison of different methods for causality assessment of adverse drug reactions
    Behera, Sapan Kumar
    Das, Saibal
    Xavier, Alphienes Stanley
    Velupula, Srinivas
    Sandhiya, Selvarajan
    INTERNATIONAL JOURNAL OF CLINICAL PHARMACY, 2018, 40 (04) : 903 - 910
  • [7] Twitter can Help to Find Adverse Drug Reactions
    Cieliebak, Mark
    Egger, Dominic
    Uzdilli, Fatih
    ERCIM NEWS, 2016, (104): : 31 - 32
  • [8] Identifying Adverse Drug Reactions by Analyzing Twitter Messages
    Rajapaksha, Parinda
    Weerasinghe, Ruvan
    2015 Fifteenth International Conference on Advances in ICT for Emerging Regions (ICTer), 2015, : 37 - 42
  • [9] Detecting Adverse Drug Reactions from User-Generated Twitter Data: A Case Study
    Shah, Mihir
    Patel, Maitry
    Patel, Priyank
    Tan, Xing
    2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 552 - 558
  • [10] GAR: Graph adversarial representation for adverse drug event detection on Twitter
    Shen, Chen
    Li, Zhiheng
    Chu, Yonghe
    Zhao, Zhongying
    APPLIED SOFT COMPUTING, 2021, 106