Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model

被引:61
|
作者
Sampathkumar, Hariprasad [1 ]
Chen, Xue-wen [2 ]
Luo, Bo [1 ]
机构
[1] Univ Kansas, EECS, Lawrence, KS 66045 USA
[2] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
基金
美国国家科学基金会;
关键词
Adverse drug reaction; Pharmacovigilance; Text mining; Machine learning; Online healthcare forums; Hidden Markov model; DISCOVERY; RECORDS; EVENTS; SAFETY;
D O I
10.1186/1472-6947-14-91
中图分类号
R-058 [];
学科分类号
摘要
Background: Adverse Drug Reactions are one of the leading causes of injury or death among patients undergoing medical treatments. Not all Adverse Drug Reactions are identified before a drug is made available in the market. Current post-marketing drug surveillance methods, which are based purely on voluntary spontaneous reports, are unable to provide the early indications necessary to prevent the occurrence of such injuries or fatalities. The objective of this research is to extract reports of adverse drug side-effects from messages in online healthcare forums and use them as early indicators to assist in post-marketing drug surveillance. Methods: We treat the task of extracting adverse side-effects of drugs from healthcare forum messages as a sequence labeling problem and present a Hidden Markov Model(HMM) based Text Mining system that can be used to classify a message as containing drug side-effect information and then extract the adverse side-effect mentions from it. A manually annotated dataset from www. medications. com is used in the training and validation of the HMM based Text Mining system. Results: A 10-fold cross-validation on the manually annotated dataset yielded on average an F-Score of 0.76 from the HMM Classifier, in comparison to 0.575 from the Baseline classifier. Without the Plain Text Filter component as a part of the Text Processing module, the F-Score of the HMM Classifier was reduced to 0.378 on average, while absence of the HTML Filter component was found to have no impact. Reducing the Drug names dictionary size by half, on average reduced the F-Score of the HMM Classifier to 0.359, while a similar reduction to the side-effects dictionary yielded an F-Score of 0.651 on average. Adverse side-effects mined from www. medications. com and www. steadyhealth. com were found to match the Adverse Drug Reactions on the Drug Package Labels of several drugs. In addition, some novel adverse side-effects, which can be potential Adverse Drug Reactions, were also identified. Conclusions: The results from the HMM based Text Miner are encouraging to pursue further enhancements to this approach. The mined novel side-effects can act as early indicators for health authorities to help focus their efforts in post-marketing drug surveillance.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Sharing experience with adverse drug reactions on internet: Which adverse drug reactions involving statins are described by patients on internet-based-forums?
    Kheloufi, F.
    Jean-Pastor, M. J.
    [J]. FUNDAMENTAL & CLINICAL PHARMACOLOGY, 2014, 28 : 56 - 56
  • [32] Ontology-based visualization of healthcare data mined from Online Healthcare Forums
    Sampathkumar, Hariprasad
    Chen, Xue-wen
    Luo, Bo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2015), 2015, : 325 - 334
  • [33] Applying data mining to detection of adverse drug reactions
    Koide, D
    Ohe, K
    [J]. MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 1421 - 1421
  • [34] Causal Association Mining for Detection of Adverse Drug Reactions
    Abin, Deepa
    Mahajan, Tanushree C.
    Bhoj, Manali S.
    Bagde, Swapnil
    Rajeswari, K.
    [J]. 1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 382 - +
  • [35] Online Map Matching Algorithm Using Segment Angle Based on Hidden Markov Model
    Xu, Jie
    Ta, Na
    Xing, Chunxiao
    Zhang, Yong
    [J]. 2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 50 - 55
  • [36] Android resource usage risk assessment using hidden Markov model and online learning
    Rashidi, Bahman
    Fung, Carol
    Bertino, Elisa
    [J]. COMPUTERS & SECURITY, 2017, 65 : 90 - 107
  • [37] Online Degradation Assessment and Adaptive Fault Detection Using Modified Hidden Markov Model
    Lee, Seungchul
    Li, Lin
    Ni, Jun
    [J]. JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2010, 132 (02): : 0210101 - 02101011
  • [38] Online scenario labeling using a hidden Markov model for assessment of nuclear plant state
    Zamalieva, Daniya
    Yilmaz, Alper
    Aldemir, Tunc
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2013, 110 : 1 - 13
  • [39] An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
    Gao, Jun
    Liu, Ninghao
    Lawley, Mark
    Hu, Xia
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2017, 2017
  • [40] A Hidden Markov Model approach to online handwritten signature verification
    Kashi R.
    Hu J.
    Nelson W.L.
    Turin W.
    [J]. International Journal on Document Analysis and Recognition, 1998, 1 (2) : 102 - 109