Learning structured medical information from social media

被引:4
|
作者
Hasan, Abul [1 ]
Levene, Mark [1 ]
Weston, David [1 ]
机构
[1] Birkbeck Univ London, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
关键词
Social media mining; Medical concept extraction; Pharmacovigilance; Conditional random fields; Semi-supervised algorithm; ADVERSE DRUG-REACTIONS;
D O I
10.1016/j.jbi.2020.103568
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Our goal is to summarise and aggregate information from social media regarding the symptoms of a disease, the drugs used and the treatment effects both positive and negative. To achieve this we first apply a supervised machine learning method to automatically extract medical concepts from natural language text. In an environment such as social media, where new data is continuously streamed, we need a methodology that will allow us to continuously train with the new data. To attain such incremental re-training, a semi-supervised methodology is developed, which is capable of learning new concepts from a small set of labelled data together with the much larger set of unlabelled data. The semi-supervised methodology deploys a conditional random field (CRF) as the base-line training algorithm for extracting medical concepts. The methodology iteratively augments to the training set sentences having high confidence, and adds terms to existing dictionaries to be used as features with the base-line model for further classification. Our empirical results show that the base-line CRF performs strongly across a range of different dictionary and training sizes; when the base-line is built with the full training data the F, score reaches the range 84%-90%. Moreover, we show that the semi-supervised method produces a mild but significant improvement over the base-line. We also discuss the significance of the potential improvement of the semi-supervised methodology and found that it is significantly more accurate in most cases than the underlying base-line model.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Leveraging Machine Learning and Semi-Structured Information to Identify Political Views from Social Media Posts
    Olteanu, Adriana
    Cernian, Alexandra
    Gaga, Sebastian-Augustin
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [2] The space for social media in structured online learning
    Salmon, Gilly
    Ross, Bella
    Pechenkina, Ekaterina
    Chase, Anne-Marie
    [J]. RESEARCH IN LEARNING TECHNOLOGY, 2015, 23 (01) : 1 - 14
  • [3] A Prescriptive Approach For Structured Information Extraction From Web Forums And Social Media
    Cumberland, Ethan
    Day, Tony
    [J]. 2021 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROLS (ISCSIC 2021), 2021, : 95 - 101
  • [4] Rumor Detection on Social Media with Graph Structured Adversarial Learning
    Yang, Xiaoyu
    Lyu, Yuefei
    Tian, Tian
    Liu, Yifei
    Liu, Yudong
    Zhang, Xi
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1417 - 1423
  • [5] Social Media for Medical and Health Information: Malaysian Medical Tourism Hospital
    Timan, Hazila
    Kama, Nazri
    Yusoff, Rasimah Che Mohd
    Selamat, Ali
    [J]. NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 143 - 156
  • [6] Leveraging social media for medical education: Learning from patients in online spaces
    Giroux, Catherine M.
    Moreau, Katherine A.
    [J]. MEDICAL TEACHER, 2020, 42 (09) : 970 - 972
  • [7] Detecting Traffic Information From Social Media Texts With Deep Learning Approaches
    Chen, Yuanyuan
    Lv, Yisheng
    Wang, Xiao
    Li, Lingxi
    Wang, Fei-Yue
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (08) : 3049 - 3058
  • [8] Structured Information Extraction from Medical Texts in Bulgarian
    Boytcheva, Svetla
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2012, 12 (04) : 52 - 65
  • [9] A Method Extracting Task-related Information from Social Media based on Structured Domain Knowledge
    Link, Daniel
    Horita, Flavio E. A.
    de Albuquerque, Joao Porto
    Hellingrath, Bernd
    Ghasemivandhonaryar, Shabdiz
    [J]. AMCIS 2015 PROCEEDINGS, 2015,
  • [10] Medical information and social media in the time of COVID-19
    Mulrennan, Siobhain
    Colt, Henri
    [J]. RESPIROLOGY, 2020, 25 (06) : 578 - 579