Identification of suicidal behavior among psychiatrically hospitalized adolescents using natural language processing and machine learning of electronic health records

被引:57
|
作者
Carson, Nicholas J. [1 ,2 ]
Mullin, Brian [1 ]
Sanchez, Maria Jose [1 ,3 ]
Lu, Frederick [1 ]
Yang, Kelly [1 ,4 ]
Menezes, Michelle [1 ,5 ]
Le Cook, Benjamin [1 ,2 ]
机构
[1] Cambridge Hlth Alliance, Hlth Equ Res Lab, Cambridge, MA 02139 USA
[2] Harvard Med Sch, Dept Psychiat, Boston, MA 02115 USA
[3] George Washington Univ, Prevent & Community Hlth Dept, Milken Sch Publ Hlth, Washington, DC USA
[4] Albert Einstein Coll Med, Dept Psychiat, Bronx, NY USA
[5] Univ Virginia, Charlottesville, VA USA
来源
PLOS ONE | 2019年 / 14卷 / 02期
基金
美国国家卫生研究院;
关键词
RISK-FACTORS; IDEATION; RESILIENCE; PREDICTION; DIAGNOSES; VALIDITY; SUPPORT; TIME; CARE; AGE;
D O I
10.1371/journal.pone.0211116
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objective The rapid proliferation of machine learning research using electronic health records to classify healthcare outcomes offers an opportunity to address the pressing public health problem of adolescent suicidal behavior. We describe the development and evaluation of a machine learning algorithm using natural language processing of electronic health records to identify suicidal behavior among psychiatrically hospitalized adolescents. Methods Adolescents hospitalized on a psychiatric inpatient unit in a community health system in the northeastern United States were surveyed for history of suicide attempt in the past 12 months. A total of 73 respondents had electronic health records available prior to the index psychiatric admission. Unstructured clinical notes were downloaded from the year preceding the index inpatient admission. Natural language processing identified phrases from the notes associated with the suicide attempt outcome. We enriched this group of phrases with a clinically focused list of terms representing known risk and protective factors for suicide attempt in adolescents. We then applied the random forest machine learning algorithm to develop a classification model. The model performance was evaluated using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy. Results The final model had a sensitivity of 0.83, specificity of 0.22, AUC of 0.68, a PPV of 0.42, NPV of 0.67, and an accuracy of 0.47. The terms mostly highly associated with suicide attempt clustered around terms related to suicide, family members, psychiatric disorders, and psychotropic medications. Conclusion This analysis demonstrates modest success of a natural language processing and machine learning approach to identifying suicide attempt among a small sample of hospitalized adolescents in a psychiatric setting.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Natural Language Processing and Machine Learning for Identifying Incident Stroke From Electronic Health Records: Algorithm Development and Validation
    Zhao, Yiqing
    Fu, Sunyang
    Bielinski, Suzette J.
    Decker, Paul A.
    Chamberlain, Alanna M.
    Roger, Veronique L.
    Liu, Hongfang
    Larson, Nicholas B.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (03)
  • [22] IDENTIFICATION OF PANCREATIC DUCTAL ADENOCARCINOMA RISK FACTORS FROM ELECTRONIC HEALTH RECORDS USING NATURAL LANGUAGE PROCESSING
    Sarwal, Dhruv
    Wang, Liwei
    Gandhi, Sonal
    Sagheb, Elham
    Janssens, Laurens
    Goncalves, Sandy
    Delgado, Adriana
    Doering, Karen
    Liu Hongfang
    Majumder, Shounak
    GASTROENTEROLOGY, 2022, 162 (07) : S243 - S243
  • [23] Identifying Information Gaps in Electronic Health Records by Using Natural Language Processing: Gynecologic Surgery History Identification
    Moon, Sungrim
    Carlson, Luke A.
    Moser, Ethan D.
    Kshatriya, Bhavani Singh Agnikula
    Smith, Carin Y.
    Rocca, Walter A.
    Rocca, Liliana Gazzuola
    Bielinski, Suzette J.
    Liu, Hongfang
    Larson, Nicholas B.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (01)
  • [24] Identifying Goals of Care Conversations in the Electronic Health Record Using Natural Language Processing and Machine Learning
    Lee, Robert Y.
    Brumback, Lyndia C.
    Lober, William B.
    Sibley, James
    Nielsen, Elizabeth L.
    Treece, Patsy D.
    Kross, Erin K.
    Loggers, Elizabeth T.
    Fausto, James A.
    Lindvall, Charlotta
    Engelberg, Ruth A.
    Curtis, J. Randall
    JOURNAL OF PAIN AND SYMPTOM MANAGEMENT, 2021, 61 (01) : 136 - +
  • [25] Using Natural Language Processing to Identify Different Lens Pathology in Electronic Health Records
    Stein, Joshua d.
    Zhou, Yunshu
    Andrews, Chris a.
    Kim, Judy e.
    Addis, Victoria
    Bixler, Jill
    Grove, Nathan
    Mcmillan, Brian
    Munir, Saleha z.
    Pershing, Suzann
    Schultz, Jeffrey s.
    Stagg, Brian c.
    Wang, Sophia y.
    Woreta, Fasika
    AMERICAN JOURNAL OF OPHTHALMOLOGY, 2024, 262 : 153 - 160
  • [26] Ascertainment of Delirium Status Using Natural Language Processing From Electronic Health Records
    Fu, Sunyang
    Lopes, Guilherme S.
    Pagali, Sandeep R.
    Thorsteinsdottir, Bjoerg
    LeBrasseur, Nathan K.
    Wen, Andrew
    Liu, Hongfang
    Rocca, Walter A.
    Olson, Janet E.
    St Sauver, Jennifer
    Sohn, Sunghwan
    JOURNALS OF GERONTOLOGY SERIES A-BIOLOGICAL SCIENCES AND MEDICAL SCIENCES, 2022, 77 (03): : 524 - 530
  • [27] Using a natural language processing toolkit to classify electronic health records by psychiatric diagnosis
    Hutto, Alissa
    Zikry, Tarek M.
    Bohac, Buck
    Rose, Terra
    Staebler, Jasmine
    Slay, Janet
    Cheever, C. Ray
    Kosorok, Michael R.
    Nash, Rebekah P.
    HEALTH INFORMATICS JOURNAL, 2024, 30 (04)
  • [28] RETRACTED ARTICLE: Analysis of Electronic Health Records Based on Deep Learning with Natural Language Processing
    Yi-Cheng Shen
    Te-Chun Hsia
    Ching-Hsien Hsu
    Arabian Journal for Science and Engineering, 2023, 48 : 2597 - 2597
  • [29] Large-scale identification of aortic stenosis and its severity using natural language processing on electronic health records
    Solomon, Matthew D.
    Tabada, Grace
    Allen, Amanda
    Sung, Sue Hee
    Go, Alan S.
    CARDIOVASCULAR DIGITAL HEALTH JOURNAL, 2021, 2 (03): : 156 - 163
  • [30] DEVELOPMENT OF A NATURAL LANGUAGE PROCESSING MACHINE TO IDENTIFY OPIOID USE DISORDER IN ELECTRONIC HEALTH RECORDS.
    Vorontsova, Y.
    Broyles, A.
    Cummins, J.
    Hood, D.
    Stratford, R.
    Quinney, S.
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2021, 109 : S60 - S60