Identification of suicidal behavior among psychiatrically hospitalized adolescents using natural language processing and machine learning of electronic health records

被引:57
|
作者
Carson, Nicholas J. [1 ,2 ]
Mullin, Brian [1 ]
Sanchez, Maria Jose [1 ,3 ]
Lu, Frederick [1 ]
Yang, Kelly [1 ,4 ]
Menezes, Michelle [1 ,5 ]
Le Cook, Benjamin [1 ,2 ]
机构
[1] Cambridge Hlth Alliance, Hlth Equ Res Lab, Cambridge, MA 02139 USA
[2] Harvard Med Sch, Dept Psychiat, Boston, MA 02115 USA
[3] George Washington Univ, Prevent & Community Hlth Dept, Milken Sch Publ Hlth, Washington, DC USA
[4] Albert Einstein Coll Med, Dept Psychiat, Bronx, NY USA
[5] Univ Virginia, Charlottesville, VA USA
来源
PLOS ONE | 2019年 / 14卷 / 02期
基金
美国国家卫生研究院;
关键词
RISK-FACTORS; IDEATION; RESILIENCE; PREDICTION; DIAGNOSES; VALIDITY; SUPPORT; TIME; CARE; AGE;
D O I
10.1371/journal.pone.0211116
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objective The rapid proliferation of machine learning research using electronic health records to classify healthcare outcomes offers an opportunity to address the pressing public health problem of adolescent suicidal behavior. We describe the development and evaluation of a machine learning algorithm using natural language processing of electronic health records to identify suicidal behavior among psychiatrically hospitalized adolescents. Methods Adolescents hospitalized on a psychiatric inpatient unit in a community health system in the northeastern United States were surveyed for history of suicide attempt in the past 12 months. A total of 73 respondents had electronic health records available prior to the index psychiatric admission. Unstructured clinical notes were downloaded from the year preceding the index inpatient admission. Natural language processing identified phrases from the notes associated with the suicide attempt outcome. We enriched this group of phrases with a clinically focused list of terms representing known risk and protective factors for suicide attempt in adolescents. We then applied the random forest machine learning algorithm to develop a classification model. The model performance was evaluated using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy. Results The final model had a sensitivity of 0.83, specificity of 0.22, AUC of 0.68, a PPV of 0.42, NPV of 0.67, and an accuracy of 0.47. The terms mostly highly associated with suicide attempt clustered around terms related to suicide, family members, psychiatric disorders, and psychotropic medications. Conclusion This analysis demonstrates modest success of a natural language processing and machine learning approach to identifying suicide attempt among a small sample of hospitalized adolescents in a psychiatric setting.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] Identification of recurrent atrial fibrillation using natural language processing applied to electronic health records
    Zheng, Chengyi
    Lee, Ming-sum
    Bansal, Nisha
    Go, Alan S.
    Chen, Cheng
    Harrison, Teresa N.
    Fan, Dongjie
    Allen, Amanda
    Garcia, Elisha
    Lidgard, Ben
    Singer, Daniel
    An, Jaejin
    EUROPEAN HEART JOURNAL-QUALITY OF CARE AND CLINICAL OUTCOMES, 2024, 10 (01) : 77 - 88
  • [12] Using Natural Language Processing to Predict Risk in Electronic Health Records
    Duy Van Le
    Montgomery, James
    Kirkby, Kenneth
    Scanlan, Joel
    MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 574 - 578
  • [13] Natural language processing and machine learning of electronic health records for prediction of first-time suicide attempts
    Tsui, Fuchiang R.
    Shi, Lingyun
    Ruiz, Victor
    Ryan, Neal D.
    Biernesser, Candice
    Iyengar, Satish
    Walsh, Colin G.
    Brent, David A.
    JAMIA OPEN, 2021, 4 (01)
  • [14] AUTOMATED, ACCURATE IDENTIFICATION OF VENTRICULAR TACHYCARDIA FROM ELECTRONIC HEALTH RECORDS USING NATURAL LANGUAGE PROCESSING
    Brennan, Kelly
    Azizi, Zahra
    Feng, Ruibin
    Goyal, Jatin
    Liu, Xichong
    Ganesan, Prasanth
    Ruiperez-Campillo, Samuel
    Baykaner, Tina
    Badhwar, Nitish
    John, Roy M.
    Viswanathan, Mohan
    Perino, Alexander
    Wang, Paul J.
    Rogers, Albert J.
    Narayan, Sanjiv M.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2024, 83 (13) : 2644 - 2644
  • [15] Physical and sexual abuse and their relation to psychiatric disorder and suicidal behavior among adolescents who are psychiatrically hospitalized
    Cohen, Y
    Spirito, A
    Sterling, C
    Seifer, R
    Plummer, B
    Donaldson, D
    Avila, R
    Ferrer, K
    JOURNAL OF CHILD PSYCHOLOGY AND PSYCHIATRY AND ALLIED DISCIPLINES, 1996, 37 (08): : 989 - 993
  • [16] Using Natural Language Processing and Machine Learning to Identify Opioids in Electronic Health Record Data
    McDermott, Sean P.
    Wasan, Ajay D.
    JOURNAL OF PAIN RESEARCH, 2023, 16 : 2133 - 2140
  • [17] Deep Learning Approaches for Predicting Glaucoma Progression Using Electronic Health Records and Natural Language Processing
    Wang, Sophia Y.
    Tseng, Benjamin
    Hernandez-Boussard, Tina
    OPHTHALMOLOGY SCIENCE, 2022, 2 (02):
  • [18] NATURAL LANGUAGE PROCESSING METHODS ENHANCE MACE IDENTIFICATION FROM ELECTRONIC HEALTH RECORDS
    St Laurent, S.
    Guo, M.
    Alfonso, R.
    Okoro, T.
    Johansen, K.
    Dember, L.
    Lindsay, A.
    VALUE IN HEALTH, 2018, 21 : S217 - S217
  • [19] A cross-cultural comparison of behavior disturbance and suicidal behavior among psychiatrically hospitalized adolescents in Israel and the United States
    Cohen, Y
    Spirito, A
    Apter, A
    Saini, S
    CHILD PSYCHIATRY & HUMAN DEVELOPMENT, 1997, 28 (02) : 89 - 102
  • [20] A Cross-Cultural Comparison of Behavior Disturbance and Suicidal Behavior Among Psychiatrically Hospitalized Adolescents in Israel and the United States
    Y. Cohen
    A. Spirito
    A. Apter
    S. Saini
    Child Psychiatry and Human Development, 1997, 28 : 89 - 102