Risk prediction using natural language processing of electronic mental health records in an inpatient forensic psychiatry setting

Cited by: 35
Authors
Duy Van Le [1]
Montgomery, James [1]
Kirkby, Kenneth C. [2]
Scanlan, Joel [1]
Affiliations
[1] Univ Tasmania, Coll Sci & Engn, Sch Technol Environm & Design, Private Bag 87, Hobart, Tas 7001, Australia
[2] Univ Tasmania, Coll Hlth & Med, Sch Med, Private Bag 87, Hobart, Tas 7001, Australia
Keywords
Text mining; Natural language processing; Electronic health record; Mental health; Psychiatry; MEDICAL-RECORDS; ONTOLOGY; DISEASE; INFORMATION; EXTRACTION; VIOLENCE; VERSION; UMLS; TEXT
DOI
10.1016/j.jbi.2018.08.007
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Objective: Instruments rating risk of harm to self and others are widely used in inpatient forensic psychiatry settings. A potential alternative or supplementary means of risk prediction is the automated analysis of case notes in Electronic Health Records (EHRs) using Natural Language Processing (NLP). This exploratory study rated the presence or absence and the frequency of words in a forensic EHR dataset, comparing four reference dictionaries. Seven machine learning algorithms and different time periods of EHR analysis were used to probe which dictionary and which time period were most predictive of risk assessment scores on validated instruments.
Materials and methods: The EHR dataset comprised de-identified forensic inpatient notes from the Wilfred Lopes Centre in Tasmania. The data comprised unstructured free-text case note entries and serial ratings on three risk assessment scales: Historical Clinical Risk Management-20 (HCR-20), Short-Term Assessment of Risk and Treatability (START) and Dynamic Appraisal of Situational Aggression (DASA). Four NLP dictionary word lists were selected: 6865 mental health symptom words from the Unified Medical Language System (UMLS), 455 DSM-IV diagnoses from the UMLS repository, 6790 English positive and negative sentiment words, and 1837 high-frequency words from the Corpus of Contemporary American English (COCA). Seven machine learning methods (Bagging, J48, Jrip, Logistic Model Trees (LMT), Logistic Regression, Linear Regression and Support Vector Machine (SVM)) were used to identify the combination of dictionaries and algorithms that best predicted risk assessment scores.
Results: The most accurate prediction was attained on the DASA dataset using the sentiment dictionary and the LMT and SVM algorithms.
Conclusions: NLP, used in conjunction with NLP dictionaries and machine learning, predicted risk ratings on the HCR-20, START, and DASA, based on EHR content. Further research is required to ascertain the utility of NLP approaches in predicting endpoints of actual self-harm, harm to others or victimisation.
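The abstract describes rating the presence or absence and the frequency of dictionary words in free-text case notes. The following is a minimal sketch of that kind of feature extraction, not the authors' implementation: the tokenizer, the function name `dictionary_features`, the toy word list and the example note are all hypothetical, and no classifier is included.

```python
import re
from collections import Counter

# Assumption: a simple lowercase word tokenizer; the study's actual
# preprocessing is not described in the abstract.
TOKEN_RE = re.compile(r"[a-z]+(?:'[a-z]+)?")


def tokenize(text):
    """Split free text into lowercase word tokens."""
    return TOKEN_RE.findall(text.lower())


def dictionary_features(note, dictionary):
    """Return (vocab, presence, frequency) for one case note.

    vocab     -- dictionary words in sorted order (fixes the column order)
    presence  -- 1 if the word occurs in the note, else 0
    frequency -- number of occurrences of the word in the note
    """
    counts = Counter(tok for tok in tokenize(note) if tok in dictionary)
    vocab = sorted(dictionary)
    frequency = [counts[word] for word in vocab]
    presence = [1 if count > 0 else 0 for count in frequency]
    return vocab, presence, frequency


# Hypothetical toy dictionary and note, for illustration only.
sentiment_words = {"calm", "angry", "hostile", "settled"}
note = "Patient was angry and hostile at handover; angry again after lunch, later calm."

vocab, presence, frequency = dictionary_features(note, sentiment_words)
for word, present, count in zip(vocab, presence, frequency):
    print(f"{word}: present={present}, count={count}")
```

Feature vectors of this form, one per note or per time window, could then be passed to any of the classifier families named in the abstract; that step is left out here because the abstract does not specify the toolkit or settings used.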
Pages: 49-58
Page count: 10