Development of a machine learning model to predict mild cognitive impairment using natural language processing in the absence of screening

被引:7
|
作者
Penfold, Robert B. [1 ]
Carrell, David S. [1 ]
Cronkite, David J. [1 ]
Pabiniak, Chester [1 ]
Dodd, Tammy [1 ]
Glass, Ashley Mh [1 ]
Johnson, Eric [1 ]
Thompson, Ella [1 ]
Arrighi, H. Michael [2 ]
Stang, Paul E. [2 ]
机构
[1] Kaiser Permanente Washington Hlth Res Inst, 1730 Minor Ave,Suite 1600, Seattle, WA 98101 USA
[2] Janssen Res & Dev LLC, Raritan, NJ USA
关键词
NLP; MCI; Dementia; Early identification; MINI-MENTAL-STATE; DEMENTIA RISK; ALZHEIMERS-DISEASE; PRIMARY-CARE; OLDER-ADULTS; DIAGNOSIS; COHORT; SCORE; FREQUENCY; ACCURACY;
D O I
10.1186/s12911-022-01864-z
中图分类号
R-058 [];
学科分类号
摘要
Background Patients and their loved ones often report symptoms or complaints of cognitive decline that clinicians note in free clinical text, but no structured screening or diagnostic data are recorded. These symptoms/complaints may be signals that predict who will go on to be diagnosed with mild cognitive impairment (MCI) and ultimately develop Alzheimer's Disease or related dementias. Our objective was to develop a natural language processing system and prediction model for identification of MCI from clinical text in the absence of screening or other structured diagnostic information. Methods There were two populations of patients: 1794 participants in the Adult Changes in Thought (ACT) study and 2391 patients in the general population of Kaiser Permanente Washington. All individuals had standardized cognitive assessment scores. We excluded patients with a diagnosis of Alzheimer's Disease, Dementia or use of donepezil. We manually annotated 10,391 clinic notes to train the NLP model. Standard Python code was used to extract phrases from notes and map each phrase to a cognitive functioning concept. Concepts derived from the NLP system were used to predict future MCI. The prediction model was trained on the ACT cohort and 60% of the general population cohort with 40% withheld for validation. We used a least absolute shrinkage and selection operator logistic regression approach (LASSO) to fit a prediction model with MCI as the prediction target. Using the predicted case status from the LASSO model and known MCI from standardized scores, we constructed receiver operating curves to measure model performance. Results Chart abstraction identified 42 MCI concepts. Prediction model performance in the validation data set was modest with an area under the curve of 0.67. Setting the cutoff for correct classification at 0.60, the classifier yielded sensitivity of 1.7%, specificity of 99.7%, PPV of 70% and NPV of 70.5% in the validation cohort. Discussion and conclusion Although the sensitivity of the machine learning model was poor, negative predictive value was high, an important characteristic of models used for population-based screening. While an AUC of 0.67 is generally considered moderate performance, it is also comparable to several tests that are widely used in clinical practice.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Development of a machine learning model to predict mild cognitive impairment using natural language processing in the absence of screening
    Robert B. Penfold
    David S. Carrell
    David J. Cronkite
    Chester Pabiniak
    Tammy Dodd
    Ashley MH Glass
    Eric Johnson
    Ella Thompson
    H. Michael Arrighi
    Paul E. Stang
    [J]. BMC Medical Informatics and Decision Making, 22
  • [2] Mild cognitive impairment, dementia, and cognitive dysfunction screening using machine learning
    Yim, Daehyuk
    Yeo, Tae Young
    Park, Moon Ho
    [J]. JOURNAL OF INTERNATIONAL MEDICAL RESEARCH, 2020, 48 (07)
  • [3] Machine Learning Model to Predict Diagnosis of Mild Cognitive Impairment by Using Radiomic and Amyloid Brain PET
    Ciarmiello, Andrea
    Giovannini, Elisabetta
    Pastorino, Sara
    Ferrando, Ornella
    Foppiano, Franca
    Mannironi, Antonio
    Tartaglione, Antonio
    Giovacchini, Giampiero
    [J]. CLINICAL NUCLEAR MEDICINE, 2023, 48 (01) : 1 - 7
  • [4] Machine Learning Model To Predict Diagnosis Of Mild Cognitive Impairment By Using Radiomic And Amyloid Brain PET
    Giovacchini, G.
    Giovannini, E.
    Pastorino, S.
    Ferrando, O.
    Mannironi, A.
    Tartaglione, A.
    Ciarmiello, A.
    [J]. EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2022, 49 (SUPPL 1) : S103 - S103
  • [5] A Comparison of Connected Speech Tasks for Detecting Early Alzheimer's Disease and Mild Cognitive Impairment Using Natural Language Processing and Machine Learning
    Clarke, Natasha
    Barrick, Thomas R.
    Garrard, Peter
    [J]. FRONTIERS IN COMPUTER SCIENCE, 2021, 3
  • [6] Development of a Machine Learning Model to Discriminate Mild Cognitive Impairment Subjects from Normal Controls in Community Screening
    Jiang, Juanjuan
    Zhang, Jieming
    Li, Chenyang
    Yu, Zhihua
    Yan, Zhuangzhi
    Jiang, Jiehui
    [J]. BRAIN SCIENCES, 2022, 12 (09)
  • [7] Machine learning can predict mild cognitive impairment in Parkinson's disease
    Amboni, Marianna
    Ricciardi, Carlo
    Adamo, Sarah
    Nicolai, Emanuele
    Volzone, Antonio
    Erro, Roberto
    Cuoco, Sofia
    Cesarelli, Giuseppe
    Basso, Luca
    D'Addio, Giovanni
    Salvatore, Marco
    Pace, Leonardo
    Barone, Paolo
    [J]. FRONTIERS IN NEUROLOGY, 2022, 13
  • [8] Detection of Mild Cognitive Impairment Through Natural Language and Touchscreen Typing Processing
    Ntracha, Anastasia
    Iakovakis, Dimitrios
    Hadjidimitriou, Stelios
    Charisis, Vasileios S.
    Tsolaki, Magda
    Hadjileontiadis, Leontios J.
    [J]. FRONTIERS IN DIGITAL HEALTH, 2020, 2
  • [9] Machine-Learning Algorithms Based on Screening Tests for Mild Cognitive Impairment
    Park, Jin-Hyuck
    [J]. AMERICAN JOURNAL OF ALZHEIMERS DISEASE AND OTHER DEMENTIAS, 2020, 35
  • [10] A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
    Munoz-Almaraz, Francisco J.
    Climent, Maria Teresa
    Guerrero, Maria Dolores
    Moreno, Lucrecia
    Pardo, Juan
    [J]. JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2020, (155):