Prediction of Stroke Outcome Using Natural Language Processing-Based Machine Learning of Radiology Report of Brain MRI

被引:25
|
作者
Heo, Tak Sung [1 ]
Kim, Yu Seop [1 ]
Choi, Jeong Myeong [1 ]
Jeong, Yeong Seok [1 ]
Seo, Soo Young [1 ]
Lee, Jun Ho [2 ]
Jeon, Jin Pyeong [3 ]
Kim, Chulho [4 ]
机构
[1] Hallym Univ, Dept Convergence Software, Chunchon 24252, South Korea
[2] Chuncheon Sacred Heart Hosp, Dept Otorhinolaryngol & Head & Neck Surg, Chunchon 24253, South Korea
[3] Chuncheon Sacred Heart Hosp, Dept Neurosurg, Chunchon 24253, South Korea
[4] Chuncheon Sacred Heart Hosp, Dept Neurol, Chunchon 24253, South Korea
来源
JOURNAL OF PERSONALIZED MEDICINE | 2020年 / 10卷 / 04期
关键词
ischemic stroke; functional outcome; machine learning; deep learning; natural language processing; magnetic resonance imaging; ISCHEMIC-STROKE; HEMORRHAGE; RISK;
D O I
10.3390/jpm10040286
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Brain magnetic resonance imaging (MRI) is useful for predicting the outcome of patients with acute ischemic stroke (AIS). Although deep learning (DL) using brain MRI with certain image biomarkers has shown satisfactory results in predicting poor outcomes, no study has assessed the usefulness of natural language processing (NLP)-based machine learning (ML) algorithms using brain MRI free-text reports of AIS patients. Therefore, we aimed to assess whether NLP-based ML algorithms using brain MRI text reports could predict poor outcomes in AIS patients. This study included only English text reports of brain MRIs examined during admission of AIS patients. Poor outcome was defined as a modified Rankin Scale score of 3-6, and the data were captured by trained nurses and physicians. We only included MRI text report of the first MRI scan during the admission. The text dataset was randomly divided into a training and test dataset with a 7:3 ratio. Text was vectorized to word, sentence, and document levels. In the word level approach, which did not consider the sequence of words, and the "bag-of-words" model was used to reflect the number of repetitions of text token. The "sent2vec" method was used in the sensation-level approach considering the sequence of words, and the word embedding was used in the document level approach. In addition to conventional ML algorithms, DL algorithms such as the convolutional neural network (CNN), long short-term memory, and multilayer perceptron were used to predict poor outcomes using 5-fold cross-validation and grid search techniques. The performance of each ML classifier was compared with the area under the receiver operating characteristic (AUROC) curve. Among 1840 subjects with AIS, 645 patients (35.1%) had a poor outcome 3 months after the stroke onset. Random forest was the best classifier (0.782 of AUROC) using a word-level approach. Overall, the document-level approach exhibited better performance than did the word- or sentence-level approaches. Among all the ML classifiers, the multi-CNN algorithm demonstrated the best classification performance (0.805), followed by the CNN (0.799) algorithm. When predicting future clinical outcomes using NLP-based ML of radiology free-text reports of brain MRI, DL algorithms showed superior performance over the other ML algorithms. In particular, the prediction of poor outcomes in document-level NLP DL was improved more by multi-CNN and CNN than by recurrent neural network-based algorithms. NLP-based DL algorithms can be used as an important digital marker for unstructured electronic health record data DL prediction.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] Natural language processing-based machine learning psychological emotion analysis method
    Zhao, Yang
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2024, 28 (3-4)
  • [2] A Natural Language Processing-based Model to Automate MRI Brain Protocol Selection and Prioritization
    Brown, Andrew D.
    Marotta, Thomas R.
    [J]. ACADEMIC RADIOLOGY, 2017, 24 (02) : 160 - 166
  • [3] Natural language processing and machine learning algorithm to identify brain MRI reports with acute ischemic stroke
    Kim, Chulho
    Zhu, Vivienne
    Obeid, Jihad
    Lenert, Leslie
    [J]. PLOS ONE, 2019, 14 (02):
  • [4] Machine learning based natural language processing of radiology reports in orthopaedic trauma
    Olthof, A. W.
    Shouche, P.
    Fennema, E. M.
    IJpma, F. F. A.
    Koolstra, R. H. C.
    Stirler, V. M. A.
    van Ooijen, P. M. A.
    Cornelissen, L. J.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208
  • [5] Application of Natural Language Processing and Machine Learning to Radiology Reports
    Jeon, Seoungdeok
    Colburn, Zachary
    Sakai, Joshua
    Hung, Ling-Hong
    Yeung, Ka Yee
    [J]. 12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [6] A MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING-BASED SMISHING DETECTION MODEL FOR MOBILE MONEY TRANSACTIONS
    Zimba, Aaron
    Phiri, Katongo O.
    Kashale, Chimanga
    Phiri, Mwiza Norina
    [J]. INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2024, 16 (03): : 69 - 80
  • [7] Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing
    Lineback, Christina M.
    Garg, Ravi
    Oh, Elissa
    Naidech, Andrew M.
    Holl, Jane L.
    Prabhakaran, Shyam
    [J]. FRONTIERS IN NEUROLOGY, 2021, 12
  • [8] Prediction of stroke thrombolysis outcome using CT brain machine learning
    Bentley, Paul
    Ganesalingam, Jeban
    Jones, Anoma Lalani Carlton
    Mahady, Kate
    Epton, Sarah
    Rinne, Paul
    Sharma, Pankaj
    Halse, Omid
    Mehta, Amrish
    Rueckert, Daniel
    [J]. NEUROIMAGE-CLINICAL, 2014, 4 : 635 - 640
  • [9] Differentiation of lumbar disc herniation and lumbar spinal stenosis using natural language processing-based machine learning based on positive symptoms
    Ren, GuanRui
    Yu, Kun
    Xie, ZhiYang
    Liu, Lei
    Wang, PeiYang
    Zhang, Wei
    Wang, YunTao
    Wu, XiaoTao
    [J]. NEUROSURGICAL FOCUS, 2022, 52 (04)
  • [10] Nursing innovations in machine learning: Using Natural Language Processing in Falls Prediction
    Solberg, L. M.
    Ingibjargardottir, R.
    Wu, Y.
    Lucero, R.
    [J]. JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2020, 68 : S48 - S49