Comparison of ischemic stroke diagnosis models based on machine learning

被引:6
|
作者
Yang, Wan-Xia [1 ]
Wang, Fang-Fang [1 ]
Pan, Yun-Yan [1 ]
Xie, Jian-Qin [2 ]
Lu, Ming-Hua [1 ]
You, Chong-Ge [1 ]
机构
[1] Lanzhou Univ, Hosp 2, Lab Med Ctr, Lanzhou, Peoples R China
[2] Lanzhou Univ, Hosp 2, Anesthesiol Dept, Lanzhou, Peoples R China
来源
FRONTIERS IN NEUROLOGY | 2022年 / 13卷
关键词
ischemic stroke; machine learning; artificial neural network; diagnostic model; transcriptomics; GLOBAL BURDEN; DISEASE; CHINA;
D O I
10.3389/fneur.2022.1014346
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
BackgroundThe incidence, prevalence, and mortality of ischemic stroke (IS) continue to rise, resulting in a serious global disease burden. The prediction models have a great value in the early prediction and diagnosis of IS. MethodsThe R software was used to screen the differentially expressed genes (DEGs) of IS and control samples in the datasets GSE16561, GSE58294, and GSE37587 and analyze DEGs for enrichment analysis. The feature genes of IS were obtained by several machine learning algorithms, including the least absolute shrinkage and selector operation (LASSO) logistic regression, the support vector machine-recursive feature elimination (SVM-RFE), and the Random Forest (RF). The IS diagnostic models were constructed based on transcriptomics by machine learning and artificial neural network (ANN). ResultsA total of 69 DEGs, mainly involved in immune and inflammatory responses, were identified. The pathways enriched in the IS group were complement and coagulation cascades, lysosome, PPAR signaling pathway, regulation of autophagy, and toll-like receptor signaling pathway. The feature genes selected by LASSO, SVM-RFE, and RF were 17, 10, and 12, respectively. The area under the curve (AUC) of the LASSO model in the training dataset, GSE22255, and GSE195442 was 0.969, 0.890, and 1.000. The AUC of the SVM-RFE model was 0.957, 0.805, and 1.000, respectively. The AUC of the RF model was 0.947, 0.935, and 1.000, respectively. The models have good sensitivity, specificity, and accuracy. The AUC of the LASSO+ANN, SVM-RFE+ANN, and RF+ANN models was 1.000, 0.995, and 0.997, respectively, in the training dataset. However, the AUC of LASSO+ANN, SVM-RFE+ANN, and RF+ANN models was 0.688, 0.605, and 0.619, respectively, in the GSE22255 dataset. The AUC of the LASSO+ANN and RF+ANN models was 0.740 and 0.630, respectively, in the GSE195442 dataset. In the training dataset, the sensitivity, specificity, and accuracy of the LASSO+ANN model were 1.000, 1.000, and 1.000, respectively; of the SVM-RFE+ANN model were 0.946, 0.982, and 0.964, respectively; and of the RF+ANN model were 0.964, 1.000, and 0.982, respectively. In the test datasets, the sensitivity was very satisfactory; however, the specificity and accuracy were not good. ConclusionThe LASSO, SVM-RFE, and RF models have good prediction abilities. However, the ANN model is efficient at classifying positive samples and is unsuitable at classifying negative samples.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Interpretable Machine Learning Modeling for Ischemic Stroke Outcome Prediction
    Jabal, Mohamed Sobhi
    Joly, Olivier
    Kallmes, David
    Harston, George
    Rabinstein, Alejandro
    Huynh, Thien
    Brinjikji, Waleed
    [J]. FRONTIERS IN NEUROLOGY, 2022, 13
  • [32] Machine Learning Predicts Sex Difference in Acute Ischemic Stroke
    Lai, Yun-Ju
    Blixt, Frank W.
    McCullough, Louise D.
    [J]. STROKE, 2020, 51
  • [33] Differential Diagnostic Value of Machine Learning-Based Models for Embolic Stroke
    Kuo, Hsunyu
    Liu, Tsai-Wei
    Huang, Yo-Ping
    Chin, Shy-Chyi
    Ro, Long-Sun
    Kuo, Hung-Chou
    [J]. CLINICAL AND APPLIED THROMBOSIS-HEMOSTASIS, 2023, 29
  • [34] Machine Learning in Action: Stroke Diagnosis and Outcome Prediction
    Mainali, Shraddha
    Darsie, Marin E.
    Smetana, Keaton S.
    [J]. FRONTIERS IN NEUROLOGY, 2021, 12
  • [35] Comparison of population-based models of risk factors for transient ischemic attacks and ischemic stroke in Rochester, Minnesota
    Brown, RD
    Whisnant, JP
    Petty, GW
    O'Fallon, WM
    Sicks, JD
    Wiebers, DO
    [J]. NEUROLOGY, 1999, 52 (06) : A443 - A443
  • [36] Comparison and integration of machine learning based ethylene cracking process models
    Zhao Q.
    Bi K.
    Qiu T.
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (09): : 1450 - 1457
  • [37] Comparison of Multiple Machine Learning Models Based on Enterprise Revenue Forecasting
    Huang Lei
    Huang Cailan
    [J]. 2021 ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE (ACCTCS 2021), 2021, : 354 - 359
  • [38] Development of machine learning models for diagnosis of glaucoma
    Kim, Seong Jae
    Cho, Kyong Jin
    Oh, Sejong
    [J]. PLOS ONE, 2017, 12 (05):
  • [39] On Machine Learning Models for Heart Disease Diagnosis
    Lin, Chu-Hsing
    Yang, Po-Kai
    Lin, Yu-Chiao
    Fu, Pin-Kuei
    [J]. PROCEEDINGS OF THE 2ND IEEE EURASIA CONFERENCE ON BIOMEDICAL ENGINEERING, HEALTHCARE AND SUSTAINABILITY 2020 (IEEE ECBIOS 2020): BIOMEDICAL ENGINEERING, HEALTHCARE AND SUSTAINABILITY, 2020, : 158 - 161
  • [40] MACHINE-ASSISTED DIAGNOSIS OF CHD BASED ON VARIANT LOGIC THEORY AND MACHINE LEARNING MODELS
    Yao, Ruping
    Pan, Jiahua
    Sun, Jing
    Sun, Ke
    Wang, Weilian
    [J]. BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2021, 33 (03):