Mild cognitive impairment prediction based on multi-stream convolutional neural networks

被引:0
|
作者
Lee, Chien-Cheng [1 ]
Chau, Hong-Han [1 ]
Wang, Hsiao-Lun [1 ]
Chuang, Yi-Fang [2 ,3 ]
Chau, Yawgeng [1 ]
机构
[1] Yuan Ze Univ, Dept Elect Engn, Taoyuan 320, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Inst Publ Hlth, Coll Med, Taipei 112, Taiwan
[3] Far Eastern Mem Hosp, Dept Psychiat, New Taipei City 220, Taiwan
来源
BMC BIOINFORMATICS | 2024年 / 22卷 / SUPPL 5期
关键词
MCI; ResNet; CNN; Deep learning; Facial features; ALZHEIMERS-DISEASE; DIAGNOSIS; MOTION; MCI;
D O I
10.1186/s12859-024-05911-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundMild cognitive impairment (MCI) is the transition stage between the cognitive decline expected in normal aging and more severe cognitive decline such as dementia. The early diagnosis of MCI plays an important role in human healthcare. Current methods of MCI detection include cognitive tests to screen for executive function impairments, possibly followed by neuroimaging tests. However, these methods are expensive and time-consuming. Several studies have demonstrated that MCI and dementia can be detected by machine learning technologies from different modality data. This study proposes a multi-stream convolutional neural network (MCNN) model to predict MCI from face videos.ResultsThe total effective data are 48 facial videos from 45 participants, including 35 videos from normal cognitive participants and 13 videos from MCI participants. The videos are divided into several segments. Then, the MCNN captures the latent facial spatial features and facial dynamic features of each segment and classifies the segment as MCI or normal. Finally, the aggregation stage produces the final detection results of the input video. We evaluate 27 MCNN model combinations including three ResNet architectures, three optimizers, and three activation functions. The experimental results showed that the ResNet-50 backbone with Swish activation function and Ranger optimizer produces the best results with an F1-score of 89% at the segment level. However, the ResNet-18 backbone with Swish and Ranger achieves the F1-score of 100% at the participant level.ConclusionsThis study presents an efficient new method for predicting MCI from facial videos. Studies have shown that MCI can be detected from facial videos, and facial data can be used as a biomarker for MCI. This approach is very promising for developing accurate models for screening MCI through facial data. It demonstrates that automated, non-invasive, and inexpensive MCI screening methods are feasible and do not require highly subjective paper-and-pencil questionnaires. Evaluation of 27 model combinations also found that ResNet-50 with Swish is more stable for different optimizers. Such results provide directions for hyperparameter tuning to further improve MCI predictions.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Fight Detection in Video Sequences Based on Multi-Stream Convolutional Neural Networks
    Carneiro, Sarah Almeida
    da Silva, Gabriel Pellegrino
    Guimaraes, Silvio Jamil F.
    Pedrini, Helio
    [J]. 2019 32ND SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2019, : 8 - 15
  • [2] Region based multi-stream convolutional neural networks for collective activity recognition
    Zalluhoglu, Cemil
    Ikizler-Cinbis, Nazli
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 : 170 - 179
  • [3] Discrimination and conversion prediction of mild cognitive impairment using convolutional neural networks
    Wu, Congling
    Guo, Shengwen
    Hong, Yanjia
    Xiao, Benheng
    Wu, Yupeng
    Zhang, Qin
    [J]. QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2018, 8 (10) : 992 - 1003
  • [4] Cervical Cell Features Based Multi-Stream Convolutional Neural Networks Classification Method
    Yang, Zhiming
    Li, Yawei
    Yang, Bing
    Pang, Wenbo
    Tian, Zening
    Wang, Yong
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (04): : 531 - 540
  • [5] Multi-stream pose convolutional neural networks for human interaction recognition in images
    Tanisik, Gokhan
    Zalluhoglu, Cemil
    Ikizler-Cinbis, Nazli
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 95
  • [6] Multi-stream with Deep Convolutional Neural Networks for Human Action Recognition in Videos
    Liu, Xiao
    Yang, Xudong
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I, 2018, 11301 : 251 - 262
  • [7] Facial Beauty Prediction From Facial Parts Using Multi-Task and Multi-Stream Convolutional Neural Networks
    Vahdati, Elham
    Suen, Ching Y.
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (12)
  • [8] Elderly fall detection based on multi-stream deep convolutional networks
    Chadia Khraief
    Faouzi Benzarti
    Hamid Amiri
    [J]. Multimedia Tools and Applications, 2020, 79 : 19537 - 19560
  • [9] Elderly fall detection based on multi-stream deep convolutional networks
    Khraief, Chadia
    Benzarti, Faouzi
    Amiri, Hamid
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 19537 - 19560
  • [10] Multi-stream Convolutional Networks for Indoor Scene Recognition
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    Laaksonen, Jorma
    Zaki, Nazar
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT I, 2019, 11678 : 196 - 208