Mild cognitive impairment prediction based on multi-stream convolutional neural networks

被引：0

作者：

Lee, Chien-Cheng ^{[1
]}

Chau, Hong-Han ^{[1
]}

Wang, Hsiao-Lun ^{[1
]}

Chuang, Yi-Fang ^{[2
,3
]}

Chau, Yawgeng ^{[1
]}

机构：

[1] Yuan Ze Univ, Dept Elect Engn, Taoyuan 320, Taiwan

[2] Natl Yang Ming Chiao Tung Univ, Inst Publ Hlth, Coll Med, Taipei 112, Taiwan

[3] Far Eastern Mem Hosp, Dept Psychiat, New Taipei City 220, Taiwan

来源：

BMC BIOINFORMATICS | 2024年 / 22卷 / SUPPL 5期

关键词：

MCI; ResNet; CNN; Deep learning; Facial features; ALZHEIMERS-DISEASE; DIAGNOSIS; MOTION; MCI;

D O I：

10.1186/s12859-024-05911-6

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

BackgroundMild cognitive impairment (MCI) is the transition stage between the cognitive decline expected in normal aging and more severe cognitive decline such as dementia. The early diagnosis of MCI plays an important role in human healthcare. Current methods of MCI detection include cognitive tests to screen for executive function impairments, possibly followed by neuroimaging tests. However, these methods are expensive and time-consuming. Several studies have demonstrated that MCI and dementia can be detected by machine learning technologies from different modality data. This study proposes a multi-stream convolutional neural network (MCNN) model to predict MCI from face videos.ResultsThe total effective data are 48 facial videos from 45 participants, including 35 videos from normal cognitive participants and 13 videos from MCI participants. The videos are divided into several segments. Then, the MCNN captures the latent facial spatial features and facial dynamic features of each segment and classifies the segment as MCI or normal. Finally, the aggregation stage produces the final detection results of the input video. We evaluate 27 MCNN model combinations including three ResNet architectures, three optimizers, and three activation functions. The experimental results showed that the ResNet-50 backbone with Swish activation function and Ranger optimizer produces the best results with an F1-score of 89% at the segment level. However, the ResNet-18 backbone with Swish and Ranger achieves the F1-score of 100% at the participant level.ConclusionsThis study presents an efficient new method for predicting MCI from facial videos. Studies have shown that MCI can be detected from facial videos, and facial data can be used as a biomarker for MCI. This approach is very promising for developing accurate models for screening MCI through facial data. It demonstrates that automated, non-invasive, and inexpensive MCI screening methods are feasible and do not require highly subjective paper-and-pencil questionnaires. Evaluation of 27 model combinations also found that ResNet-50 with Swish is more stable for different optimizers. Such results provide directions for hyperparameter tuning to further improve MCI predictions.

引用

页数：16

共 50 条

[1] Fight Detection in Video Sequences Based on Multi-Stream Convolutional Neural Networks
Carneiro, Sarah Almeida
da Silva, Gabriel Pellegrino
Guimaraes, Silvio Jamil F.
Pedrini, Helio
[J]. 2019 32ND SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2019, : 8 - 15
[2] Region based multi-stream convolutional neural networks for collective activity recognition
Zalluhoglu, Cemil
Ikizler-Cinbis, Nazli
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 : 170 - 179
[3] Discrimination and conversion prediction of mild cognitive impairment using convolutional neural networks
Wu, Congling
Guo, Shengwen
Hong, Yanjia
Xiao, Benheng
Wu, Yupeng
Zhang, Qin
[J]. QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2018, 8 (10) : 992 - 1003
[4] Cervical Cell Features Based Multi-Stream Convolutional Neural Networks Classification Method
Yang, Zhiming
Li, Yawei
Yang, Bing
Pang, Wenbo
Tian, Zening
Wang, Yong
[J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (04): : 531 - 540
[5] Multi-stream pose convolutional neural networks for human interaction recognition in images
Tanisik, Gokhan
Zalluhoglu, Cemil
Ikizler-Cinbis, Nazli
[J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 95
[6] Multi-stream with Deep Convolutional Neural Networks for Human Action Recognition in Videos
Liu, Xiao
Yang, Xudong
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I, 2018, 11301 : 251 - 262
[7] Facial Beauty Prediction From Facial Parts Using Multi-Task and Multi-Stream Convolutional Neural Networks
Vahdati, Elham
Suen, Ching Y.
[J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (12)
[8] Elderly fall detection based on multi-stream deep convolutional networks
Chadia Khraief
Faouzi Benzarti
Hamid Amiri
[J]. Multimedia Tools and Applications, 2020, 79 : 19537 - 19560
[9] Elderly fall detection based on multi-stream deep convolutional networks
Khraief, Chadia
Benzarti, Faouzi
Amiri, Hamid
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 19537 - 19560
[10] Multi-stream Convolutional Networks for Indoor Scene Recognition
Anwer, Rao Muhammad
Khan, Fahad Shahbaz
Laaksonen, Jorma
Zaki, Nazar
[J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2019, PT I, 2019, 11678 : 196 - 208

← 1 2 3 4 5 →