Multimodal Deep Learning Framework for Mental Disorder Recognition

被引：35

作者：

Zhang, Ziheng ^{[1
,4
]}

Lin, Weizhe ^{[2
]}

Liu, Mingyu ^{[3
]}

Mahmoud, Marwa ^{[1
]}

机构：

[1] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England

[2] Univ Cambridge, Dept Engn, Cambridge, England

[3] Univ Oxford, Dept Phys, Oxford, England

[4] Tencent Jarvis Lab, Shenzhen, Peoples R China

来源：

2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020) | 2020年

关键词：

D O I：

10.1109/FG47880.2020.00033

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current methods for mental disorder recognition mostly depend on clinical interviews and self-reported scores that can be highly subjective. Building an automatic recognition system can help in early detection of symptoms and providing insights into the biological markers for diagnosis. It is, however, a challenging task as it requires taking into account indicators from different modalities, such as facial expressions, gestures, acoustic features and verbal content. To address this issue, we propose a general-purpose multimodal deep learning framework, in which multiple modalities - including acoustic, visual and textual features - are processed individually with the cross-modality correlation considered. Specifically, a Multimodal Deep Denoising Autoencoder (multi-DDAE) is designed to obtain multimodal representations of audio-visual features followed by the Fisher Vector encoding which produces session-level descriptors. For textual modality, a Paragraph Vector (PV) is proposed to embed the transcripts of interview sessions into document representations capturing cues related to mental disorders. Following an early fusion strategy, both audio-visual and textual features are then fused prior to feeding them to a Multitask Deep Neural Network (DNN) as the final classifier. Our framework is evaluated on the automatic detection of two mental disorders: bipolar disorder (BD) and depression, using two datasets: Bipolar Disorder Corpus (BDC) and the Extended Distress Analysis Interview Corpus (E-DAIC), respectively. Our experimental evaluation results showed comparable performance to the state-of-the-art in BD and depression detection, thus demonstrating the effective multimodal representation learning and the capability to generalise across different mental disorders.

引用

页码：344 / 350

页数：7

共 50 条

[41] A Novel Multimodal Deep Learning Framework for Encrypted Traffic Classification
Lin, Peng
Ye, Kejiang
Hu, Yishen
Lin, Yanying
Xu, Cheng-Zhong
IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (03) : 1369 - 1384
[42] Modeling Mental Stress Using a Deep Learning Framework
Masood, Khalid
Alghamdi, Mohammed A.
IEEE ACCESS, 2019, 7 : 68446 - 68454
[43] A Framework for Pedestrian Attribute Recognition Using Deep Learning
Sakib, Saadman
Deb, Kaushik
Dhar, Pranab Kumar
Kwon, Oh-Jin
APPLIED SCIENCES-BASEL, 2022, 12 (02):
[44] A Deep Learning Framework for Grocery Product Detection and Recognition
Selvam, Prabu
Koilraj, Joseph Abraham Sundar
FOOD ANALYTICAL METHODS, 2022, 15 (12) : 3498 - 3522
[45] A Deep Learning Framework for Grocery Product Detection and Recognition
Prabu Selvam
Joseph Abraham Sundar Koilraj
Food Analytical Methods, 2022, 15 : 3498 - 3522
[46] Hybrid deep learning framework for human activity recognition
Pushpalatha, S.
Math, Shrishail
INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01): : 1225 - 1237
[47] A Deep Learning Framework for Joint Image Restoration and Recognition
Chen, Ruilong
Mihaylova, Lyudmila
Zhu, Hao
Bouaynaya, Nidhal Carla
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (03) : 1561 - 1580
[48] Lightweight Deep Learning Framework for Speech Emotion Recognition
Akinpelu, Samson
Viriri, Serestina
Adegun, Adekanmi
IEEE ACCESS, 2023, 11 : 77086 - 77098
[49] A Deep Learning Framework for Joint Image Restoration and Recognition
Ruilong Chen
Lyudmila Mihaylova
Hao Zhu
Nidhal Carla Bouaynaya
Circuits, Systems, and Signal Processing, 2020, 39 : 1561 - 1580
[50] An Automated Framework Based on Deep Learning for Shark Recognition
Nhat Anh Le
Moon, Jucheol
Lowe, Christopher G.
Kim, Hyun-Il
Choi, Sang-Il
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)

← 1 2 3 4 5 →