Multimodal Deep Learning Framework for Mental Disorder Recognition

被引:35
|
作者
Zhang, Ziheng [1 ,4 ]
Lin, Weizhe [2 ]
Liu, Mingyu [3 ]
Mahmoud, Marwa [1 ]
机构
[1] Univ Cambridge, Dept Comp Sci & Technol, Cambridge, England
[2] Univ Cambridge, Dept Engn, Cambridge, England
[3] Univ Oxford, Dept Phys, Oxford, England
[4] Tencent Jarvis Lab, Shenzhen, Peoples R China
关键词
D O I
10.1109/FG47880.2020.00033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current methods for mental disorder recognition mostly depend on clinical interviews and self-reported scores that can be highly subjective. Building an automatic recognition system can help in early detection of symptoms and providing insights into the biological markers for diagnosis. It is, however, a challenging task as it requires taking into account indicators from different modalities, such as facial expressions, gestures, acoustic features and verbal content. To address this issue, we propose a general-purpose multimodal deep learning framework, in which multiple modalities - including acoustic, visual and textual features - are processed individually with the cross-modality correlation considered. Specifically, a Multimodal Deep Denoising Autoencoder (multi-DDAE) is designed to obtain multimodal representations of audio-visual features followed by the Fisher Vector encoding which produces session-level descriptors. For textual modality, a Paragraph Vector (PV) is proposed to embed the transcripts of interview sessions into document representations capturing cues related to mental disorders. Following an early fusion strategy, both audio-visual and textual features are then fused prior to feeding them to a Multitask Deep Neural Network (DNN) as the final classifier. Our framework is evaluated on the automatic detection of two mental disorders: bipolar disorder (BD) and depression, using two datasets: Bipolar Disorder Corpus (BDC) and the Extended Distress Analysis Interview Corpus (E-DAIC), respectively. Our experimental evaluation results showed comparable performance to the state-of-the-art in BD and depression detection, thus demonstrating the effective multimodal representation learning and the capability to generalise across different mental disorders.
引用
收藏
页码:344 / 350
页数:7
相关论文
共 50 条
  • [41] A Novel Multimodal Deep Learning Framework for Encrypted Traffic Classification
    Lin, Peng
    Ye, Kejiang
    Hu, Yishen
    Lin, Yanying
    Xu, Cheng-Zhong
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (03) : 1369 - 1384
  • [42] Modeling Mental Stress Using a Deep Learning Framework
    Masood, Khalid
    Alghamdi, Mohammed A.
    IEEE ACCESS, 2019, 7 : 68446 - 68454
  • [43] A Framework for Pedestrian Attribute Recognition Using Deep Learning
    Sakib, Saadman
    Deb, Kaushik
    Dhar, Pranab Kumar
    Kwon, Oh-Jin
    APPLIED SCIENCES-BASEL, 2022, 12 (02):
  • [44] A Deep Learning Framework for Grocery Product Detection and Recognition
    Selvam, Prabu
    Koilraj, Joseph Abraham Sundar
    FOOD ANALYTICAL METHODS, 2022, 15 (12) : 3498 - 3522
  • [45] A Deep Learning Framework for Grocery Product Detection and Recognition
    Prabu Selvam
    Joseph Abraham Sundar Koilraj
    Food Analytical Methods, 2022, 15 : 3498 - 3522
  • [46] Hybrid deep learning framework for human activity recognition
    Pushpalatha, S.
    Math, Shrishail
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01): : 1225 - 1237
  • [47] A Deep Learning Framework for Joint Image Restoration and Recognition
    Chen, Ruilong
    Mihaylova, Lyudmila
    Zhu, Hao
    Bouaynaya, Nidhal Carla
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2020, 39 (03) : 1561 - 1580
  • [48] Lightweight Deep Learning Framework for Speech Emotion Recognition
    Akinpelu, Samson
    Viriri, Serestina
    Adegun, Adekanmi
    IEEE ACCESS, 2023, 11 : 77086 - 77098
  • [49] A Deep Learning Framework for Joint Image Restoration and Recognition
    Ruilong Chen
    Lyudmila Mihaylova
    Hao Zhu
    Nidhal Carla Bouaynaya
    Circuits, Systems, and Signal Processing, 2020, 39 : 1561 - 1580
  • [50] An Automated Framework Based on Deep Learning for Shark Recognition
    Nhat Anh Le
    Moon, Jucheol
    Lowe, Christopher G.
    Kim, Hyun-Il
    Choi, Sang-Il
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)