An Enhanced Cross-Attention Based Multimodal Model for Depression Detection

被引:0
|
作者
Kou, Yifan [1 ]
Ge, Fangzhen [1 ,2 ]
Chen, Debao [2 ,3 ]
Shen, Longfeng [1 ,2 ,4 ]
Liu, Huaiyu [1 ]
机构
[1] School of Computer Science and Technology, Huaibei Normal University, Huaibei, China
[2] Anhui Engineering Research Center for Intelligent Computing and Application on Cognitive Behavior (ICACB), Anhui, Huaibei, China
[3] School of Physics and Electronic Information, Huaibei Normal University, Huaibei, China
[4] Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China
基金
中国国家自然科学基金;
关键词
Deep learning - Neural networks;
D O I
10.1111/coin.70019
中图分类号
学科分类号
摘要
Depression, a prevalent mental disorder in modern society, significantly impacts people's daily lives. Recently, there have been advancements in developing automated diagnosis models for detecting depression. However, data scarcity, primarily due to privacy concerns, has posed a challenge. Traditional speech features have limitations in representing knowledge for depression diagnosis, and the complexity of deep learning algorithms necessitates substantial data support. Furthermore, existing multimodal methods based on neural networks overlook the heterogeneity gap between different modalities, potentially resulting in redundant information. To address these issues, we propose a multimodal depression detection model based on the Enhanced Cross-Attention (ECA) Mechanism. This model effectively explores text-speech interactions while considering modality heterogeneity. Data scarcity has been mitigated by fine-tuning pre-trained models. Additionally, we design a modal fusion module based on ECA, which emphasizes similarity responses and updates the weight of each modal feature based on the similarity information between modal features. Furthermore, for speech feature extraction, we have reduced the computational complexity of the model by integrating a multi-window self-attention mechanism with the Fourier transform. The proposed model is evaluated on the public dataset, DAIC-WOZ, achieving an accuracy of 80.0% and an average F1 value improvement of 4.3% compared with relevant methods. © 2025 Wiley Periodicals LLC.
引用
收藏
相关论文
共 50 条
  • [1] Multimodal Cross-Attention Graph Network for Desire Detection
    Gu, Ruitong
    Wang, Xin
    Yang, Qinghong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 512 - 523
  • [2] A Multimodal Graph Recommendation Method Based on Cross-Attention Fusion
    Li, Kai
    Xu, Long
    Zhu, Cheng
    Zhang, Kunlun
    MATHEMATICS, 2024, 12 (15)
  • [3] An Enhanced Phrase Matching Method Based on Cross-Attention
    Pang, Guoqing
    Fu, Qiming
    Chen, Jianping
    Wang, Yunzhe
    Lu, You
    Wu, Hongjie
    Mathematical Problems in Engineering, 2023, 2023
  • [4] A cross-attention and Siamese network based model for off-topic detection
    Fan, Cong
    Guo, Shen
    Wumaier, Aishan
    Liu, Jiajun
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 770 - 777
  • [5] Design and Implementation of Attention Depression Detection Model Based on Multimodal Analysis
    Park, Junhee
    Moon, Nammee
    SUSTAINABILITY, 2022, 14 (06)
  • [6] CAF-RCNN: multimodal 3D object detection with cross-attention
    Liu, Junting
    Liu, Deer
    Zhu, Lei
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (19) : 6131 - 6146
  • [7] A Semi-Supervised Image Registration Framework Based on Multimodal Cross-Attention
    Zhao, Ming
    Liu, Jingyi
    Wu, Yan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [8] Synchronous composition and semantic line detection based on cross-attention
    Hou, Qinggang
    Ke, Yongzhen
    Wang, Kai
    Qin, Fan
    Wang, Yaoting
    MULTIMEDIA SYSTEMS, 2024, 30 (03)
  • [9] Web Semantic-Enhanced Multimodal Sentiment Analysis Using Multilayer Cross-Attention Fusion
    Liu, Yong
    Yu, Shiqiu
    International Journal on Semantic Web and Information Systems, 2024, 20 (01)
  • [10] Deception Detection System with Joint Cross-Attention
    Jiang, Peili
    Wang, Yunfan
    Li, Jiajun
    Wang, Ziyang
    PROCEEDINGS OF THE 2024 6TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING SYSTEMS, SSPS 2024, 2024, : 40 - 47