Attention-based acoustic feature fusion network for depression detection

被引:0
|
作者
Xu, Xiao [1 ,2 ]
Wang, Yang [1 ,3 ]
Wei, Xinru [2 ]
Wang, Fei [1 ,3 ]
Zhang, Xizhe [1 ,2 ]
机构
[1] Nanjing Med Univ, Affiliated Brain Hosp, Dept Psychiat, Early Intervent Unit, Nanjing 210029, Peoples R China
[2] Nanjing Med Univ, Sch Biomed Engn & Informat, Nanjing 211166, Peoples R China
[3] Nanjing Med Univ, Funct Brain Imaging Inst, Nanjing 210029, Peoples R China
关键词
Speech; Feature Fusion; Depression Detection; Deep Neural Networks; CLINICAL DEPRESSION; SPEECH; PHQ-9; RECOGNITION; VALIDATION; SEVERITY; SCALE;
D O I
10.1016/j.neucom.2024.128209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression, a common mental disorder, significantly influences individuals and imposes considerable societal impacts. The complexity and heterogeneity of the disorder necessitate prompt and effective detection, which nonetheless, poses a difficult challenge. This situation highlights an urgent requirement for improved detection methods. Exploiting auditory data through advanced machine learning paradigms presents promising research directions. Yet, existing techniques mainly rely on single-dimensional feature models, potentially neglecting the abundance of information hidden in various speech features. To rectify this, we present the novel Attention-Based Acoustic Feature Fusion Network (ABAFnet) for depression detection. ABAFnet combines four different acoustic features into a comprehensive deep neural network, thereby effectively integrating and blending multi-tiered features. We present a novel Type-Adaptive CNN for feature process, a LSTM-Attention Mechanism for features' temporal-spatial computation, and a Dynamic Weight Adjustment module for Linear Late Fusion Network that boosts performance by efficaciously synthesizing these features. The effectiveness of our approach is confirmed via extensive validation on two novel speech databases, CNRAC and CS-NRAC, thereby outperforming previous methods in depression detection and subtype classification. Further in-depth analysis confirms the key role of each feature and highlights the importance of MFCC-related features in speech-based depression detection (SDD).
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Attention-Based Scene Text Detection on Dual Feature Fusion
    Li, Yuze
    Silamu, Wushour
    Wang, Zhenchao
    Xu, Miaomiao
    [J]. SENSORS, 2022, 22 (23)
  • [2] TAFFNet: Two-Stage Attention-Based Feature Fusion Network for Surface Defect Detection
    Cao, Jingang
    Yang, Guotian
    Yang, Xiyun
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (12): : 1531 - 1544
  • [3] TAFFNet: Two-Stage Attention-Based Feature Fusion Network for Surface Defect Detection
    Jingang Cao
    Guotian Yang
    Xiyun Yang
    [J]. Journal of Signal Processing Systems, 2022, 94 : 1531 - 1544
  • [4] Multilayer feature fusion and attention-based network for crops and weeds segmentation
    Wang, Haoyu
    Song, Haiyu
    Wu, Haiyan
    Zhang, Zhiqiang
    Deng, Shengchun
    Feng, Xiaoqing
    Chen, Yanhong
    [J]. JOURNAL OF PLANT DISEASES AND PROTECTION, 2022, 129 (06) : 1475 - 1489
  • [5] Semantic attention-based heterogeneous feature aggregation network for image fusion
    Ruan, Zhiqiang
    Wan, Jie
    Xiao, Guobao
    Tang, Zhimin
    Ma, Jiayi
    [J]. PATTERN RECOGNITION, 2024, 155
  • [6] Multilayer feature fusion and attention-based network for crops and weeds segmentation
    Haoyu Wang
    Haiyu Song
    Haiyan Wu
    Zhiqiang Zhang
    Shengchun Deng
    Xiaoqing Feng
    Yanhong Chen
    [J]. Journal of Plant Diseases and Protection, 2022, 129 : 1475 - 1489
  • [7] Attention-Based Multiscale Feature Fusion for Efficient Surface Defect Detection
    Zhao, Yuhao
    Liu, Qing
    Su, Hu
    Zhang, Jiabin
    Ma, Hongxuan
    Zou, Wei
    Liu, Song
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 10
  • [8] Detection of Atrial Fibrillation based on Feature Fusion using Attention-based BiLSTM
    Xie, Weifang
    Chen, Cang
    Zhao, Ruijie
    Lu, Yu
    [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [9] Attention-based Feature Fusion Generative Adversarial Network for yarn-dyed fabric defect detection
    Zhang, Hongwei
    Qiao, Guanhua
    Lu, Shuai
    Yao, Le
    Chen, Xia
    [J]. TEXTILE RESEARCH JOURNAL, 2023, 93 (5-6) : 1178 - 1195
  • [10] Attention-based mechanism and feature fusion network for person re-identification
    An, Mingshou
    He, Yunchuan
    Lim, Hye-Youn
    Kang, Dae-Seong
    [J]. INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2024, 20 (01)