Attention-based acoustic feature fusion network for depression detection

被引:0
|
作者
Xu, Xiao [1 ,2 ]
Wang, Yang [1 ,3 ]
Wei, Xinru [2 ]
Wang, Fei [1 ,3 ]
Zhang, Xizhe [1 ,2 ]
机构
[1] Nanjing Med Univ, Affiliated Brain Hosp, Dept Psychiat, Early Intervent Unit, Nanjing 210029, Peoples R China
[2] Nanjing Med Univ, Sch Biomed Engn & Informat, Nanjing 211166, Peoples R China
[3] Nanjing Med Univ, Funct Brain Imaging Inst, Nanjing 210029, Peoples R China
关键词
Speech; Feature Fusion; Depression Detection; Deep Neural Networks; CLINICAL DEPRESSION; SPEECH; PHQ-9; RECOGNITION; VALIDATION; SEVERITY; SCALE;
D O I
10.1016/j.neucom.2024.128209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression, a common mental disorder, significantly influences individuals and imposes considerable societal impacts. The complexity and heterogeneity of the disorder necessitate prompt and effective detection, which nonetheless, poses a difficult challenge. This situation highlights an urgent requirement for improved detection methods. Exploiting auditory data through advanced machine learning paradigms presents promising research directions. Yet, existing techniques mainly rely on single-dimensional feature models, potentially neglecting the abundance of information hidden in various speech features. To rectify this, we present the novel Attention-Based Acoustic Feature Fusion Network (ABAFnet) for depression detection. ABAFnet combines four different acoustic features into a comprehensive deep neural network, thereby effectively integrating and blending multi-tiered features. We present a novel Type-Adaptive CNN for feature process, a LSTM-Attention Mechanism for features' temporal-spatial computation, and a Dynamic Weight Adjustment module for Linear Late Fusion Network that boosts performance by efficaciously synthesizing these features. The effectiveness of our approach is confirmed via extensive validation on two novel speech databases, CNRAC and CS-NRAC, thereby outperforming previous methods in depression detection and subtype classification. Further in-depth analysis confirms the key role of each feature and highlights the importance of MFCC-related features in speech-based depression detection (SDD).
引用
收藏
页数:14
相关论文
共 50 条
  • [41] AFF-Dehazing: Attention-based feature fusion network for low-light image Dehazing
    Zhou, Yu
    Chen, Zhihua
    Sheng, Bin
    Li, Ping
    Kim, Jinman
    Wu, Enhua
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2021, 32 (3-4)
  • [42] SALIENCY-GUIDED ATTENTION-BASED FEATURE PYRAMID NETWORK FOR SHIP DETECTION IN SAR IMAGES
    Zhang, Tianwen
    Zhang, Xiaoling
    Shao, Zikang
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 4950 - 4953
  • [43] Attention-Based Multiscale Feature Pyramid Network for Corn Pest Detection under Wild Environment
    Kang, Chenrui
    Jiao, Lin
    Wang, Rujing
    Liu, Zhigui
    Du, Jianming
    Hu, Haiying
    [J]. INSECTS, 2022, 13 (11)
  • [44] A novel attention-based feature learning and optimal deep learning approach for network intrusion detection
    Sakthi, K.
    Kumar, P. Nirmal
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) : 5123 - 5140
  • [45] Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery
    Chen, Luchang
    Liu, Chunsheng
    Chang, Faliang
    Li, Shuang
    Nie, Zhaoying
    [J]. NEUROCOMPUTING, 2021, 451 : 67 - 80
  • [46] Chatter detection in milling of thin-walled parts using multi-channel feature fusion and temporal attention-based network
    Han Z.
    Zhuo Y.
    Yan Y.
    Jin H.
    Fu H.
    [J]. Mechanical Systems and Signal Processing, 2022, 179
  • [47] Chatter detection in milling of thin-walled parts using multi-channel feature fusion and temporal attention-based network
    Han, Zhenyu
    Zhuo, Yue
    Yan, Yizhao
    Jin, Hongyu
    Fu, Hongya
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 179
  • [48] An Attention-Based Network for Textured Surface Anomaly Detection
    Liu, Gaokai
    Yang, Ning
    Guo, Lei
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [49] Attention-based Deep Learning for Network Intrusion Detection
    Guo, Naiwang
    Tian, Yingjie
    Li, Fan
    Yang, Hongshan
    [J]. 2020 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING AND ARTIFICIAL INTELLIGENCE, 2020, 11584
  • [50] ABiFN: Attention-based bi-modal fusion network for object detection at night time
    Sai Charan, A.
    Jitesh, M.
    Chowdhury, M.
    Venkataraman, H.
    [J]. ELECTRONICS LETTERS, 2020, 56 (24) : 1309 - 1311