Depression detection using cascaded attention based deep learning framework using speech data

Authors
Gupta, Sachi [1]
Agarwal, Gaurav [2]
Agarwal, Shivani [3]
Pandey, Dilkeshwar [4]
Affiliations
[1] Galgotias Coll Engn & Technol, Dept Comp Sci & Engn, Greater Noida 201310, Uttar Pradesh, India
[2] Galgotias Univ, Sch Comp Sci & Engn, Gr Noida 203201, Uttar Pradesh, India
[3] Ajay Kumar Garg Engn Coll, Dept Informat Technol, Ghaziabad 201009, Uttar Pradesh, India
[4] KIET Grp Inst, Dept Comp Sci & Engn, Ghaziabad 201206, Uttar Pradesh, India
Keywords
Speech signals; Multi-stage Discrete Wavelet Transform; Auction Optimization; Deep convolutional Attention; Depression; Non-depression
DOI
10.1007/s11042-023-18076-w
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Efficient detection of depression is a challenging problem in speech signal processing. Since speech signals can provide a better diagnosis of depression, a reliable detection methodology is required. Manual examination performed by radiologists can be time-consuming and may not be feasible in complex circumstances. Diverse detection methodologies have been proposed previously, but they have been found to be less accurate, time-consuming, and prone to high error rates. This article presents an effective, automatic deep learning-based method for depression detection using speech signal data. The steps involved in depression prediction are data acquisition, pre-processing, feature extraction, feature selection, and classification. The first step is data acquisition, which collects speech signals from the Distress Analysis Interview Corpus (DAIC-WOZ) and Sonde Health-free speech (SH2-FS) datasets. The collected data are pre-processed with a Multi-stage Discrete Wavelet Transform (MS_DWT) to remove noise and improve signal quality. The relevant features are then extracted: the Hilbert-Huang (H-H) transform, linear prediction cepstrum coefficients (LPCC), fundamental frequency, formants, speaking rate, and Mel frequency cepstral coefficients (MFCC). From the extracted features, those best suited to improving detection accuracy are selected using the Price Auction optimization algorithm (PAOA). Finally, the depression and non-depression states are classified using a deep convolutional Attention Cascaded two directional long short-term memory (DAttn_Conv 2D LSTM) network with a softmax classifier. The overall accuracy obtained in classifying the depressed and non-depressed classes is 97.82% and 98.91%, respectively.
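The pre-processing step in the abstract, multi-stage wavelet denoising, can be illustrated with a minimal sketch. The abstract does not state the wavelet family, the number of stages, or the thresholding rule, so the Haar wavelet, the three stages, the soft threshold, and every function name below are assumptions chosen for illustration, not the authors' MS_DWT implementation.

```python
import math

_S = 1 / math.sqrt(2)  # Haar normalization factor


def haar_dwt(signal):
    """One Haar DWT stage: return (approximation, detail) coefficients."""
    approx = [(signal[i] + signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Invert one Haar DWT stage."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * _S)
        out.append((a - d) * _S)
    return out


def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t (assumed denoising rule)."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


def multistage_dwt_denoise(signal, levels=3, threshold=0.1):
    """Decompose over several stages, threshold the details, reconstruct."""
    if len(signal) % (2 ** levels) != 0:
        raise ValueError("signal length must be divisible by 2**levels")
    approx = list(signal)
    details = []
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        details.append(soft_threshold(detail, threshold))
    # Rebuild from the coarsest stage back to the original resolution.
    for detail in reversed(details):
        approx = haar_idwt(approx, detail)
    return approx


# Hypothetical usage: a constant level with small alternating noise.
noisy = [1.05, 0.95] * 4
clean = multistage_dwt_denoise(noisy, levels=3, threshold=0.1)
```

With a threshold of zero the function reconstructs its input, which is a quick check that the decomposition and reconstruction stages match. A real pipeline would more likely use a wavelet library and a data-driven threshold.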
Pages: 66135-66173
Page count: 39