Depression detection using cascaded attention based deep learning framework using speech data

被引:0
|
作者
Gupta, Sachi [1 ]
Agarwal, Gaurav [2 ]
Agarwal, Shivani [3 ]
Pandey, Dilkeshwar [4 ]
机构
[1] Galgotias Coll Engn & Technol, Dept Comp Sci & Engn, Greater Noida 201310, Uttar Pradesh, India
[2] Galgotias Univ, Sch Comp Sci & Engn, Gr Noida 203201, Uttar Pradesh, India
[3] Ajay Kumar Garg Engn Coll, Dept Informat Technol, Ghaziabad 201009, Uttar Pradesh, India
[4] KIET Grp Inst, Dept Comp Sci & Engn, Ghaziabad 201206, Uttar Pradesh, India
关键词
Speech signals; Multi-stage Discrete Wavelet Transform; Auction Optimization; Deep convolutional Attention; Depression; And Non-depression;
D O I
10.1007/s11042-023-18076-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient detection of depression is a challenging scenario in the field of speech signal processing. Since the speech signals provide a better diagnosis of depression, a significant methodology is required for detection. However, manual examination performed by radiologists can be time-consuming and may not be feasible in complex circumstances. Diverse detection methodologies have been proposed previously, but they are found to be less accurate, time-consuming and lead over maximized error rates. The proposed research article presents an effective and automatic deep learning-based depression detection using speech signal data. The steps involved in depression prediction are data acquisition, pre-processing, Feature Extraction, Feature selection and classification. The initial step in depression detection is data acquisition, which aims at collecting speech signals from the Distress Analysis Interview Corpus (DAIC-WOZ) and Sonde Health-free speech (SH2-FS) datasets. The collected data are pre-processed through MS_DWT (Multi-stage Discrete Wavelet Transform) to offer noise-free signals and improved signal quality. The relevant features required for processing the speech signal are extracted through Hilbert Huang (H-H) transform linear prediction cepstrum coefficient (LPCC), fundamental frequency, formants, speaking rate and Mel frequency cepstral coefficients (MFCC). From the extracted features, ideal features required for enhancing the detection accuracy are selected using the Price Auction optimization algorithm (PAOA). Finally, the depression and non-depression states are classified using deep convolutional Attention Cascaded two directional long short-term memory (DAttn_Conv 2D LSTM) with a softmax classifier. The overall accuracy obtained in classifying the depressed and non-depressed classes is 97.82% and 98.91%, respectively.
引用
下载
收藏
页码:66135 / 66173
页数:39
相关论文
共 50 条
  • [1] Depression Detection Based on Hybrid Deep Learning SSCL Framework Using Self-Attention Mechanism: An Application to Social Networking Data
    Nadeem, Aleena
    Naveed, Muhammad
    Satti, Muhammad Islam
    Afzal, Hammad
    Ahmad, Tanveer
    Kim, Ki-Il
    SENSORS, 2022, 22 (24)
  • [2] Deep Learning for Depression Detection Using Twitter Data
    Khafaga, Doaa Sami
    Auvdaiappan, Maheshwari
    Deepa, K.
    Abouhawwash, Mohamed
    Karim, Faten Khalid
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (02): : 1301 - 1313
  • [3] Evaluation of deep learning-based depression detection using medical claims data
    Bertl, Markus
    Bignoumba, Nzamba
    Ross, Peeter
    Ben Yahia, Sadok
    Draheim, Dirk
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 147
  • [4] An evolutionary approach for depression detection from Twitter big data using a novel deep learning model with attention based feature learning mechanism
    Prabhakar, K.
    Kavitha, V
    AUTOMATIKA, 2024, 65 (02) : 441 - 453
  • [5] Low-loss data compression using deep learning framework with attention-based autoencoder
    Sriram, S.
    Chitra, P.
    Sankar, V. Vijay
    Abirami, S.
    Durai, S. J. Rethina
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2023, 26 (01) : 90 - 100
  • [6] Deep Learning-Based Melanoma Detection using Attention Maps
    Andleeb, Ifrah
    Elzein, Almiqdad
    Patel, Vaibhav Anilkumar
    Alginahi, Yasser M.
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [7] Study of Depression Detection using Deep Learning
    Sanyal, Hrithik
    Shukla, Sagar
    Agrawal, Rajneesh
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [8] Voice pathology detection on spontaneous speech data using deep learning models
    Farazi, Sahar
    Shekofteh, Yasser
    International Journal of Speech Technology, 2024, 27 (03) : 739 - 751
  • [9] COVID-19 Detection Systems Based on Speech and Image Data Using Deep Learning Algorithms
    Akhtar, Farooq
    Mahum, Rabbia
    Ragab, Adham E.
    Butt, Faisal Shafique
    El-Meligy, Mohammed A.
    Hassan, Haseeb
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [10] Visual Speech Detection using an Unsupervised Learning Framework
    Ahmad, Rameez
    Raza, Syed Paymaan
    Malik, Hafiz
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 525 - 528