Depression Symptom Identification Through Acoustic Speech Analysis: A Transfer Learning Approach

被引:1
|
作者
Narayanrao, Purude Vaishali [1 ,2 ]
Kohirker, Kshiraja [3 ]
Preeth, Tadakamalla Shyam [3 ]
Kumari, P. Lalitha Surya [1 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Hyderabad 500075, Telangana, India
[2] Neil Gogte Inst Technol, Dept CSE, Hyderabad 500039, Telangana, India
[3] Neil Gogte Inst Technol, Dept CSM, Hyderabad 500039, Telangana, India
关键词
depression transfer learning (TL) grid; search (GS) speech analysis Multi-Layer; Perceptron (MLP) classifier;
D O I
10.18280/ts.410113
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of mental health diagnostics, the acoustic characteristics of speech have been recognized as potent markers for the identification of depressive symptoms. This study harnesses the power of transfer learning (TL) to discern depression -related sentiments from speech. Acoustic features such as rhythm, pitch, and tone form the core of this analysis. The methodology unfolds in three distinct phases. Initially, a Multi -Layer Perceptron (MLP) network employing stochastic gradient descent is applied to the RAVDESS dataset, yielding an accuracy of 65%. This finding catalyzes the second phase, wherein a comprehensive hyperparameter optimization via grid search (GS) is conducted on the MLP Classifier. This step primarily focuses on detecting emotions commonly associated with depression, including neutrality, sadness, anger, fear, and disgust. The optimized MLP classifier indicates an improved accuracy of 71%. In the final phase, to enhance precision further, the same GS -based model, underpinned by TL principles, is applied to the TASS dataset. This application astonishingly achieves an accuracy of 99.80%, suggesting a high risk of depression. This comparative study establishes the proposed framework as a vanguard in the application of TL for depression prediction, showcasing a significant leap in accuracy over previous methodologies.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 50 条
  • [41] LOW-RESOURCE LANGUAGE IDENTIFICATION FROM SPEECH USING TRANSFER LEARNING
    Feng, Kexin
    Chaspari, Theodora
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [42] A VARIATIONAL BAYESIAN APPROACH TO LEARNING LATENT VARIABLES FOR ACOUSTIC KNOWLEDGE TRANSFER
    Hu, Hu
    Siniscalchi, Sabato Marco
    Yang, Chao-Han Huck
    Lee, Chin-Hui
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1041 - 1045
  • [43] Enhancing multimodal depression diagnosis through representation learning and knowledge transfer
    Yang, Shanliang
    Cui, Lichao
    Wang, Lei
    Wang, Tao
    You, Jiebing
    HELIYON, 2024, 10 (04)
  • [44] An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition
    Wu, Bo
    Li, Kehuang
    Ge, Fengpei
    Huang, Zhen
    Yang, Minglei
    Siniscalchi, Sabato Marco
    Lee, Chin-Hui
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1289 - 1300
  • [45] Identification of relevant acoustic transfer paths for WT drivetrains with an operational transfer path analysis
    Schunemann, W.
    Schelenz, R.
    Jacobs, G.
    Vocaet, W.
    FORSCHUNG IM INGENIEURWESEN-ENGINEERING RESEARCH, 2021, 85 (02): : 345 - 351
  • [46] Open Vocabulary Keyword Spotting through Transfer Learning from Speech Synthesis
    Kesavaraj, V
    Vuppala, Anil
    2024 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM 2024, 2024,
  • [47] Atrial fibrillation identification based on a deep transfer learning approach
    Ghaffari, Ali
    Madani, Nasimalsadat
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2019, 5 (03):
  • [48] Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Linguistic Information Fusion
    Zhou, Ziang
    Xu, Yanze
    Li, Ming
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 149 - 161
  • [49] Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis
    Fu, Ruibo
    Tao, Jianhua
    Zheng, Yibin
    Wen, Zhengqi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 907 - 911
  • [50] A transfer learning approach for detecting offensive and hate speech on social media platforms
    Priyadarshini, Ishaani
    Sahu, Sandipan
    Kumar, Raghvendra
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 27473 - 27499