Depression Symptom Identification Through Acoustic Speech Analysis: A Transfer Learning Approach

被引:1
|
作者
Narayanrao, Purude Vaishali [1 ,2 ]
Kohirker, Kshiraja [3 ]
Preeth, Tadakamalla Shyam [3 ]
Kumari, P. Lalitha Surya [1 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Hyderabad 500075, Telangana, India
[2] Neil Gogte Inst Technol, Dept CSE, Hyderabad 500039, Telangana, India
[3] Neil Gogte Inst Technol, Dept CSM, Hyderabad 500039, Telangana, India
关键词
depression transfer learning (TL) grid; search (GS) speech analysis Multi-Layer; Perceptron (MLP) classifier;
D O I
10.18280/ts.410113
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of mental health diagnostics, the acoustic characteristics of speech have been recognized as potent markers for the identification of depressive symptoms. This study harnesses the power of transfer learning (TL) to discern depression -related sentiments from speech. Acoustic features such as rhythm, pitch, and tone form the core of this analysis. The methodology unfolds in three distinct phases. Initially, a Multi -Layer Perceptron (MLP) network employing stochastic gradient descent is applied to the RAVDESS dataset, yielding an accuracy of 65%. This finding catalyzes the second phase, wherein a comprehensive hyperparameter optimization via grid search (GS) is conducted on the MLP Classifier. This step primarily focuses on detecting emotions commonly associated with depression, including neutrality, sadness, anger, fear, and disgust. The optimized MLP classifier indicates an improved accuracy of 71%. In the final phase, to enhance precision further, the same GS -based model, underpinned by TL principles, is applied to the TASS dataset. This application astonishingly achieves an accuracy of 99.80%, suggesting a high risk of depression. This comparative study establishes the proposed framework as a vanguard in the application of TL for depression prediction, showcasing a significant leap in accuracy over previous methodologies.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 50 条
  • [31] Vocal acoustic analysis and machine learning for the identification of schizophrenia
    Espinola C.W.
    Gomes J.C.
    Pereira J.M.S.
    dos Santos W.P.
    Research on Biomedical Engineering, 2021, 37 (01) : 33 - 46
  • [32] NON-NATIVE CHILDREN SPEECH RECOGNITION THROUGH TRANSFER LEARNING
    Matassoni, Marco
    Gretter, Roberto
    Falavigna, Daniele
    Giuliani, Diego
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6229 - 6233
  • [33] Indoor Multiperson Detection and Recognition Through Footsteps: A Deep Learning Approach With Acoustic Signal Analysis
    Qu, Yuanying
    Shi, Liming
    Wang, Xinheng
    Wang, Zhi
    IEEE SENSORS JOURNAL, 2024, 24 (12) : 19482 - 19496
  • [34] Predictive utility of symptom measures in classifying anxiety and depression: A machine-learning approach
    Liu, Kevin
    Droncheff, Brian
    Warren, Stacie L.
    PSYCHIATRY RESEARCH, 2022, 312
  • [35] Deep feature fusion for hate speech detection: a transfer learning approach
    Vishwajeet Dwivedy
    Pradeep Kumar Roy
    Multimedia Tools and Applications, 2023, 82 : 36279 - 36301
  • [36] Deep feature fusion for hate speech detection: a transfer learning approach
    Dwivedy, Vishwajeet
    Roy, Pradeep Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (23) : 36279 - 36301
  • [37] A Hate Speech Detection Approach Using Transfer Learning with Multiple Idioms
    de Oliveira, Aillkeen Bezerra
    de Souza Baptista, Claudio
    Firmino, Anderson Almeida
    de Paiva, Anselmo Cardoso
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2023, PT I, 2024, 518 : 144 - 160
  • [38] Alzheimer Disease Classification through Transfer Learning Approach
    Raza, Noman
    Naseer, Asma
    Tamoor, Maria
    Zafar, Kashif
    DIAGNOSTICS, 2023, 13 (04)
  • [39] Learning acoustic responses from experiments: A multiscale-informed transfer learning approach
    Van Hai Trinh
    Guilleminot, Johann
    Perrot, Camille
    Viet Dung Vu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (04): : 2587 - 2601
  • [40] PRIVACY SENSITIVE SPEECH ANALYSIS USING FEDERATED LEARNING TO ASSESS DEPRESSION
    Suhas, B. N.
    Abdullah, Saeed
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6272 - 6276