Depression Symptom Identification Through Acoustic Speech Analysis: A Transfer Learning Approach

被引:1
|
作者
Narayanrao, Purude Vaishali [1 ,2 ]
Kohirker, Kshiraja [3 ]
Preeth, Tadakamalla Shyam [3 ]
Kumari, P. Lalitha Surya [1 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Hyderabad 500075, Telangana, India
[2] Neil Gogte Inst Technol, Dept CSE, Hyderabad 500039, Telangana, India
[3] Neil Gogte Inst Technol, Dept CSM, Hyderabad 500039, Telangana, India
关键词
depression transfer learning (TL) grid; search (GS) speech analysis Multi-Layer; Perceptron (MLP) classifier;
D O I
10.18280/ts.410113
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of mental health diagnostics, the acoustic characteristics of speech have been recognized as potent markers for the identification of depressive symptoms. This study harnesses the power of transfer learning (TL) to discern depression -related sentiments from speech. Acoustic features such as rhythm, pitch, and tone form the core of this analysis. The methodology unfolds in three distinct phases. Initially, a Multi -Layer Perceptron (MLP) network employing stochastic gradient descent is applied to the RAVDESS dataset, yielding an accuracy of 65%. This finding catalyzes the second phase, wherein a comprehensive hyperparameter optimization via grid search (GS) is conducted on the MLP Classifier. This step primarily focuses on detecting emotions commonly associated with depression, including neutrality, sadness, anger, fear, and disgust. The optimized MLP classifier indicates an improved accuracy of 71%. In the final phase, to enhance precision further, the same GS -based model, underpinned by TL principles, is applied to the TASS dataset. This application astonishingly achieves an accuracy of 99.80%, suggesting a high risk of depression. This comparative study establishes the proposed framework as a vanguard in the application of TL for depression prediction, showcasing a significant leap in accuracy over previous methodologies.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 50 条
  • [1] A Lightweight Machine Learning Approach to Detect Depression from Speech Analysis
    Verde, Laura
    Raimo, Gennaro
    Vitale, Federica
    Carbonaro, Bruno
    Cordasco, Gennaro
    Marrone, Stefano
    Esposito, Anna
    2021 IEEE 33RD INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2021), 2021, : 330 - 335
  • [2] ACOUSTIC ANALYSIS FOR SPEAKER IDENTIFICATION OF WHISPERED SPEECH
    Fan, Xing
    Hansen, John H. L.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5046 - 5049
  • [3] Analysis of acoustic space variability in speech affected by depression
    Cummins, Nicholas
    Sethu, Vidhyasaharan
    Epps, Julien
    Schnieder, Sebastian
    Krajewski, Jarek
    SPEECH COMMUNICATION, 2015, 75 : 27 - 49
  • [4] Probabilistic Acoustic Volume Analysis for Speech Affected by Depression
    Cummins, Nicholas
    Sethu, Vidhyasaharan
    Epps, Julien
    Krajewski, Jarek
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1238 - 1242
  • [5] Sound as a bell: a deep learning approach for health status classification through speech acoustic biomarkers
    Wang, Yanbing
    Wang, Haiyan
    Li, Zhuoxuan
    Zhang, Haoran
    Yang, Liwen
    Li, Jiarui
    Tang, Zixiang
    Hou, Shujuan
    Wang, Qi
    CHINESE MEDICINE, 2024, 19 (01):
  • [6] Transfer learning for acoustic modeling of noise robust speech recognition
    Yi J.
    Tao J.
    Liu B.
    Wen Z.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2018, 58 (01): : 55 - 60
  • [7] A NEW APPROACH FOR THE ACOUSTIC ANALYSIS OF THE SPEECH PATHOLOGY
    Ankishan, Haydar
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2017,
  • [9] An Effective Depression Diagnostic System Using Speech Signal Analysis Through Deep Learning Methods
    Verma, Aman
    Jain, Pooja
    Kumar, Tapan
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2023, 32 (02)
  • [10] Automatic gender identification through speech analysis
    Khosla, Anu
    Yadav, Devendra Kumar
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 375 - +