Glottal Source Features for Automatic Speech-based Depression Assessment

Cited by: 15
Authors
Simantiraki, Olympia [1]
Charonyktakis, Paulos [2]
Pampouchidou, Anastasia [3]
Tsiknakis, Manolis [4, 5]
Cooke, Martin [1]
Affiliations
[1] Univ Basque Country, Language & Speech Lab, Vitoria, Spain
[2] Gnosis Data Anal PC, Iraklion, Greece
[3] Univ Burgundy, Le2i Lab, Le Creusot, France
[4] Technol Educ Inst Greece, Iraklion, Greece
[5] FORTH, Iraklion, Greece
Keywords
glottal source; Phase Distortion Deviation; binary classification; machine learning
DOI
10.21437/Interspeech.2017-1251
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Depression is one of the most prominent mental disorders, with an increasing rate that makes it the fourth leading cause of disability worldwide. The field of automated depression assessment has emerged to aid clinicians in the form of a decision support system. Such a system could serve as a pre-screening tool, or even as a means of monitoring high-risk populations. Related work most commonly involves multimodal approaches, typically combining audio and visual signals to identify depression presence and/or severity. The current study explores categorical assessment of depression using audio features alone. Specifically, since depression-related vocal characteristics impact the glottal source signal, we examine Phase Distortion Deviation, which has previously been applied to the recognition of voice qualities such as hoarseness, breathiness and creakiness, some of which are thought to be features of depressed speech. The proposed method uses as features the DCT coefficients of the Phase Distortion Deviation for each frequency band. An automated machine learning tool, Just Add Data, is used to classify speech samples. The method is evaluated on a benchmark dataset (AVEC2014) in two conditions: read speech and spontaneous speech. Our findings indicate that Phase Distortion Deviation is a promising audio-only feature for automated detection and assessment of depressed speech.
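The feature step described in the abstract (DCT coefficients of the Phase Distortion Deviation per frequency band) can be illustrated with a minimal sketch. Several things here are assumptions rather than details from the paper: the DCT is taken along the time axis of each band's trajectory, an unnormalized DCT-II is used, the number of retained coefficients (`num_coeffs`) is arbitrary, and the input matrix `pdd_example` is synthetic placeholder data. Computing the Phase Distortion Deviation itself is not shown, and the function names are hypothetical.

```python
import math


def dct2(signal, num_coeffs):
    """Return the first num_coeffs unnormalized DCT-II coefficients of signal."""
    n = len(signal)
    return [
        sum(x * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
            for i, x in enumerate(signal))
        for k in range(num_coeffs)
    ]


def band_dct_features(pdd, num_coeffs=3):
    """Build a flat feature vector from a frames-by-bands matrix.

    pdd is a list of frames, each frame a list with one value per
    frequency band. For every band, the trajectory over time is
    summarized by its first num_coeffs DCT-II coefficients
    (an assumed summarization, not necessarily the paper's exact one).
    """
    num_bands = len(pdd[0])
    features = []
    for band in range(num_bands):
        trajectory = [frame[band] for frame in pdd]
        features.extend(dct2(trajectory, num_coeffs))
    return features


# Synthetic placeholder: 4 frames, 2 bands (not real PDD values).
pdd_example = [
    [0.5, 1.0],
    [0.5, 2.0],
    [0.5, 3.0],
    [0.5, 4.0],
]

features = band_dct_features(pdd_example, num_coeffs=2)
print(features)
```

In this sketch the constant first band yields a zero first-order coefficient, while the rising second band yields a non-zero one, which is the kind of compact trajectory summary a classifier could consume as a fixed-length vector.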
Pages: 2700-2704
Number of pages: 5