Glottal Source Features for Automatic Speech-based Depression Assessment

被引:15
|
作者
Simantiraki, Olympia [1 ]
Charonyktakis, Paulos [2 ]
Pampouchidou, Anastasia [3 ]
Tsiknakis, Manolis [4 ,5 ]
Cooker, Martin [1 ]
机构
[1] Univ Basque Country, Language & Speech Lab, Vitoria, Spain
[2] Gnosis Data Anal PC, Iraklion, Greece
[3] Univ Burgundy, Le2i Lab, Le Creusot, France
[4] Technol Educ Inst Greece, Iraklion, Greece
[5] FORTH, Iraklion, Greece
关键词
glottal source; Phase Distortion Deviation; binary classification; machine learning;
D O I
10.21437/Interspeech.2017-1251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Depression is one of the most prominent mental disorders, with an increasing rate that makes it the fourth cause of disability worldwide. The field of automated depression assessment has emerged to aid clinicians in the form of a decision support system. Such a system could assist as a pre-screening tool, or even for monitoring high risk populations. Related work most commonly involves multimodal approaches, typically combining audio and visual signals to identify depression presence and/or severity. The current study explores categorical assessment of depression using audio features alone. Specifically, since depression-related vocal characteristics impact the glottal source signal, we examine Phase Distortion Deviation which has previously been applied to the recognition of voice qualities such as hoarseness, breathiness and creakiness, some of which are thought to be features of depressed speech. The proposed method uses as features DCT-coefficients of the Phase Distortion Deviation for each frequency band. An automated machine learning tool, Just Add Data, is used to classify speech samples. The method is evaluated on a benchmark dataset (AVEC2014), in two conditions: read-speech and spontaneous-speech. Our findings indicate that Phase Distortion Deviation is a promising audio-only feature for automated detection and assessment of depressed speech.
引用
收藏
页码:2700 / 2704
页数:5
相关论文
共 50 条
  • [41] Automatic speech-based assessment to discriminate Parkinson’s disease from essential tremor with a cross-language approach
    Cristian David Rios-Urrego
    Jan Rusz
    Juan Rafael Orozco-Arroyave
    [J]. npj Digital Medicine, 7
  • [42] Assessment of disordered voice based on an optimized glottal source model
    Boudjerda, Mounir
    Kacha, Abdellah
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (04) : 3201 - 3214
  • [43] Recursive ARMAX speech analysis based on a glottal source model with phase compensation
    Funaki, K
    Miyanaga, Y
    Tochinai, K
    [J]. SIGNAL PROCESSING, 1999, 74 (03) : 279 - 295
  • [44] Toward Knowledge-Driven Speech-Based Models of Depression: Leveraging Spectrotemporal Variations in Speech Vowels
    Feng, Kexin
    Feng, Kexin
    [J]. 2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22), 2022,
  • [45] A study of glottal waveform features for deceptive speech classification
    Torres, Juan F.
    Moore, Elliot, II
    Bryant, Ernest
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4489 - 4492
  • [46] Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives
    Popescu-Belis, Andrei
    Kilgour, Jonathan
    Poller, Peter
    Nanchen, Alexandre
    Boertjes, Erik
    de Wit, Joost
    [J]. SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 703 - 703
  • [47] Towards Glottal Source Controllability in Expressive Speech Synthesis
    Lorenzo-Trueba, Jaime
    Barra-Chicote, Roberto
    Raitio, Tuomo
    Obin, Nicolas
    Alku, Paavo
    Yamagishi, Junichi
    Montero, Juan M.
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1618 - 1621
  • [48] Speech-Based Home Automation System
    Fytrakis, Emmanouil
    Georgoulas, Ioannis
    Part, Jose
    Zhu, Yuting
    [J]. BRITISH HCI 2015, 2015, : 271 - 272
  • [49] Usability engineering of speech-based services
    Sidhu, CK
    Coyle, G
    [J]. BRITISH TELECOMMUNICATIONS ENGINEERING, 1996, 14 : 337 - 340
  • [50] Difficulties in Automatic Speech Recognition of Dysarthric Speakers and Implications for Speech-Based Applications Used by the Elderly: A Literature Review
    Young, Victoria
    Mihailidis, Alex
    [J]. ASSISTIVE TECHNOLOGY, 2010, 22 (02) : 99 - 112