Using Prosodic and Spectral Features in Detecting Depression in Elderly Males

Cited by: 0
Authors
Sanchez, Michelle Hewlett [1, 2]
Vergyri, Dimitra [1 ]
Ferrer, Luciana [1 ]
Richey, Colleen [1 ]
Garcia, Pablo [3 ]
Knoth, Bruce [3 ]
Jarrold, William [4 ]
Affiliations
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
[2] Stanford Univ, Stanford, CA 94305 USA
[3] SRI Int, Robot & Med Syst Lab, Menlo Pk, CA 94025 USA
[4] Univ Calif Davis, Ctr Mind & Brain, Davis, CA 95616 USA
Keywords
Depression; Emotion Detection; Prosodic Features; Speech
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
As research in speech processing has matured, there has been much interest in paralinguistic speech processing problems including the speaker's mental and psychological health. In this study, we focus on speech features that can identify the speaker's emotional health, i.e., whether the speaker is depressed or not. We use prosodic speech measurements, such as pitch and energy, in addition to spectral features, such as formants and spectral tilt, and compute statistics of these features over different regions of the speech signal. These statistics are used as input features to a discriminative classifier that predicts the speaker's depression state. We find that with an N-fold leave-one-out cross-validation setup, we can achieve a prediction accuracy of 81.3%, where random guess is 50%.
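The pipeline outlined in the abstract (region-level statistics of frame-level measurements, fed to a classifier and scored with leave-one-out cross-validation) can be illustrated with a minimal sketch. Everything here is an assumption for illustration only: the pitch contours are made-up numbers rather than data from the study, the region boundaries and the choice of statistics are hypothetical, and a nearest-centroid rule stands in for the discriminative classifier, which this record does not specify.

```python
from statistics import mean, pstdev


def region_stats(contour, regions):
    """Return [mean, std, min, max] of a frame-level contour for each (start, end) region."""
    feats = []
    for start, end in regions:
        seg = contour[start:end]
        feats.extend([mean(seg), pstdev(seg), min(seg), max(seg)])
    return feats


def nearest_centroid_predict(train_x, train_y, x):
    """Stand-in classifier (an assumption): choose the label with the closest class centroid."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        rows = [r for r, y in zip(train_x, train_y) if y == label]
        centroid = [mean(col) for col in zip(*rows)]
        dist = sum((a - b) ** 2 for a, b in zip(x, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def leave_one_out_accuracy(xs, ys):
    """Hold out each sample once, train on the rest, and report the fraction predicted correctly."""
    correct = 0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        if nearest_centroid_predict(train_x, train_y, xs[i]) == ys[i]:
            correct += 1
    return correct / len(xs)


# Synthetic pitch contours in Hz (illustrative only, not drawn from the study).
pitch_contours = [
    [120, 135, 150, 128, 142, 160],
    [118, 140, 155, 125, 138, 158],
    [122, 138, 149, 130, 145, 162],
    [110, 112, 111, 109, 113, 110],
    [108, 110, 112, 111, 109, 112],
    [111, 113, 110, 112, 111, 114],
]
labels = [0, 0, 0, 1, 1, 1]   # hypothetical class labels for the toy speakers
regions = [(0, 3), (3, 6)]    # hypothetical frame ranges standing in for speech regions

features = [region_stats(c, regions) for c in pitch_contours]
accuracy = leave_one_out_accuracy(features, labels)
print(f"leave-one-out accuracy on toy data: {accuracy:.3f}")
```

With balanced classes, as in this toy set, a classifier that guesses at random would be expected to score about 50%, which is the baseline the abstract compares against.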
Pages: 3012 / +
Page count: 2
Related papers
50 in total
  • [31] Using prosodic features in language models for meetings
    Huang, Songfang
    Renals, Steve
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2008, 4892 : 192 - 203
  • [32] Prosodic and Spectral Features within Segment-based Acoustic Modeling
    Schuller, Bjoern
    Zhang, Xiaohua
    Rigoll, Gerhard
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2370 - 2373
  • [33] Speaker recognition using prosodic and lexical features
    Kajarekar, S
    Ferrer, L
    Venkataraman, A
    Sonmez, K
    Shriberg, E
    Stolcke, A
    Bratt, H
    Gadde, RR
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 19 - 24
  • [34] A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
    Zhou, Yu
    Li, Junfeng
    Sun, Yanqing
    Zhang, Jianping
    Yan, Yonghong
    Akagi, Masato
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (10) : 2813 - 2821
  • [35] PERFORMANCE ANALYSIS OF SPECTRAL AND PROSODIC FEATURES AND THEIR FUSION FOR EMOTION RECOGNITION IN SPEECH
    Gaurav, Manish
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 313 - 316
  • [36] EVALUATION OF MIMICKED SPEECH USING PROSODIC FEATURES
    Mary, Leena
    Babu, Anish K. K.
    Joseph, Aju
    George, Gibin M.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7189 - 7193
  • [37] Classification of lexical stress using spectral and prosodic features for computer-assisted language learning systems
    Ferrer, Luciana
    Bratt, Harry
    Richey, Colleen
    Franco, Horacio
    Abrash, Victor
    Precoda, Kristin
    SPEECH COMMUNICATION, 2015, 69 : 31 - 45
  • [38] Detecting Symptoms of Dementia in Elderly Persons using Features of Pupil Light Reflex
    Nakayama, Minoru
    Nowak, Wioletta
    Zarowska, Anna
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 745 - 749
  • [39] Detecting Depression in Videos using Uniformed Local Binary Pattern on Facial Features
    Dadiz, Bryan G.
    Ruiz, Conrado R.
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, 2019, 481 : 413 - 422
  • [40] Effectiveness of Voice Quality Features in Detecting Depression
    Afshan, Amber
    Guo, Jinxi
    Park, Soo Jin
    Ravi, Vijay
    Flint, Jonathan
    Alwan, Abeer
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1676 - 1680