VOICE SOURCE FEATURES FOR COGNITIVE LOAD CLASSIFICATION

被引:0
|
作者
Yap, Tet Fei [1 ]
Epps, Julien [1 ]
Ambikairajah, Eliathamby [1 ]
Choi, Eric H. C.
机构
[1] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
关键词
cognitive load; voice source features; GMM classification; voice quality;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Previous work in speech-based cognitive load classification has shown that the glottal source contains important information for cognitive load discrimination. However, the reliability of glottal flow features depends on the accuracy of the glottal flow estimation, which is a non-trivial process. In this paper, we propose the use of acoustic voice source features extracted directly from the speech spectrum (or cepstrum) for cognitive load classification. We also propose pre- and post-processing techniques to improve the estimation of the cepstral peak prominence (CPP). 3-class classification results on two databases showed CPP as a promising cognitive load classification feature that outperforms glottal flow features. Score-level fusion of the CPP-based classification system with a formant frequency-based system yielded a final improved accuracy of 62.7%, suggesting that CPP contains useful voice source information that complements the information captured by vocal tract features.
引用
收藏
页码:5700 / 5703
页数:4
相关论文
共 50 条
  • [31] Formant Frequencies under Cognitive Load: Effects and Classification
    Tet Fei Yap
    Julien Epps
    Eliathamby Ambikairajah
    Eric H. C. Choi (EURASIP Member)
    EURASIP Journal on Advances in Signal Processing, 2011
  • [32] Automatic classification of neurological voice disorders using wavelet scattering features
    Yagnavajjula, Madhu Keerthana
    Mittapalle, Kiran Reddy
    Alku, Paavo
    Rao, K. Sreenivasa
    Mitra, Pabitra
    SPEECH COMMUNICATION, 2024, 157
  • [33] Modal and Nonmodal Voice Quality Classification Using Acoustic and Electroglottographic Features
    Borsky, Michal
    Mehta, Daryush D.
    Van Stan, Jarrad H.
    Gudnason, Jon
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (12) : 2281 - 2291
  • [34] Voice Source Correlates of Prosodic Features in American English: A Pilot Study
    Iseli, Markus
    Shue, Yen-Liang
    Epstein, Melissa A.
    Keating, Patricia
    Kreiman, Jody
    Alwan, Abeer
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2226 - +
  • [35] ON THE RELATION BETWEEN VOICE SOURCE PARAMETERS AND PROSODIC FEATURES IN CONNECTED SPEECH
    STRIK, H
    BOVES, L
    SPEECH COMMUNICATION, 1992, 11 (2-3) : 167 - 174
  • [36] ICA-FX features for classification of singing voice and instrumental sound
    Leung, TW
    Ngo, CW
    Lau, RWH
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 367 - 370
  • [37] Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification
    Florian Eyben
    Gláucia L Salomão
    Johan Sundberg
    Klaus R Scherer
    Björn W Schuller
    EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [38] Cognitive Impairment Classification Prediction Model Using Voice Signal Analysis
    Sung, Sang-Ha
    Hong, Soongoo
    Kim, Jong-Min
    Kang, Do-Young
    Park, Hyuntae
    Kim, Sangjin
    ELECTRONICS, 2024, 13 (18)
  • [39] Using ASR Posterior Probability and Acoustic Features for Voice Disorder Classification
    Tulics, Miklos Gabriel
    Szaszak, Gyorgy
    Meszaros, Krisztina
    Vicsi, Klara
    2020 11TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2020), 2020, : 155 - 159
  • [40] Wavelet sub-band features for voice disorder detection and classification
    Gidaye, Girish
    Nirmal, Jagannath
    Ezzine, Kadria
    Frikha, Mondher
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 28499 - 28523