Fear emotion classification in speech by acoustic and behavioral cues

被引:5
|
作者
Yoon, Shin-ae [1 ]
Son, Guiyoung [1 ]
Kwon, Soonil [1 ]
机构
[1] Sejong Univ, Coll Software & Convergence Technol, Dept Software, 209 Neung Dong Ro, Seoul 05006, South Korea
关键词
Emotional speech classification; Emergency situation; Behavioral cue; Disfluency(interjection); Speech signal processing; VOCAL EXPRESSION; RECOGNITION; COMMUNICATION; DISCRETE; FEATURES;
D O I
10.1007/s11042-018-6329-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine-based emotional speech classification has become a requirement for natural and familiar human-computer interactions. Because emotional speech recognition systems use a person's voice to spontaneously detect their emotional status and take subsequent appropriate actions, they can be used widely for various reason in call centers and emotional based media services. Emotional speech recognition systems are primarily developed using emotional acoustic data. While there are several emotional acoustic databases available for emotion recognition systems in other countries, there is currently no real situational data related to the fear emotion available. Thus, in this study, we collected acoustic data recordings which represent real urgent and fearful situations from an emergency call center. To classify callers' emotions more accurately, we also included the additional behavioral feature interjection which can be classified as a type of disfluency which arises due to cognitive dysfunction observed in spontaneous speech when a speaker gets hyperemotional. We used Support Vector Machines (SVM), with the interjections feature, as well as conventionally used acoustic features (i.e., F0 variability, voice intensity variability, and Mel-Frequency Cepstral Coefficients; MFCCs) to identify which emotional category acoustic data fell into. The results of our study revealed that the MFCC was the best acoustic feature for spontaneous fear speech classification. In addition, we demonstrated the validity of behavioral features as an important criteria for emotional classification improvement.
引用
收藏
页码:2345 / 2366
页数:22
相关论文
共 50 条
  • [1] Fear emotion classification in speech by acoustic and behavioral cues
    Shin-ae Yoon
    Guiyoung Son
    Soonil Kwon
    Multimedia Tools and Applications, 2019, 78 : 2345 - 2366
  • [2] The Acoustic Cues of Fear: Investigation of Acoustic Parameters of Speech Containing Fear
    Ozseven, Turgut
    ARCHIVES OF ACOUSTICS, 2018, 43 (02) : 245 - 251
  • [3] Speech Emotion Classification using Acoustic Features
    Chen, Shizhe
    Jin, Qin
    Li, Xirong
    Yang, Gang
    Xu, Jieping
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 579 - 583
  • [4] Emotion in Speech: The Acoustic Attributes of Fear, Anger, Sadness, and Joy
    Christina Sobin
    Murray Alpert
    Journal of Psycholinguistic Research, 1999, 28 : 347 - 365
  • [5] Emotion in speech: The acoustic attributes of fear, anger, sadness, and joy
    Sobin, C
    Alpert, M
    JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 1999, 28 (04) : 347 - 365
  • [6] SPEAKER DEPENDENCY OF SPECTRAL FEATURES AND SPEECH PRODUCTION CUES FOR AUTOMATIC EMOTION CLASSIFICATION
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    Epps, Julien
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4693 - 4696
  • [7] The Great Terror, Fear, Emotion and Speech
    Korstanje, Maximiliano E.
    E-LATINA-REVISTA ELECTRONICA DE ESTUDIOS LATINOAMERICANOS, 2015, 13 (51): : 72 - 73
  • [8] DEEP ENCODED LINGUISTIC AND ACOUSTIC CUES FOR ATTENTION BASED END TO END SPEECH EMOTION RECOGNITION
    Bhosale, Swapnil
    Chakraborty, Rupayan
    Kopparapu, Sunil Kumar
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7189 - 7193
  • [9] Measuring the Randomness of Speech Cues for Emotion Recognition
    Susan, Seba
    Kaur, Amandeep
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 78 - 83
  • [10] Emotion Classification in Children's Speech Using Fusion of Acoustic and Linguistic Features
    Polzehl, Tim
    Sundaram, Shiva
    Ketabdar, Hamed
    Wagner, Michael
    Metze, Florian
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 340 - +