Human Emotional States Classification Based upon Changes in Speech Production Features in Vowel Regions

被引:0
|
作者
Mohanta, Abhijit [1 ]
Mittal, Vinay Kumar [1 ]
机构
[1] Indian Inst Informat Technol Chittoor, Sri City, AP, India
关键词
F0; Formants; SVM; LP spectrum; LP residual; LINEAR PREDICTION; TUTORIAL;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech signal, that is produced by the human speech production system, carries emotions that the humans can perceive easily. In this paper, we aim to classify the four basic emotions, namely happy, anger, fear and neutral, by analyzing changes in the speech production features in the vowel regions of speech signal. A Telugu emotional speech database is used. Changes in two production features, the instantaneous fundamental frequency (F0) as the source feature, and first three Formants (F1, F2, and F3) as the filter (system) features, are examined in each case. These features are derived from the speech signal using the signal processing methods, i.e., linear prediction (LP) residual and LP spectrum, respectively. The features are examined in the speech segments of five Telugu vowels that have corresponding English vowels, i.e., /a/, /e/, /i/, /o/, and /u/. These vowel regions in the speech signal are detected manually. Further, the classification of emotional states is carried out using a Support Vector Machine (SVM) classifier. The results indicate that in the case of anger emotional state for both male and female speakers, the vowels /a/ and /e/ have higher mean F0 value, as compared to mean F0 for happy, fear and neutral states. Also, the classification accuracy of SVM classifier is observed to be highest for happy emotional state, and lowest for fear emotional state. This insight should be helpful in developing other diverse applications on emotional speech.
引用
收藏
页码:172 / 177
页数:6
相关论文
共 50 条
  • [41] Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone speech
    Lee, Jung-Won
    Choi, Jeung-Yoon
    Kang, Hong-Goo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (02): : 1536 - 1546
  • [42] Methods for stress classification: Nonlinear TEO and linear speech based features
    Zhou, GJ
    Hansen, JHL
    Kaiser, JF
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2087 - 2090
  • [43] Cross-covariance-based features for speech classification in film audio
    Benatan, Matt
    Ng, Kia
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2015, 31 : 215 - 221
  • [44] Classification of EEG Based Imagine Speech Using Time Domain Features
    Paul, Yogesh
    Jaswal, Ram Avtar
    Kajal, Sanjay
    2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 2921 - 2924
  • [45] GMM Based Classification of Speech under Stress Using Physical Features
    Yao, Xiao
    Xu, Ning
    Gao, Mingsheng
    Jiang, Aiming
    Liu, Xiaofeng
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 379 - 384
  • [46] Knowledge-Based Features for Speech Analysis and Classification: Pronunciation Diagnoses
    Liu, Lichuan
    Li, Wei
    Morris, Sherrill
    Zhuang, Mutian
    ELECTRONICS, 2023, 12 (09)
  • [47] Part of Speech Features for Sentiment Classification based on Latent Dirichlet Allocation
    Usop, Eka Surya
    Isnanto, R. Rizal
    Kusumaningrum, Retno
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, COMPUTER, AND ELECTRICAL ENGINEERING (ICITACEE), 2017, : 31 - 34
  • [48] A Novel Approach for Classification of Speech Emotions Based on Deep and Acoustic Features
    Er, Mehmet Bilal
    IEEE ACCESS, 2020, 8 : 221640 - 221653
  • [49] Optimized discriminative transformations for speech features based on minimum classification error
    Zamani, Behzad
    Akbari, Ahmad
    Nasersharif, Babak
    Jalalvand, Azarakhsh
    PATTERN RECOGNITION LETTERS, 2011, 32 (07) : 948 - 955
  • [50] Speech emotion classification using fractal dimension-based features
    Tamulevicius, Gintautas
    Karbauskaite, Rasa
    Dzemyda, Gintautas
    NONLINEAR ANALYSIS-MODELLING AND CONTROL, 2019, 24 (05): : 679 - 695