Design of Hierarchical Classifier to Improve Speech Emotion Recognition

被引:0
|
作者
Vasuki, P. [1 ]
机构
[1] Sri Sivasubramaniya Nadar Coll Engn, Dept IT, Chennai 603110, Tamil Nadu, India
来源
关键词
Speech emotion recognition; hierarchical classifier design; ensemble; emotion speech corpora; FEATURES;
D O I
10.32604/csse.2023.024441
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Speech Emotion Recognition (SER) is used to recognize emotion from speech automatically. Speech Emotion recognition is working well in a laboratory environment but real-time emotion recognition has been influenced by the variations in gender, age, the cultural and acoustical background of the speaker. The acoustical resemblance between emotional expressions further increases the complexity of recognition. Many recent research works are concentrated to address these effects individually. Instead of addressing every influencing attribute individually, we would like to design a system, which reduces the effect that arises on any factor. We propose a two-level Hierarchical classifier named Interpreter of responses (IR). The first level of IR has been realized using Support Vector Machine (SVM) and Gaussian Mixer Model (GMM) classifiers. In the second level of IR, a discriminative SVM classifier has been trained and tested with meta information of first-level classifiers along with the input acoustical feature vector which is used in primary classifiers. To train the system with a corpus of versatile nature, an integrated emotion corpus has been composed using emotion samples of 5 speech corpora, namely; EMO-DB, IITKGP-SESC, SAVEE Corpus, Spanish emotion corpus, CMU's Woogle corpus. The hierarchical classifier has been trained and tested using MFCC and Low-Level Descriptors (LLD). The empirical analysis shows that the proposed classifier outperforms the traditional classifiers. The proposed ensemble design is very generic and can be adapted even when the number and nature of features change. The first-level classifiers GMM or SVM may be replaced with any other learning algorithm.
引用
收藏
页码:19 / 33
页数:15
相关论文
共 50 条
  • [1] Hierarchical classifier design for speech emotion recognition in the mixed-cultural environment
    Vasuki, P.
    Aravindan, Chandrabose
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2021, 33 (03) : 451 - 466
  • [2] CLASSIFIER FUSION FOR EMOTION RECOGNITION FROM SPEECH
    Scherer, Stefan
    Schwenker, Friedhelm
    Palm, Guenther
    [J]. ADVANCED INTELLIGENT ENVIRONMENTS, 2009, : 95 - 117
  • [3] Hierarchical framework for speech emotion recognition
    You, Mingyu
    Chen, Chun
    Bu, Jiajun
    Liu, Jia
    Tao, Jianhua
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, VOLS 1-7, 2006, : 515 - +
  • [4] Multi-Classifier Speech Emotion Recognition System
    Partila, Pavol
    Tovarek, Jaromir
    Voznak, Miroslav
    Rozhon, Jan
    Sevcik, Lukas
    Baran, Remigiusz
    [J]. 2018 26TH TELECOMMUNICATIONS FORUM (TELFOR), 2018, : 416 - 419
  • [5] Learning Spontaneity to Improve Emotion Recognition in Speech
    Mangalam, Karttikeya
    Guha, Tanaya
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 946 - 950
  • [6] Improvement Of Speech Emotion Recognition with Neural Network Classifier by Using Speech Spectrogram
    Prasomphan, Sathit
    [J]. 2015 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2015), 2015, : 73 - 76
  • [7] Hierarchical sparse coding framework for speech emotion recognition
    Torres-Boza, Diana
    Oveneke, Meshia Cedric
    Wang, Fengna
    Jiang, Dongmei
    Verhelst, Werner
    Sahli, Hichem
    [J]. SPEECH COMMUNICATION, 2018, 99 : 80 - 89
  • [8] A Hierarchical Classification Scheme for Efficient Speech Emotion Recognition
    Heracleous, Panikos
    Takai, Kohichi
    Yasuda, Keiji
    Yoneyama, Akio
    [J]. HCI INTERNATIONAL 2021 - LATE BREAKING POSTERS, HCII 2021, PT II, 2021, 1499 : 88 - 92
  • [9] Speech Emotion Recognition Using Multi-Layer Perceptron Classifier
    Yuan, Xiaochen
    Wong, Wai Pang
    Lam, Chan Tong
    [J]. 2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 644 - 648
  • [10] Multi-Classifier Interactive Learning for Ambiguous Speech Emotion Recognition
    Zhou, Ying
    Liang, Xuefeng
    Gu, Yu
    Yin, Yifei
    Yao, Longshan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 695 - 705