Hidden Markov model-based speech emotion recognition

Cited by: 0

Authors:

Schuller, B [1]

Rigoll, G [1]

Lang, M [1]

Affiliations:

[1] Tech Univ Munich, Inst Human Comp Commun, D-8000 Munich, Germany

Source:

2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING | 2003

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

In this contribution we introduce speech emotion recognition using continuous hidden Markov models. Two methods are proposed and compared throughout the paper. In the first method, global statistics of an utterance are classified by Gaussian mixture models, using features derived from the raw pitch and energy contours of the speech signal. The second method introduces increased temporal complexity by applying continuous hidden Markov models with several states, using low-level instantaneous features instead of global statistics. The paper addresses the design of working recognition engines and the results achieved with both alternatives. A speech corpus consisting of acted and spontaneous emotion samples in German and English is described in detail. Both engines were trained and tested on this same speech corpus. Recognition of seven discrete emotions exceeded an 86% recognition rate. As a basis of comparison, the judgments of human listeners classifying the same corpus, at a 79.8% recognition rate, were analyzed.

Number of pages: 4
