Performance Evaluation of HMM-Based Style Classification with a Small Amount of Training Data

被引：0

作者：

Tachibana, Makoto ^{[1
]}

Kawashima, Keigo ^{[1
]}

Yamagishi, Junichi ^{[1
]}

Kobayashi, Takao ^{[1
]}

机构：

[1] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Yokohama, Kanagawa 2268502, Japan

来源：

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年

关键词：

emotional speech; speaking style; speech emotion recognition; classification; MSD-HMM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes a classification technique for emotional expressions and speaking styles of speech using only a small amount of training data of a target speaker. We model spectral and fundamental frequency (F0) features simultaneously using multi-space probability distribution HMM (MSD-HMM), and adapt a speaker-independent neutral style model to a certain target speaker's style model with a small amount of data using MSD-MLLR which is extended MLLR for MSD-HMM. We perform classification experiments for professional narrators' speech and non-professional speakers' speech and evaluate the performance of proposed technique by comparing with other commonly used classifiers. We show that the proposed technique gives better result than the other classifiers when using a few sentences of target speaker's style data.

引用

页码：569 / 572

页数：4

共 50 条

[31] An HMM-based over-sampling technique to improve text classification
Iglesias, E. L.
Seara Vieira, A.
Borrajo, L.
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (18) : 7184 - 7192
[32] HMM-based audio/video mixed data mining algorithm
Zhang Aijun
Xu Yun
Wang Xun
ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 1217 - 1222
[33] Data Selection and Adaptation for Naturalness in HMM-based Speech Synthesis
Cooper, Erica
Chang, Alison
Levitan, Yocheved
Hirschberg, Julia
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 357 - +
[34] Performance evaluation for an HMM-based keyword spotter and a Large-margin based one in noisy environments
Tabibian, Shima
Shokri, Akram
Akbari, Ahmad
Nasersharif, Babak
WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3
[35] Evaluation of prosodic contextual factors for HMM-based speech synthesis
Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama, 226-8502, Japan
Proc. Annu. Conf. Int. Speech Commun. Assoc., INTERSPEECH, (430-433):
[36] HMM-Based Underwater Target Classification with Synthesized Active Sonar Signals
Kim, Taehwan
Bae, Keunsung
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (10) : 2039 - 2042
[37] HMM-BASED UNDERWATER TARGET CLASSIFICATION WITH SYNTHESIZED ACTIVE SONAR SIGNALS
Kim, Taehwan
Bae, Keunsung
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1805 - 1808
[38] HMM-based TTS for Hanoi Vietnamese: issues in design and evaluation
Nguyen Thi Thu Trang
D'Alessandro, Christophe
Rilliard, Albert
Tran Do Dat
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2310 - 2314
[39] Feature pruning in likelihood evaluation of HMM-based speech recognition
Li, X
Bilmes, J
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 303 - 308
[40] An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text
Rashid, Sheikh Faisal
Shafait, Faisal
Breuel, Thomas M.
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1260 - 1264

← 1 2 3 4 5 →