Performance Evaluation of HMM-Based Style Classification with a Small Amount of Training Data

Times cited: 0
Authors
Tachibana, Makoto [1]
Kawashima, Keigo [1]
Yamagishi, Junichi [1]
Kobayashi, Takao [1]
Affiliations
[1] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Yokohama, Kanagawa 2268502, Japan
Keywords
emotional speech; speaking style; speech emotion recognition; classification; MSD-HMM
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a technique for classifying emotional expressions and speaking styles in speech using only a small amount of training data from a target speaker. We model spectral and fundamental frequency (F0) features simultaneously using a multi-space probability distribution HMM (MSD-HMM), and adapt a speaker-independent neutral-style model to a target speaker's style model with a small amount of data using MSD-MLLR, an extension of MLLR to MSD-HMMs. We perform classification experiments on speech from professional narrators and non-professional speakers, and evaluate the proposed technique by comparing it with other commonly used classifiers. We show that the proposed technique gives better results than the other classifiers when only a few sentences of the target speaker's style data are used.
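The adapt-then-classify idea in the abstract can be illustrated with a heavily simplified sketch. This is not the authors' implementation. It assumes plain diagonal-covariance Gaussians rather than MSD-HMMs (so there is no voiced/unvoiced F0 handling), a single global regression class, and frame-to-Gaussian assignments that are already known. All function and variable names are hypothetical. It estimates an MLLR-style affine transform of the Gaussian means from adaptation frames, then picks the style whose model gives the highest log-likelihood.

```python
import numpy as np

def extended_means(means):
    # Extended mean vectors xi_m = [1, mu_m], one row per Gaussian.
    return np.hstack([np.ones((means.shape[0], 1)), means])

def estimate_mean_transform(means, variances, frames, assignments):
    # Estimate W (dim x (dim + 1)) such that the adapted mean is W @ [1, mu].
    # Assumes diagonal covariances and hard frame-to-Gaussian assignments,
    # so each row of W solves its own weighted least-squares problem.
    dim = means.shape[1]
    ext = extended_means(means)
    W = np.zeros((dim, dim + 1))
    for i in range(dim):
        G = np.zeros((dim + 1, dim + 1))
        k = np.zeros(dim + 1)
        for frame, m in zip(frames, assignments):
            inv_var = 1.0 / variances[m, i]
            G += inv_var * np.outer(ext[m], ext[m])
            k += inv_var * frame[i] * ext[m]
        # lstsq tolerates a rank-deficient G, which can occur with very little data.
        W[i] = np.linalg.lstsq(G, k, rcond=None)[0]
    return W

def adapt_means(means, W):
    # Apply the affine transform to every Gaussian mean.
    return extended_means(means) @ W.T

def log_likelihood(means, variances, frames, assignments):
    # Total diagonal-Gaussian log-likelihood under the given assignments.
    mu = means[assignments]
    var = variances[assignments]
    return float(np.sum(-0.5 * (np.log(2 * np.pi * var) + (frames - mu) ** 2 / var)))

def classify(style_means, variances, frames, assignments):
    # Choose the style whose means give the highest log-likelihood.
    scores = {style: log_likelihood(m, variances, frames, assignments)
              for style, m in style_means.items()}
    return max(scores, key=scores.get)

# Toy data (made up for illustration only).
neutral_means = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
variances = np.ones_like(neutral_means)
true_W = np.array([[0.5, 1.0, 0.0], [-0.3, 0.0, 1.2]])  # hypothetical style shift
assignments = np.array([0, 1, 2, 3, 0, 1, 2, 3])
frames = adapt_means(neutral_means, true_W)[assignments]  # noise-free adaptation frames

W = estimate_mean_transform(neutral_means, variances, frames, assignments)
style_means = {"neutral": neutral_means, "target_style": adapt_means(neutral_means, W)}
predicted = classify(style_means, variances, frames, assignments)
```

In a full system the assignments would come from HMM state occupancies rather than being given, and the scoring would use the HMM likelihood of the whole utterance.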
Pages: 569-572
Number of pages: 4