Speech parameters for the robust emotional speech recognition

Cited by: 0
Authors
Kim W.-G. [1]
Affiliation
[1] Kunsan National University, Republic of Korea
Keywords
Robust speech recognition; Speech parameter; Vocal tract length normalization
DOI
10.5302/J.ICROS.2010.16.12.1137
Abstract
This paper studies speech parameters that are less affected by human emotion, with the aim of developing a robust speech recognition system. To this end, the effect of emotion on a speech recognition system and the robustness of candidate speech parameters were examined using a speech database containing various emotions. Mel-cepstral coefficients, delta-cepstral coefficients, RASTA mel-cepstral coefficients, and frequency-warped mel-cepstral coefficients were used as feature parameters, and cepstral mean subtraction (CMS) was used as the signal bias removal technique. Experimental results showed that an HMM-based speaker-independent word recognizer using vocal tract length normalized mel-cepstral coefficients, their derivatives, and CMS for signal bias removal achieved the best performance, a word error rate of 0.78%. This corresponds to about a 50% reduction in word errors compared with the baseline system using mel-cepstral coefficients, their derivatives, and CMS. © ICROS 2010.
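As an illustration of two of the operations named in the abstract, the following is a minimal sketch of cepstral mean subtraction and regression-based delta coefficients in their common textbook form. It is not the author's implementation: the function names, the `width` parameter, and the edge handling (repeating the first and last frames) are assumptions made for this example, and the cepstral vectors are taken as already computed.

```python
from typing import List

Frames = List[List[float]]  # one cepstral vector per frame (assumed precomputed)


def cepstral_mean_subtraction(frames: Frames) -> Frames:
    """Subtract the per-utterance mean of each cepstral coefficient.

    A constant additive bias in the cepstral domain (for example, a fixed
    channel effect) is removed because it shifts the mean by the same amount.
    """
    if not frames:
        return []
    n_frames = len(frames)
    dim = len(frames[0])
    means = [sum(f[k] for f in frames) / n_frames for k in range(dim)]
    return [[f[k] - means[k] for k in range(dim)] for f in frames]


def delta(frames: Frames, width: int = 2) -> Frames:
    """Regression-based delta coefficients over +/- `width` frames.

    Assumption: frames beyond the utterance edges are replaced by the
    first or last frame.
    """
    n = len(frames)
    if n == 0:
        return []
    dim = len(frames[0])
    denom = 2 * sum(d * d for d in range(1, width + 1))
    out: Frames = []
    for t in range(n):
        row = []
        for k in range(dim):
            acc = 0.0
            for d in range(1, width + 1):
                nxt = frames[min(t + d, n - 1)][k]
                prv = frames[max(t - d, 0)][k]
                acc += d * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out
```

Applying `cepstral_mean_subtraction` to an utterance and to the same utterance with a constant offset added to every frame gives identical output, which is the property that makes it usable as a signal bias removal step. In a typical front end, `delta` would be applied to the normalized frames and the result appended to the static coefficients.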
Pages: 1137-1142
Page count: 5