Linear spectral transformation for robust speech recognition using maximum mutual information

被引:7
|
作者
Kim, Donghyun [1 ]
Yook, Dongsuk [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, Seoul 136701, South Korea
关键词
linear spectral transformation; maximum mutual information (MMI); rapid adaptation; robust speech recognition;
D O I
10.1109/LSP.2006.891337
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a transformation-based rapid adaptation technique for robust speech recognition using a linear spectral transformation (LST) and a maximum mutual information (NIMI) criterion. Previously, a maximum likelihood linear spectral transformation (ML-LST) algorithm was proposed for fast adaptation in unknown environments. Since the NIMI estimation methods does not require evenly distributed training data and increases the a posteriori probability of the word sequences of the training data, we combine the linear spectral transformation method and the MMI estimation technique in order to achieve extremely rapid adaptation using only one word of adaptation data. The proposed algorithm, called MMI-LST, was implemented using the extended Baum-Welch algorithm and phonetic lattices, and evaluated on the TIMIT and FFMTIMIT corpora. It provides a relative reducion in the speech recognition error rate of 11.1% using only 0.25 s of adaptation data.
引用
收藏
页码:496 / 499
页数:4
相关论文
共 50 条
  • [21] Robust speech recognition by using spectral subtraction with noise peak shifting
    Dai, Peng
    Soon, Ing Yann
    [J]. IET SIGNAL PROCESSING, 2013, 7 (08) : 684 - 692
  • [22] Spectral estimation and normalisation for robust speech recognition
    Claes, T
    Xie, F
    VanCompernolle, D
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1997 - 2000
  • [23] SPECTRAL ESTIMATION FOR NOISE ROBUST SPEECH RECOGNITION
    ERELL, A
    WEINTRAUB, M
    [J]. SPEECH AND NATURAL LANGUAGE, 1989, : 319 - 324
  • [24] Nonlinear spectral transformations for robust speech recognition
    Ikbal, S
    Hermansky, H
    Bourlard, H
    [J]. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 393 - 398
  • [25] Noise robust speech recognition based on fractional spectral subtraction and perceptual linear prediction
    Postdoctoral Station, Nanjing University of International Relations, Nanjing 210039, China
    不详
    [J]. Nanjing Youdian Daxue Xuebao (Ziran Kexue Ban), 2008, 4 (12-15+21):
  • [26] Investigation of DNN-HMM and Lattice Free Maximum Mutual Information Approaches for Impaired Speech Recognition
    Veni, Vishnika S.
    Chandrakala, S.
    [J]. IEEE ACCESS, 2021, 9 : 168840 - 168849
  • [27] Rapid adaptation using linear spectral transformation for embedded speech recognisers
    Cho, Y.
    Yook, D.
    [J]. ELECTRONICS LETTERS, 2008, 44 (17) : 1040 - 1041
  • [28] Maximum likelihood polynomial regression for robust speech recognition
    Lü, Yong
    Wu, Zhenyang
    [J]. Shengxue Xuebao/Acta Acustica, 2010, 35 (01): : 88 - 96
  • [29] Maximum likelihood polynomial regression for robust speech recognition
    L Yong WU Zhenyang (School of Information Science and Engineering
    [J]. Chinese Journal of Acoustics, 2011, 30 (03) : 358 - 370
  • [30] Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2367 - 2370