On the Jointly Unsupervised Feature Vector Normalization and Acoustic Model Compensation for Robust Speech Recognition

Cited by: 0
Authors
Buera, Luis [1]
Miguel, Antonio [1]
Lleida, Eduardo [1]
Saz, Oscar [1]
Ortega, Alfonso [1]
Affiliation
[1] Univ Zaragoza, GTC, E-50009 Zaragoza, Spain
Keywords
robust speech recognition; feature vector normalization; acoustic model adaptation
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To compensate for the mismatch between training and testing conditions, an unsupervised hybrid compensation technique is proposed. It combines Multi-Environment Model based LInear Normalization (MEMLIN) with a novel acoustic model adaptation method based on rotation transformations. In a training process, a set of rotation transformations between clean and MEMLIN-normalized data is estimated by linear regression. Each MEMLIN-normalized frame is then decoded using expanded acoustic models, which are obtained from the reference models and the set of rotation transformations. During the search, a modified Viterbi algorithm selects one of the rotation transformations on-line for each frame according to the maximum likelihood (ML) criterion. Experiments were carried out on the Spanish SpeechDat Car database. MEMLIN over standard ETSI front-end parameters achieves a mean WER improvement of 75.53%, while the proposed hybrid solution reaches 90.54%.
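The two steps the abstract describes (estimating a transformation by linear regression, then choosing one transformation per frame by maximum likelihood) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the regression is set up or constrained, so the sketch fits an unconstrained affine map from clean to normalized features, expands a single reference Gaussian instead of a full HMM, and omits the Viterbi search entirely. All function names (`fit_affine`, `expand_gaussian`, `log_likelihood`, `select_transform`) are hypothetical.

```python
import numpy as np

def fit_affine(clean, normalized):
    """Least-squares fit of (A, b) with normalized ~= clean @ A.T + b.

    Assumption: a plain affine regression; the paper's rotation
    transformations may be estimated or constrained differently.
    """
    n = clean.shape[0]
    X = np.hstack([clean, np.ones((n, 1))])        # append a bias column
    W, *_ = np.linalg.lstsq(X, normalized, rcond=None)
    return W[:-1].T, W[-1]                          # A is (d, d), b is (d,)

def expand_gaussian(mean, cov, A, b):
    """Map a reference Gaussian through the affine transformation."""
    return A @ mean + b, A @ cov @ A.T

def log_likelihood(frame, mean, cov):
    """Log-density of a full-covariance Gaussian at one frame."""
    diff = frame - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (len(frame) * np.log(2.0 * np.pi) + logdet + maha)

def select_transform(frame, mean, cov, transforms):
    """Index of the transformation whose expanded Gaussian scores highest."""
    scores = [
        log_likelihood(frame, *expand_gaussian(mean, cov, A, b))
        for A, b in transforms
    ]
    return int(np.argmax(scores))
```

In a full recognizer this per-frame selection would sit inside the state-level search, which is where the modified Viterbi algorithm mentioned in the abstract comes in; the sketch only shows the selection for one frame and one Gaussian.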
Pages: 1381-1384
Page count: 4