Robust speech recognition with on-line unsupervised acoustic feature compensation

Authors
Buera, Luis [1]
Miguel, Antonio [1]
Lleida, Eduardo [1]
Saz, Oscar [1]
Ortega, Alfonso [1]
Affiliation
[1] Univ Zaragoza, GTC, E-50009 Zaragoza, Spain
Keywords
robust speech recognition; feature vector normalization; acoustic model adaptation
DOI
10.1109/ASRU.2007.4430092
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An on-line unsupervised hybrid compensation technique is proposed to reduce the mismatch between training and testing conditions. It combines Multi-Environment Model based LInear Normalization with a cross-probability model based on GMMs (MEMLIN CPM) and a novel acoustic model adaptation method based on rotation transformations. A set of rotation transformations is estimated by linear regression, in an unsupervised process, from clean and MEMLIN CPM-normalized training data. In testing, each MEMLIN CPM-normalized frame is decoded using a modified Viterbi algorithm and expanded acoustic models, which are obtained from the reference models and the set of rotation transformations. To test the proposed solution, experiments were carried out with the Spanish SpeechDat Car database. MEMLIN CPM over standard ETSI front-end parameters reaches an average WER improvement of 83.89%, while the proposed hybrid solution reaches 92.07%. The hybrid technique was also tested with the Aurora 2 database, obtaining an average improvement of 68.88% with clean training.
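The abstract mentions estimating transformations by linear regression between clean and normalized training data. The sketch below is only a generic illustration of that idea, not the authors' algorithm: it fits a single affine map by ordinary least squares with NumPy. The function names, the direction of the map (clean to normalized), the use of one global transformation, and the synthetic 2-D data are all assumptions made for illustration; the paper's rotation constraints, its unsupervised selection of transformations, and the modified Viterbi decoding are not reproduced here.

```python
import numpy as np


def estimate_affine_transform(source, target):
    """Least-squares fit of A, b such that target ≈ source @ A.T + b.

    source, target: arrays of shape (n_frames, dim), paired frame by frame.
    Hypothetical helper; the pairing and direction are assumptions.
    """
    n_frames = source.shape[0]
    # Append a column of ones so the bias b is estimated jointly with A.
    design = np.hstack([source, np.ones((n_frames, 1))])
    weights, *_ = np.linalg.lstsq(design, target, rcond=None)
    A = weights[:-1].T  # (dim, dim) linear part
    b = weights[-1]     # (dim,) bias
    return A, b


def transform_mean(A, b, mean):
    """Apply the fitted map to one Gaussian mean of a reference model."""
    return A @ mean + b


# Synthetic example: "normalized" frames are a rotated, shifted copy of "clean" ones.
rng = np.random.default_rng(0)
theta = 0.3
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
shift = np.array([0.5, -0.2])
clean = rng.normal(size=(200, 2))
normalized = clean @ R.T + shift

A, b = estimate_affine_transform(clean, normalized)
adapted_mean = transform_mean(A, b, np.zeros(2))
```

On this noise-free toy data the fitted `A` coincides with the rotation `R` up to numerical precision. With real features the regression would only approximate the mismatch, and a plain least-squares solution is not guaranteed to be a rotation.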
Pages: 105-110
Number of pages: 6