Robust speech recognition with on-line unsupervised acoustic feature compensation

被引:0
|
作者
Buera, Luis [1 ]
Miguel, Antonio [1 ]
Lleida, Eduardo [1 ]
Saz, Oscar [1 ]
Ortega, Alfonso [1 ]
机构
[1] Univ Zaragoza, GTC, E-50009 Zaragoza, Spain
关键词
robust speech recognition; feature vector normalization; acoustic model adaptation;
D O I
10.1109/ASRU.2007.4430092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An on-line unsupervised hybrid compensation technique is proposed to reduce the mismatch between training and testing conditions. It combines Multi-Environment Model based LInear Normalization with cross-probability model based on GMMs (MEMLIN CPM) with a novel acoustic model adaptation method based on rotation transformations. Hence, a set of rotation transformations is estimated with clean and MEMLIN CPM-normalized training data by linear regression in an unsupervised process. Thus, in testing, each MEMLIN CPM normalized frame is decoded using a modified Viterbi algorithm and expanded acoustic models, which are obtained from the reference ones and the set of rotation transformations. To test the proposed solution, some experiments with Spanish SpeechDat Car database were carried out. MEMLIN CPM over standard ETSI front-end parameters reaches 83.89% of average improvement in WER, while the introduced hybrid solution goes up to 92.07%. Also, the proposed hybrid technique was tested with Aurora 2 database, obtaining an average improvement of 68.88% with clean training.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [31] Two-stage model-based feature compensation for robust speech recognition
    Shen, Haifeng
    Liu, Gang
    Guo, Jun
    COMPUTING, 2012, 94 (01) : 1 - 20
  • [32] Two-stage model-based feature compensation for robust speech recognition
    Haifeng Shen
    Gang Liu
    Jun Guo
    Computing, 2012, 94 : 1 - 20
  • [33] ROBUST FEATURE CLUSTERING FOR UNSUPERVISED SPEECH ACTIVITY DETECTION
    Dubey, Harishchandra
    Sangwan, Abhijeet
    Hansen, John H. L.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2726 - 2730
  • [34] Channel compensation for robust telephone speech recognition
    Han, JQ
    Han, MS
    Gao, W
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 169 - 172
  • [35] Model compensation using robust features for robust speech recognition
    Zhang, Jun
    Wei, Gang
    Shuju Caiji Yu Chuli/Journal of Data Acquisition and Processing, 2003, 18 (03):
  • [36] VTS feature compensation based on two-layer GMM structure for robust speech recognition
    Zhou, Lin
    Li, Haijing
    Chen, Ying
    Wu, Zhenyang
    Lu, Yong
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [37] EFFECT OF FEATURE SMOOTHING FOR ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Chng, Eng Siong
    Lit, Haizhou
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 73 - 76
  • [38] ROBUST FEATURE EXTRACTORS FOR CONTINUOUS SPEECH RECOGNITION
    Alam, M. J.
    Kenny, P.
    Dumouchel, P.
    O'Shaughnessy, D.
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 944 - 948
  • [39] Online feature compensation using modified quantile based noise estimation for robust speech recognition
    Lee, Heungkyu
    Kwon, Ohil
    Kim, June
    ADVANCES IN INTELLIGENT IT: ACTIVE MEDIA TECHNOLOGY 2006, 2006, 138 : 236 - 242
  • [40] A new on-line robust approach to design noise-immune speech recognition systems
    Vargas, F
    Fagundes, RD
    Barros, D
    PROCEEDINGS OF THE EIGHTH IEEE INTERNATIONAL ON-LINE TESTING WORKSHOP, 2002, : 187 - 187