Feature Adaptation for Robust Mobile Speech Recognition

被引:0
|
作者
Lee, Hyeopwoo [1 ]
Yook, Dongsuk [1 ]
机构
[1] Korea Univ, Dept Comp & Commun Engn, Speech Informat Proc Lab, Seoul 136701, South Korea
关键词
Speech recognition; speaker adaptation; environment adaptation; feature adaptation; feature space maximum likelihood linear regression (FMLLR); regression tree; LINEAR-REGRESSION; DEVICES;
D O I
10.1109/TCE.2012.6415011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Feature adaptation such as feature space maximum likelihood linear regression (FMLLR) is useful for robust mobile speech recognition. However, as the amount of adaptation data increases, feature adaptation performance becomes saturated quickly due to its limitation of global transformation. To handle this problem, we propose regression tree based FMLLR which can adopt multiple transformations as the amount of adaptation data increases. An experimental result shows that the proposed method reduces the recognition error by 11.8% further for speaker adaptation task and by 13.6% further for noisy environment adaptation task compared to the conventional method(1).
引用
收藏
页码:1393 / 1398
页数:6
相关论文
共 50 条
  • [41] Combining speech enhancement with feature post-processing for robust speech recognition
    Lei, Jianjun
    Guo, Jun
    Liu, Gang
    Wang, Jian
    Nie, Xiangfei
    Yang, Zhen
    INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 773 - 778
  • [42] Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition
    Leutnant, Volker
    Krueger, Alexander
    Haeb-Umbach, Reinhold
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (08): : 1640 - 1652
  • [43] Robust Feature Extraction Methods for Speech Recognition in Noisy Environments
    Mukheolkar, Ajinkya Sunil
    Alex, John Sahaya Rani
    2014 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & SOFT COMPUTING (ICNSC), 2014, : 295 - 299
  • [44] Joint model and feature space optimization for robust speech recognition
    Hwang, JN
    Wang, CJ
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 855 - 858
  • [45] DYNAMIC ADAPTATION OF HIDDEN MARKOV MODEL FOR ROBUST SPEECH RECOGNITION
    GAO, YQ
    CHEN, YB
    WU, BX
    1989 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-3, 1989, : 1336 - 1339
  • [46] A unified spectral transformation adaptation approach for robust speech recognition
    Yao, L
    Yu, D
    Huang, TY
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 981 - 984
  • [47] Domain Adaptation Using Class Similarity for Robust Speech Recognition
    Zhu, Han
    Zhao, Jiangjiang
    Ren, Yuling
    Wang, Li
    Zhang, Pengyuan
    INTERSPEECH 2020, 2020, : 4367 - 4371
  • [48] UNSUPERVISED ADAPTATION WITH DOMAIN SEPARATION NETWORKS FOR ROBUST SPEECH RECOGNITION
    Meng, Zhong
    Chen, Zhuo
    Mazalov, Vadim
    Li, Jinyu
    Gong, Yifan
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 214 - 221
  • [49] Model-based feature compensation for robust speech recognition
    Shen, Haifeng
    Li, Qunxia
    Guo, Jun
    Liu, Gang
    FUNDAMENTA INFORMATICAE, 2006, 72 (04) : 529 - 539
  • [50] Combining acoustic and articulatory feature information for robust speech recognition
    Kirchhoff, K
    Fink, GA
    Sagerer, G
    SPEECH COMMUNICATION, 2002, 37 (3-4) : 303 - 319