Feature Adaptation for Robust Mobile Speech Recognition

被引:0
|
作者
Lee, Hyeopwoo [1 ]
Yook, Dongsuk [1 ]
机构
[1] Korea Univ, Dept Comp & Commun Engn, Speech Informat Proc Lab, Seoul 136701, South Korea
关键词
Speech recognition; speaker adaptation; environment adaptation; feature adaptation; feature space maximum likelihood linear regression (FMLLR); regression tree; LINEAR-REGRESSION; DEVICES;
D O I
10.1109/TCE.2012.6415011
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Feature adaptation such as feature space maximum likelihood linear regression (FMLLR) is useful for robust mobile speech recognition. However, as the amount of adaptation data increases, feature adaptation performance becomes saturated quickly due to its limitation of global transformation. To handle this problem, we propose regression tree based FMLLR which can adopt multiple transformations as the amount of adaptation data increases. An experimental result shows that the proposed method reduces the recognition error by 11.8% further for speaker adaptation task and by 13.6% further for noisy environment adaptation task compared to the conventional method(1).
引用
收藏
页码:1393 / 1398
页数:6
相关论文
共 50 条
  • [1] ROBUST FEATURE SPACE ADAPTATION FOR TELEPHONY SPEECH RECOGNITION
    Lei, Xin
    Hamaker, Jon
    He, Xiaodong
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 773 - +
  • [2] Multi-Channel Feature Adaptation for Robust Speech Recognition
    Zhang, Zhaofeng
    Xiao, Xiong
    Wang, Longbiao
    Dang, Jianwu
    Iwahashi, Masahiro
    Chng, Eng Siong
    Li, Haizhou
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [3] Feature adaptation using deviation vector for robust speech recognition in noisy environment
    Hwang, TH
    Lee, LM
    Wang, HC
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1227 - 1230
  • [4] Robust feature extraction for mobile-based speech emotion recognition system
    Lee, Kang-Kue
    Cho, Youn-Ho
    Park, Kyu-Sik
    INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 470 - 477
  • [5] Feature extraction for robust speech recognition
    Dharanipragada, S
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 855 - 858
  • [6] Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition
    Duc Hoang Ha Nguyen
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (06) : 1006 - 1019
  • [7] Feature Combination Using Multiple Spectral Cues for Robust Speech Recognition in Mobile Communications
    Addou, Djamel
    Selouani, Sid-Ahmed
    Boudraa, Malika
    Boudraa, Bachir
    PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, VOLS 1-3, 2009, : 1256 - +
  • [8] EFFECT OF FEATURE SMOOTHING FOR ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Chng, Eng Siong
    Lit, Haizhou
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 73 - 76
  • [9] ROBUST FEATURE EXTRACTORS FOR CONTINUOUS SPEECH RECOGNITION
    Alam, M. J.
    Kenny, P.
    Dumouchel, P.
    O'Shaughnessy, D.
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 944 - 948
  • [10] NOISE ADAPTATION ALGORITHMS FOR ROBUST SPEECH RECOGNITION
    CUNG, HM
    NORMANDIN, Y
    SPEECH COMMUNICATION, 1993, 12 (03) : 267 - 276