Feature extraction using non-linear transformation for robust speech recognition on the AURORA database

Authors
Sharma, S [1]
Ellis, D [1]
Kajarekar, S [1]
Jain, P [1]
Hermansky, H [1]
Affiliations
[1] Intel Corp, Santa Clara, CA 95051 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
We evaluate the performance of several feature sets on the AURORA task as defined by ETSI. We show that, after a non-linear transformation, a number of features can be used effectively in an HMM-based recognition system. The non-linear transformation is computed by a neural network that is discriminatively trained on the phonetically labeled (force-aligned) training data. A combination of the non-linearly transformed PLP, MSG and TRAP features yields a 63% improvement in error rate compared with baseline MFCC features. The non-linearly transformed RASTA-like features, with system parameters scaled down to meet the ETSI-imposed memory and latency constraints, still yield a 40% improvement in error rate.
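The idea of a discriminatively trained non-linear transformation can be illustrated with a minimal sketch. This is not the authors' system: the synthetic 13-dimensional frames, the three toy "phone" classes, the single tanh hidden layer, and the choice of pre-softmax activations as the transformed features are all assumptions made for illustration, and every function name below is hypothetical.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train_mlp(X, y, n_classes, n_hidden=16, lr=0.5, epochs=300, seed=0):
    """Train a one-hidden-layer MLP on frame-level class labels
    with full-batch gradient descent on the cross-entropy loss."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, n_hidden))
    b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.1, (n_hidden, n_classes))
    b2 = np.zeros(n_classes)
    Y = np.eye(n_classes)[y]  # one-hot targets
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)
        P = softmax(H @ W2 + b2)
        dZ = (P - Y) / n                    # gradient w.r.t. output pre-activations
        dH = (dZ @ W2.T) * (1.0 - H ** 2)   # back-propagate through tanh
        W2 -= lr * (H.T @ dZ)
        b2 -= lr * dZ.sum(axis=0)
        W1 -= lr * (X.T @ dH)
        b1 -= lr * dH.sum(axis=0)
    return W1, b1, W2, b2

def transform(X, params):
    """Non-linear transformation: map input frames to the network's
    pre-softmax output activations (an assumed choice of feature)."""
    W1, b1, W2, b2 = params
    return np.tanh(X @ W1 + b1) @ W2 + b2

def make_toy_frames(n_per_class=200, dim=13, n_classes=3, seed=1):
    """Synthetic stand-in for labeled feature frames (assumption, not AURORA data)."""
    rng = np.random.default_rng(seed)
    means = rng.normal(0.0, 3.0, (n_classes, dim))
    X = np.vstack([m + rng.normal(0.0, 1.0, (n_per_class, dim)) for m in means])
    y = np.repeat(np.arange(n_classes), n_per_class)
    X = (X - X.mean(axis=0)) / X.std(axis=0)  # per-dimension mean/variance normalization
    return X, y

X, y = make_toy_frames()
params = train_mlp(X, y, n_classes=3)
feats = transform(X, params)  # one transformed feature vector per frame
acc = float((feats.argmax(axis=1) == y).mean())
print(feats.shape, round(acc, 3))
```

In a recognition system of the kind the abstract describes, vectors such as `feats` would replace or complement the original features as observations for the HMM; how the transformed streams are combined and post-processed is not specified in the abstract and is not modeled here.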
Pages: 1117-1120
Number of pages: 4
Related papers
50 in total
  • [31] Robust Feature Combination for Speech Recognition Using Linear Microphone Array in a Car
    Obuchi, Yasunari
    Hataoka, Nobuo
    IN-VEHICLE CORPUS AND SIGNAL PROCESSING FOR DRIVER BEHAVIOR, 2009, : 187 - +
  • [32] Noisy speech feature estimation on the Aurora2 database using a switching linear dynamic model
    Deng, Jianping
    Bouchard, Martin
    Yeap, Tet Hin
    2007, Academy Publisher (02):
  • [33] A NON-LINEAR OPERATOR BASED METHOD FOR HARMONIC FEATURE EXTRACTION FROM SPEECH SIGNALS
    Kavanagh, Darren F.
    Boland, Frank
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 217 - 220
  • [34] Linear spectral transformation for robust speech recognition using maximum mutual information
    Kim, Donghyun
    Yook, Dongsuk
    IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (07) : 496 - 499
  • [35] Robust speech processing using local adaptive non-linear filtering
    1600, Institution of Engineering and Technology, United States (07):
  • [36] Robust speech processing using local adaptive non-linear filtering
    Diaz-Ramirez, Victor H.
    Kober, Vitaly
    IET SIGNAL PROCESSING, 2013, 7 (05) : 345 - 359
  • [37] UNSEEN NOISE ROBUST SPEECH RECOGNITION USING ADAPTIVE PIECEWISE LINEAR TRANSFORMATION
    Chijiiwa, Keigo
    Suzuki, Masayuki
    Minematsu, Nobuaki
    Hirose, Keikichi
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4289 - 4292
  • [39] Non-linear predictors based on the functionally expanded neural networks for speech feature extraction
    Chetouani, Mohamed
    Hussain, Amir
    Gas, Bruno
    Zarader, Jean-Luc
    2006 IEEE INTERNATIONAL CONFERENCE ON ENGINEERING OF INTELLIGENT SYSTEMS, 2006, : 1 - +
  • [40] Radial projections for non-linear feature extraction
    Perez-Jimenez, AJ
    Perez-Cortes, JC
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 444 - 447