A robust front-end for telephone speech recognition

被引:0
|
作者
Cho, HY [1 ]
Chi, SM [1 ]
Oh, YH [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Comp Sci, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we propose an effective front-end technique to improve the performance of telephone speech recognition. Many works have been concentrated on compensating the noise and the channel distortions contained in telephone speech at the front-end stage of speech recognition. Based on RASTA processing which is well known for its channel robust feature parameters, we tried to further improve this method using the channel estimation power of cepstral mean subtraction and maximum likelihood method. As a hybrid method of channel estimation and RASTA processing, the proposed method was proved to be effective by experiments performed on real telephone speech data.
引用
收藏
页码:636 / 644
页数:9
相关论文
共 50 条
  • [31] Recognizing voice aver IP:: A robust front-end for speech recognition on the World Wide Web
    Peláez-Moreno, C
    Gallardo-Antolín, A
    Díaz-De-María, F
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2001, 3 (02) : 209 - 218
  • [32] Robust automatic speech recognition using a multi-channel signal separation front-end
    Yen, KC
    Zhao, YX
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1337 - 1340
  • [33] A New Subband-Weighted MVDR-Based Front-End for Robust Speech Recognition
    Seyedin, Sanaz
    Ahadi, Seyed Mohammad
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (08): : 2252 - 2261
  • [34] Thin client front-end processor for distributed speech recognition
    Chow, KF
    Liew, SC
    Lua, KT
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 29 - 32
  • [35] The speech recognition based on the bark wavelet front-end processing
    Zhang, XY
    Jiao, ZP
    Zhao, ZF
    [J]. FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 302 - 305
  • [36] Wavelet-based Front-End for Electromyographic Speech Recognition
    Wand, Michael
    Jou, Szu-Chen Stan
    Schultz, Tanja
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1773 - +
  • [37] Front-end design by using auditory modeling in speech recognition
    Tian, JL
    Laurila, K
    Hariharan, R
    Kiss, I
    [J]. COMPUTATIONAL MODELS OF AUDITORY FUNCTION, 2001, 312 : 329 - 342
  • [38] Optimization of Speech Enhancement Front-end with Speech Recognition-level Criterion
    Higuchi, Takuya
    Yoshioka, Takuya
    Nakatani, Tomohiro
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3808 - 3812
  • [39] Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training
    Du, Zhihao
    Zhang, Xueliang
    Han, Jiqing
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 249 - 254
  • [40] Incorporating a Generative Front-end Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition
    Kundu, Souvik
    Sim, Khe Chai
    Gales, Mark
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2359 - 2363