A robust front-end for telephone speech recognition

Cited by: 0
Authors
Cho, HY [1]
Chi, SM [1]
Oh, YH [1]
Affiliation
[1] Korea Adv Inst Sci & Technol, Dept Comp Sci, Seoul, South Korea
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this study, we propose an effective front-end technique to improve the performance of telephone speech recognition. Much prior work has concentrated on compensating for the noise and channel distortions in telephone speech at the front-end stage of speech recognition. Starting from RASTA processing, which is well known for its channel-robust feature parameters, we tried to improve the method further by using the channel estimation power of cepstral mean subtraction and the maximum likelihood method. The proposed method, a hybrid of channel estimation and RASTA processing, proved effective in experiments on real telephone speech data.
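As background for the abstract, the following is a minimal sketch of plain cepstral mean subtraction, one of the channel estimation techniques it names. It is not the authors' hybrid method, which the abstract does not specify. It assumes the usual simplification that a fixed channel adds a constant offset to every cepstral frame of an utterance. The function name `cepstral_mean_subtraction` and the toy values are hypothetical.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the per-coefficient mean taken over all frames of one utterance.

    frames: list of equal-length lists of cepstral coefficients (one per frame).
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # The utterance-level mean serves as a crude estimate of the channel offset.
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Toy data: three frames of two cepstral coefficients (made-up values).
clean = [[1.0, 2.0], [3.0, 0.0], [2.0, 4.0]]

# Assumption: a fixed channel adds the same offset to every frame.
channel = [0.5, -1.5]
distorted = [[c + h for c, h in zip(frame, channel)] for frame in clean]

cms_clean = cepstral_mean_subtraction(clean)
cms_distorted = cepstral_mean_subtraction(distorted)
```

Under this assumption, `cms_clean` and `cms_distorted` agree up to floating-point error, because the constant channel offset is absorbed into the mean that is subtracted.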
Pages: 636-644
Number of pages: 9