Efficient Noise-Robust Speech Recognition Front-End Based on the ETSI Standard

被引:2
|
作者
Neves, Claudio [1 ]
Veiga, Arlindo [1 ]
Sa, Luis [1 ]
Perdigao, Fernando [1 ]
机构
[1] Inst Telecomunicacoes, Coimbra, Portugal
关键词
D O I
10.1109/ICOSP.2008.4697206
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A powerful feature extraction system for noise robust speech recognition was standardized by ETSI. The system was developed for Distributed Speech Recognition (DSR) and includes an Advanced Front-End (AFE) to be implemented in client terminals, which send the extracted parameters to a remote server that runs a speech recognition engine. In view of the integration of a noise-robust front-end in an embedded speech recognition system, which performs simultaneously the feature extraction and the speech recognition tasks, we propose a modified implementation of the front-end with less computational requirements. Using the Aurora 2 speech database, we evaluate the impact on performance of the Blind Equalization (BE) block, the Gain Factorization (GF) block and the SNR-dependent Waveform Processing (SWP) block that are used in the AFE. We conclude that our modified front-end using Cepstral Mean Normalization (CMN) and dropping BE, GF and SWP, outperforms the AFE in a practical task.
引用
收藏
页码:609 / 612
页数:4
相关论文
共 50 条
  • [1] A noise-robust front-end for distributed speech recognition in mobile communications
    Addou, Djamel
    Selouani, Sid-Ahmed
    Kifaya, Kaoukeb
    Boudraa, Malika
    Boudraa, Bachir
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2007, 10 (04) : 167 - 173
  • [2] A noise-robust front-end based on tree-structured filter-bank for speech recognition
    Kil, RM
    Kim, YI
    Lee, GH
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL VI, 2000, : 81 - 86
  • [3] A companding front end for noise-robust automatic speech recognition
    Guinness, J
    Raj, B
    Schmidt-Nielsen, B
    Turicchia, L
    Sarpeshkar, R
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 249 - 252
  • [4] Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
    Narayanan, Arun
    Wang, DeLiang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 826 - 835
  • [5] An FFT-Based Companding Front End for Noise-Robust Automatic Speech Recognition
    Bhiksha Raj
    Lorenzo Turicchia
    Bent Schmidt-Nielsen
    Rahul Sarpeshkar
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [6] An FFT-Based Companding Front End for Noise-Robust Automatic Speech Recognition
    Raj, Bhiksha
    Turicchia, Lorenzo
    Schmidt-Nielsen, Bent
    Sarpeshkar, Rahul
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [7] Front-End Feature Compensation for Noise Robust Speech Emotion Recognition
    Pandharipande, Meghna
    Chakraborty, Rupayan
    Panda, Ashish
    Das, Biswajit
    Kopparapu, Sunil Kumar
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [8] Improved ETSI Advanced Front-End for ASR Based on Robust Complex Speech Analysis
    Higa, Keita
    Funaki, Keiichi
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [9] A robust front-end for telephone speech recognition
    Cho, HY
    Chi, SM
    Oh, YH
    [J]. PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
  • [10] Robust ASR Based on ETSI Advanced Front-End Using Complex Speech Analysis
    Higa, Keita
    Funaki, Keiichi
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2015, E98A (11): : 2211 - 2219