Efficient Noise-Robust Speech Recognition Front-End Based on the ETSI Standard

Cited by: 2

Authors:

Neves, Claudio [1]

Veiga, Arlindo [1]

Sa, Luis [1]

Perdigao, Fernando [1]

Affiliation:

[1] Inst Telecomunicacoes, Coimbra, Portugal

Source:

ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS | 2008

Keywords:

DOI:

10.1109/ICOSP.2008.4697206

Chinese Library Classification (CLC):

TM [Electrical Engineering]; TN [Electronics and Communication Technology]

Discipline classification codes:

0808; 0809

Abstract:

A powerful feature extraction system for noise-robust speech recognition was standardized by ETSI. The system was developed for Distributed Speech Recognition (DSR) and includes an Advanced Front-End (AFE) to be implemented in client terminals, which send the extracted parameters to a remote server that runs a speech recognition engine. With a view to integrating a noise-robust front-end into an embedded speech recognition system, which performs the feature extraction and speech recognition tasks simultaneously, we propose a modified implementation of the front-end with lower computational requirements. Using the Aurora 2 speech database, we evaluate the impact on performance of the Blind Equalization (BE) block, the Gain Factorization (GF) block and the SNR-dependent Waveform Processing (SWP) block used in the AFE. We conclude that our modified front-end, which uses Cepstral Mean Normalization (CMN) and drops BE, GF and SWP, outperforms the AFE in a practical task.
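
For readers unfamiliar with the CMN step named in the abstract, the following is a minimal illustrative sketch, not the authors' implementation. It assumes the common per-utterance variant, in which the mean of each cepstral coefficient over all frames of an utterance is subtracted from that coefficient; the abstract does not say which CMN variant the paper uses. The function name and the list-of-lists frame layout are hypothetical choices made for this example.

```python
def cepstral_mean_normalize(frames):
    """Subtract the per-utterance mean from each cepstral coefficient.

    `frames` is a list of frames, each frame a list of cepstral
    coefficients of equal length (assumed layout, for illustration only).
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of coefficient k over all frames of the utterance.
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    # Remove that mean from every frame.
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Toy utterance: two frames with two coefficients each.
utterance = [[1.0, 4.0], [3.0, 8.0]]
normalized = cepstral_mean_normalize(utterance)
```

After normalization each coefficient track has zero mean over the utterance, which is what makes the features less sensitive to a stationary channel offset.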


Pages: 609-612

Page count: 4

