Robust ASR Based on ETSI Advanced Front-End Using Complex Speech Analysis

被引：1

作者：

Higa, Keita ^{[1
]}

Funaki, Keiichi ^{[2
]}

机构：

[1] Univ Ryukyus, Sch Engn & Sci, Nakagami, Okinawa 9030213, Japan

[2] Univ Ryukyus, C&N Ctr, Nakagami, Okinawa 9030213, Japan

来源：

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2015年 / E98A卷 / 11期

关键词：

robust ASR; ETSI AFE; iterative Wiener filter (IWF); complex speech analysis; analytic signal; RECOGNITION; ENHANCEMENT;

D O I：

10.1587/transfun.E98.A.2211

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The advanced front-end (AFE) for automatic speech recognition (ASR) was standardized by the European Telecommunications Standards Institute (ETSI). The AFE provides speech enhancement realized by an iterative Wiener filter (IWF) in which a smoothed FFT spectrum over adjacent frames is used to design the filter. We have previously proposed robust time-varying complex Auto-Regressive (TV-CAR) speech analysis for an analytic signal and evaluated the performance of speech processing such as F-0 estimation and speech enhancement. TV-CAR analysis can estimate more accurate spectrum than FFT, especially in low frequencies because of the nature of the analytic signal. In addition, TV-CAR can estimate more accurate speech spectrum against additive noise. In this paper, a time-invariant version of wide-band TV-CAR analysis is introduced to the IWF in the AFE and is evaluated using the CENSREC-2 database and its baseline script.

引用

下载

页码：2211 / 2219

页数：9

共 50 条

[1] Improved ETSI Advanced Front-End for ASR Based on Robust Complex Speech Analysis
Higa, Keita
Funaki, Keiichi
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[2] An Uncertainty Propagation Approach to Robust ASR Using the ETSI Advanced Front-End
Astudillo, Ramon Fernandez
Kolossa, Dorothea
Mandelartz, Philipp
Orglmeister, Reinhold
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (05) : 824 - 833
[3] Efficient Noise-Robust Speech Recognition Front-End Based on the ETSI Standard
Neves, Claudio
Veiga, Arlindo
Sa, Luis
Perdigao, Fernando
ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 609 - 612
[4] A complexity reduction of ETSI advanced front-end for DSR
Li, JY
Liu, B
Wang, RH
Dai, LR
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 61 - 64
[5] Developing the ETSI Aurora advanced distributed speech recognition front-end & What next?
Pearce, D
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 131 - 134
[6] On a robust ASR based on robust complex speech analysis
Higa, Keita
Funaki, Keiichi
2015 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2015, : 129 - 133
[7] Advanced Front-end for Robust Speech Recognition in Extremely Adverse Environments
Dimitriadis, Dimitrios
Segura, Jose C.
Garcia, Luz
Potamianos, Alexandros
Maragos, Petros
Pitsikalis, Vassilis
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2221 - +
[8] Evaluation of a wavelet based ASR front-end
Farooq, Omar
Datta, Sekharjit
INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2007, 5 (04) : 641 - 654
[9] A robust front-end for telephone speech recognition
Cho, HY
Chi, SM
Oh, YH
PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
[10] On a robust ASR based on complex AR speech analysis
Higa, Keita
Funaki, Keiichi
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1232 - 1235

← 1 2 3 4 5 →