Robust ASR Based on ETSI Advanced Front-End Using Complex Speech Analysis

被引:1
|
作者
Higa, Keita [1 ]
Funaki, Keiichi [2 ]
机构
[1] Univ Ryukyus, Sch Engn & Sci, Nakagami, Okinawa 9030213, Japan
[2] Univ Ryukyus, C&N Ctr, Nakagami, Okinawa 9030213, Japan
关键词
robust ASR; ETSI AFE; iterative Wiener filter (IWF); complex speech analysis; analytic signal; RECOGNITION; ENHANCEMENT;
D O I
10.1587/transfun.E98.A.2211
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The advanced front-end (AFE) for automatic speech recognition (ASR) was standardized by the European Telecommunications Standards Institute (ETSI). The AFE provides speech enhancement realized by an iterative Wiener filter (IWF) in which a smoothed FFT spectrum over adjacent frames is used to design the filter. We have previously proposed robust time-varying complex Auto-Regressive (TV-CAR) speech analysis for an analytic signal and evaluated the performance of speech processing such as F-0 estimation and speech enhancement. TV-CAR analysis can estimate more accurate spectrum than FFT, especially in low frequencies because of the nature of the analytic signal. In addition, TV-CAR can estimate more accurate speech spectrum against additive noise. In this paper, a time-invariant version of wide-band TV-CAR analysis is introduced to the IWF in the AFE and is evaluated using the CENSREC-2 database and its baseline script.
引用
下载
收藏
页码:2211 / 2219
页数:9
相关论文
共 50 条
  • [1] Improved ETSI Advanced Front-End for ASR Based on Robust Complex Speech Analysis
    Higa, Keita
    Funaki, Keiichi
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [2] An Uncertainty Propagation Approach to Robust ASR Using the ETSI Advanced Front-End
    Astudillo, Ramon Fernandez
    Kolossa, Dorothea
    Mandelartz, Philipp
    Orglmeister, Reinhold
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (05) : 824 - 833
  • [3] Efficient Noise-Robust Speech Recognition Front-End Based on the ETSI Standard
    Neves, Claudio
    Veiga, Arlindo
    Sa, Luis
    Perdigao, Fernando
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 609 - 612
  • [4] A complexity reduction of ETSI advanced front-end for DSR
    Li, JY
    Liu, B
    Wang, RH
    Dai, LR
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 61 - 64
  • [5] Developing the ETSI Aurora advanced distributed speech recognition front-end & What next?
    Pearce, D
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 131 - 134
  • [6] On a robust ASR based on robust complex speech analysis
    Higa, Keita
    Funaki, Keiichi
    2015 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2015, : 129 - 133
  • [7] Advanced Front-end for Robust Speech Recognition in Extremely Adverse Environments
    Dimitriadis, Dimitrios
    Segura, Jose C.
    Garcia, Luz
    Potamianos, Alexandros
    Maragos, Petros
    Pitsikalis, Vassilis
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2221 - +
  • [8] Evaluation of a wavelet based ASR front-end
    Farooq, Omar
    Datta, Sekharjit
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2007, 5 (04) : 641 - 654
  • [9] A robust front-end for telephone speech recognition
    Cho, HY
    Chi, SM
    Oh, YH
    PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
  • [10] On a robust ASR based on complex AR speech analysis
    Higa, Keita
    Funaki, Keiichi
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1232 - 1235