Comparison of AM-FM Based Features For Robust Speech Recognition

Cited by: 0

Authors:

Narayana, K. V. S. [1]

Sreenivas, T. V. [1]

Affiliation:

[1] Indian Inst Sci, Dept Elect & Commun Engg, Bangalore 560012, Karnataka, India

Source:

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008

Keywords:

ASR; AM-FM modeling;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Effective feature extraction for robust speech recognition is a widely addressed topic, and there is currently much effort to invoke non-stationary signal models instead of the quasi-stationary signal models that lead to standard features such as LPC or MFCC. Joint amplitude modulation and frequency modulation (AM-FM) is a classical non-parametric approach to non-stationary signal modeling, and new feature sets for automatic speech recognition (ASR) have recently been derived from a multi-band AM-FM representation of the signal. We consider several of these representations and compare their performance for robust speech recognition in noise, using the AURORA-2 database. We show that the proposed FEPSTRUM representation is more effective than the others. We also propose an improvement to FEPSTRUM based on the Teager energy operator (TEO) and show that it can selectively outperform even FEPSTRUM.
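
For readers unfamiliar with the Teager energy operator named in the abstract, the following is a minimal sketch of its standard discrete (Teager-Kaiser) form, psi[n] = x[n]^2 - x[n-1]*x[n+1]. It is not the authors' FEPSTRUM variant, which this record does not describe; the function name `teager_energy` and the test-tone parameters are assumptions chosen for illustration only.

```python
import math


def teager_energy(x):
    """Discrete Teager-Kaiser energy: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Returns one value per interior sample, so the output is two
    samples shorter than the input.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


# Hypothetical test tone (illustrative values, not taken from the paper).
amplitude, omega = 2.0, 0.3  # omega in radians per sample
tone = [amplitude * math.cos(omega * n) for n in range(50)]

psi = teager_energy(tone)

# For a pure sinusoid the operator is constant and equals A^2 * sin^2(omega),
# so it tracks amplitude and frequency jointly.
expected = (amplitude * math.sin(omega)) ** 2
```

Because the output depends on both the amplitude and the frequency of the tone, the operator is commonly used as a building block for AM-FM demodulation; how it enters the proposed feature set is left to the paper itself.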


Pages: 1545-1548

Page count: 4

