Comparison of AM-FM Based Features For Robust Speech Recognition

被引:0
|
作者
Narayana, K. V. S. [1 ]
Sreenivas, T. V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect & Commun Engg, Bangalore 560012, Karnataka, India
关键词
ASR; AM-FM modeling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective feature extraction for robust speech recognition is a widely addressed topic and currently there is much effort to invoke non-stationary signal models instead of quasi-stationary signal models leading to standard features such as LPC or MFCC. Joint amplitude modulation and frequency modulation (AM-FM) is a classical non-parametric approach to non-stationary signal modeling and recently new feature sets for automatic speech recognition (ASR) have been derived based on a multi-band AM-FM representation of the signal. We consider several of these representations and compare their performances for robust speech recognition in noise, using the AURORA-2 database. We show that FEPSTRUM representation proposed is more effective than others. We also propose an improvement to FEPSTRUM based on the Teager energy operator (TEO) and show that it can selectively outperform even FEPSTRUM.
引用
收藏
页码:1545 / 1548
页数:4
相关论文
共 50 条
  • [1] Robust AM-FM features for speech recognition
    Dimitriadis, D
    Maragos, P
    Potamianos, A
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (09) : 621 - 624
  • [2] Temporal AM-FM combination for robust speech recognition
    Kubo, Yotaro
    Okawa, Shigeki
    Kurematsu, Akira
    Shirai, Katsuhiko
    SPEECH COMMUNICATION, 2011, 53 (05) : 716 - 725
  • [3] Speaker Identification based on Robust AM-FM Features
    Deshpande, Mangesh S.
    Holambe, Raghunath S.
    2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
  • [4] Demodulators for AM-FM models of speech signals: A comparison
    Lu, S
    Doerschuk, PC
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 263 - 266
  • [5] Noisy speech recognition using temporal AM-FM combination
    Kubo, Yotaro
    Kurematsu, Akira
    Shirai, Katsuhiko
    Okwa, Shigeki
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4709 - +
  • [6] AM-FM MODULATION FEATURES FOR MUSIC INSTRUMENT SIGNAL ANALYSIS AND RECOGNITION
    Zlatintsi, Athanasia
    Maragos, Petros
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2035 - 2039
  • [7] Robust multiscale AM-FM demodulation of digital images
    Murray, Victor
    Paul, Rodriguez V.
    Pattichis, Marios S.
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 465 - +
  • [8] Object tracking using AM-FM image features
    Prakash, R. Senthil
    Aravind, R.
    IET COMPUTER VISION, 2010, 4 (04) : 295 - 305
  • [9] AM-FM Estimation for Speech Based on a Time-Varying Sinusoidal Model
    Pantazis, Yannis
    Rosec, Olivier
    Stylianou, Yannis
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 112 - 115
  • [10] AM-FM ANALOGY
    HARRIS, HC
    PROCEEDINGS OF THE INSTITUTE OF RADIO ENGINEERS, 1951, 39 (03): : 296 - 296