Recognition of noisy speech using dynamic spectral subband centroids

Cited by: 33
Authors
Chen, JD
Huang, YT
Li, Q
Paliwal, KK
Affiliations
[1] Bell Labs, Murray Hill, NJ 07974 USA
[2] Griffith Univ, Sch Microelect Engn, Nathan, Qld 4111, Australia
Keywords
cepstrum; robust speech recognition; subband centroid
DOI
10.1109/LSP.2003.821689
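Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809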
Abstract
Despite their widespread popularity as front-end parameters for speech recognition, the cepstral coefficients derived from either linear prediction analysis or a filter-bank are found to be sensitive to additive noise. In this letter, we discuss the use of spectral subband centroids for robust speech recognition. We show that centroids, if properly selected, can achieve recognition performance comparable to that of the mel-frequency cepstral coefficients (MFCCs) in clean speech, while delivering better performance than MFCC in noisy environments. A procedure is proposed to construct the dynamic centroid feature vector that essentially embodies the transitional spectral information. We discuss some properties of the proposed dynamic features.
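The abstract does not give the subband layout, the spectral weighting, or the exact procedure for building the dynamic vector, so the following is only a minimal sketch of the general idea, not the letter's method. It assumes the commonly used definition of a spectral subband centroid as the power-weighted mean frequency within a subband, with equal-width rectangular subbands and a hypothetical exponent `gamma` on the power spectrum. A standard regression-based delta stands in for the dynamic features. The function names `subband_centroids` and `delta`, and all parameter defaults, are illustrative.

```python
import numpy as np


def subband_centroids(frame, sample_rate, num_bands=4, gamma=1.0):
    """Return the centroid frequency (Hz) of each subband for one frame.

    Assumption: equal-width, non-overlapping rectangular subbands and a
    Hamming analysis window; the letter may use a different design.
    """
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    weights = power ** gamma  # gamma compresses the spectral dynamic range

    edges = np.linspace(0.0, sample_rate / 2.0, num_bands + 1)
    centroids = np.empty(num_bands)
    for m in range(num_bands):
        low, high = edges[m], edges[m + 1]
        if m < num_bands - 1:
            in_band = (freqs >= low) & (freqs < high)
        else:
            in_band = (freqs >= low) & (freqs <= high)  # include Nyquist bin
        mass = weights[in_band].sum()
        if mass > 0:
            centroids[m] = (freqs[in_band] * weights[in_band]).sum() / mass
        else:
            centroids[m] = 0.5 * (low + high)  # empty band: use band centre
    return centroids


def delta(features, width=2):
    """Regression-based time derivative of a (frames, dims) feature matrix.

    Assumption: the usual delta formula with edge padding, used here as a
    stand-in for the dynamic centroid features described in the abstract.
    """
    num_frames = len(features)
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    numerator = sum(
        k * (padded[width + k: width + k + num_frames]
             - padded[width - k: width - k + num_frames])
        for k in range(1, width + 1)
    )
    denominator = 2 * sum(k * k for k in range(1, width + 1))
    return numerator / denominator
```

Because each centroid is a frequency location rather than an energy level, a feature built this way tracks where the spectral peaks sit in each band, which is the intuition usually given for its robustness to additive noise.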
Pages: 258-261
Number of pages: 4
Related papers
50 in total
  • [1] Spectral subband centroids as features for speech recognition
    Paliwal, KK
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 124 - 131
  • [2] Robust speech recognition in noisy environments based on subband spectral centroid histograms
    Gajic, B
    Paliwal, KK
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 600 - 608
  • [3] Auditory driven subband speech enhancement for automatic recognition of noisy speech
    Upadhyay N.
    Rosales H.G.
    International Journal of Speech Technology, 2016, 19 (4) : 869 - 880
  • [4] Spectral subband centroid features for speech recognition
    Paliwal, KK
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 617 - 620
  • [5] Dynamic Minimum Subband Spectral Subtraction and its application in robust speech recognition
    Ma, Xin
    Peng, Yuhua
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 3, PROCEEDINGS, 2006, : 349 - +
  • [6] Speaker verification with adaptive spectral subband centroids
    Kinnunen, Tomi
    Zhang, Bingjun
    Zhu, Jia
    Wang, Ye
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 58 - +
  • [7] Speech classification in noisy environment using subband decomposition
    Lachiri, Z
    Ellouze, N
    SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 1, PROCEEDINGS, 2003, : 409 - 412
  • [8] Subband-based blind signal separation for noisy speech recognition
    Park, HM
    Jung, HY
    Lee, TW
    Lee, SY
    ELECTRONICS LETTERS, 1999, 35 (23) : 2011 - 2012
  • [9] Performance estimation of noisy speech recognition using spectral distortion and recognition task complexity
    Guo, Ling
    Yamada, Takeshi
    Miyabe, Shigeki
    Makino, Shoji
    Kitawaki, Nobuhiko
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2016, 37 (06) : 286 - 294
  • [10] Audio fingerprinting based on normalized spectral subband centroids
    Seo, JS
    Jin, M
    Lee, S
    Jang, D
    Lee, S
    Yoo, CD
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 213 - 216