Feature extraction based on perceptually non-uniform spectral compression for speech recognition

被引:0
|
作者
Chu, KK [1 ]
Leung, SF [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The power law of hearing used in approximating the loudness function has an exponent that decreases from about 0.3 for a narrow band tone to 0.23 for a broadband uniform-exciting noise. Exploiting this property of psychoacoustics of hearing, this paper proposes a new feature extraction method for robust speech recognition. In the method, larger energy compression is applied to broadband-like high frequency bands of the power spectrum of each frame, instead of a fixed compression for all frequency bands as in root cepstral analysis or PLP analysis. In addition, those sound segments having broadband characteristics are given larger compression as well, using frame energy as the measuring index. The scatter of feature vectors and the class discrimination of our new method for phonemes are compared against traditional feature extraction techniques. It is shown that the feature derived from the new scheme has smaller variation and better class discrimination than the traditional features. Significant improvement in recognition accuracy is also obtained, especially in very low SNR, under white noise environment.
引用
收藏
页码:726 / 729
页数:4
相关论文
共 50 条
  • [31] An Investigation of Non-Uniform Error Cost Function Design in Automatic Speech Recognition
    Fu, Qiang
    Juang, Biing-Hwang
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 168 - +
  • [32] Perceptual Speech Enhancement System Based on Non-Uniform Analysis
    Zoghlami, Novlene
    Lachiri, Zied
    INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS (ICSES '10): CONFERENCE PROCEEDINGS, 2010, : 73 - 76
  • [33] Local Feature Entropy Based Non-Uniform Simplification Algorithm
    Chu, Xiaoli
    Zhang, Yan
    TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806
  • [34] An Auditory Based Modulation Spectral Feature for Reverberant Speech Recognition
    Maganti, HariKrishna
    Matassoni, Marco
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 570 - 573
  • [35] Optimizing feature extraction for speech recognition
    Lee, CH
    Hyun, DH
    Choi, ES
    Go, JW
    Lee, CY
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (01): : 80 - 87
  • [36] Feature extraction for robust speech recognition
    Dharanipragada, S
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, 2002, : 855 - 858
  • [37] A Non-Uniform Filterbank for Speaker Recognition
    Kua, Jia Min Karen
    Thiruvaran, Tharmarajah
    Ambikairajah, Eliathamby
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2271 - 2274
  • [38] Bitstream-based feature extraction for wireless speech recognition
    Kim, HK
    Cox, RV
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1607 - 1610
  • [39] Speech feature extraction based on wavelet modulation scale for robust speech recognition
    Ma, Xin
    Zhou, Weidong
    Ju, Fang
    Jiang, Qi
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 499 - 505
  • [40] Multiresolution Feature Extraction (MRFE) based speech recognition system
    Priyanka, M. Anbu Swarna
    Solomi, V. Sherlin
    Vijayalakshmi, P.
    Nagarajan, T.
    2013 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2013, : 152 - 156