Feature extraction based on perceptually non-uniform spectral compression for speech recognition

被引:0
|
作者
Chu, KK [1 ]
Leung, SF [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The power law of hearing used in approximating the loudness function has an exponent that decreases from about 0.3 for a narrow band tone to 0.23 for a broadband uniform-exciting noise. Exploiting this property of psychoacoustics of hearing, this paper proposes a new feature extraction method for robust speech recognition. In the method, larger energy compression is applied to broadband-like high frequency bands of the power spectrum of each frame, instead of a fixed compression for all frequency bands as in root cepstral analysis or PLP analysis. In addition, those sound segments having broadband characteristics are given larger compression as well, using frame energy as the measuring index. The scatter of feature vectors and the class discrimination of our new method for phonemes are compared against traditional feature extraction techniques. It is shown that the feature derived from the new scheme has smaller variation and better class discrimination than the traditional features. Significant improvement in recognition accuracy is also obtained, especially in very low SNR, under white noise environment.
引用
收藏
页码:726 / 729
页数:4
相关论文
共 50 条
  • [1] Perceptually non-uniform spectral compression for noisy speech recognition
    Chu, KK
    Leung, SH
    Yip, CS
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 404 - 407
  • [2] Improved perceptually non-uniform spectral compression for robust speech recognition
    ZHANG Yi
    HE Chun-jiang
    LUO Yuan
    CHEN Kai
    XING Wu-chao
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2013, 20 (04) : 122 - 126+132
  • [3] Improved perceptually non-uniform spectral compression for robust speech recognition
    ZHANG Yi
    HE Chun-jiang
    LUO Yuan
    CHEN Kai
    XING Wu-chao
    The Journal of China Universities of Posts and Telecommunications, 2013, (04) : 122 - 126
  • [4] Improved perceptually non-uniform spectral compression for robust speech recognition
    Zhang, Yi
    He, Chun-Jiang
    Luo, Yuan
    Chen, Kai
    Xing, Wu-Chao
    Journal of China Universities of Posts and Telecommunications, 2013, 20 (04): : 122 - 126
  • [5] DFT based feature extraction with non-uniform spectral compression for robust speech recongition
    Yip, CS
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4162 - 4162
  • [6] Speech Recognizer-Based Non-Uniform Spectral Compression for Robust MFCC Feature Extraction
    Ali, Bagher Baba
    Wojcik, Waldemar
    Mamyrbayev, Orken
    Turdalyuly, Mussa
    Mekebayev, Nurbapa
    PRZEGLAD ELEKTROTECHNICZNY, 2018, 94 (06): : 90 - 93
  • [7] HMM Compensation Based on Non-uniform Spectral Compression for Noisy Speech Recognition
    Ning, Geng-xin
    Zhang, Jun
    Yu, Hua
    2008 11TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), VOLS 1-3, 2008, : 184 - 187
  • [8] Robust Speech Recognition Based on Speech Enhancement and Improved Perceptual Non-uniform Spectral Compression
    Zhang, Yi
    Sun, Long
    Wang, Pei-pei
    Luo, Yuan
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 3077 - 3082
  • [9] SNR-dependent non-uniform spectral compression for noisy speech recognition
    Chu, KK
    Leung, SH
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 973 - 976
  • [10] Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC
    Han Zhi-yan
    Wang Jian
    PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 98 - 102