Noisy speech recognition using de-noised multiresolution analysis acoustic features

被引:10
|
作者
Chan, CP [1 ]
Ching, PC [1 ]
Lee, T [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
来源
关键词
Cepstral mean normalization - Feature parameters - High frequency bands - Mel-frequency cepstral coefficients - Noisy speech recognition - Novel applications - Robust speech recognition - Wavelet packet filters;
D O I
10.1121/1.1398054
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a novel application of multiresolution analysis (MRA) in extracting acoustic features that possess de-noising capability for robust speech recognition. The MRA algorithm is used to construct a mel-scaled wavelet packet filter-bank, from which subband powers are computed as the feature parameters for speech recognition. Wiener filtering is applied to a few selected subbands at some intermediate stages of decomposition. For high-frequency bands, Wiener filters are designed based on a reduced fraction of the estimated noise power, making the consonant features much more prominent and contrastive. The proposed method is evaluated in phone recognition experiments with the MIT database. In the presence of stationary white noise at 10-dB SNR, the de-noised MRA features attain a phone recognition rate of 32%. There is a noticeable improvement compared with the accuracy of 29% and 20% attained by the commonly used mel-frequency cepstral coefficients (MFCC) with and without cepstral mean normalization (CMN), respectively. The effectiveness of the MRA features is also verified by the fact that they exhibit smaller distortion from clean speech. (C) 2001 Acoustical Society of America.
引用
收藏
页码:2567 / 2574
页数:8
相关论文
共 50 条
  • [41] Deep fusion framework for speech command recognition using acoustic and linguistic features
    Mehra, Sunakshi
    Susan, Seba
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 38667 - 38691
  • [42] Speech Enhancement Based on Masking Approach Considering Speech Quality and Acoustic Confidence for Noisy Speech Recognition
    Chu, Shih-Chuan
    Wu, Chung-Hsien
    Lin, Yun-Wen
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 536 - 540
  • [43] AN EVOLUTIONARY-BASED APPROACH TO SYNTHESIZE EARTHQUAKES USING DE-NOISED SMALL EARTHQUAKES - SIMULATING THE 2003 BAM EARTHQUAKE (IRAN)
    Eslamian, Yasser
    Adlparvar, Mohammad Reza
    Bozorgnasab, Mohsen
    Mehrjardi, Mohammad Ali Sadredini
    AUSTRIAN JOURNAL OF EARTH SCIENCES, 2010, 103 (02): : 15 - 27
  • [44] Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features
    Kitaoka, Norihide
    Hayashi, Tomoki
    Takeda, Kazuya
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [45] Gradient-Based Acoustic Features for Speech Recognition
    Muroi, Takashi
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2009), 2009, : 445 - 448
  • [46] Compression of acoustic features for speech recognition in network environments
    Ramaswamy, GN
    Gopalakrishnan, PS
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 977 - 980
  • [47] NOT ALL FEATURES ARE EQUAL: SELECTION OF ROBUST FEATURES FOR SPEECH EMOTION RECOGNITION IN NOISY ENVIRONMENTS
    Leem, Seong-Gyun
    Fulford, Daniel
    Onnela, Jukka-Pekka
    Gard, David
    Busso, Carlos
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6447 - 6451
  • [48] Acoustic analysis and recognition of whispered speech
    Itoh, T
    Takeda, K
    Itakura, F
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 389 - 392
  • [49] Acoustic Analysis for Automatic Speech Recognition
    O'Shaughnessy, Douglas
    PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1038 - 1053
  • [50] Acoustic analysis and recognition of whispered speech
    Itoh, T
    Takeda, K
    Itakura, F
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 429 - 432