Noise Suppression based on nonnegative matrix factorization for robust speech recognition

被引:0
|
作者
Fan, Hao-teng [1 ]
Lin, Pao-han [1 ]
Hung, Jeih-weih [1 ]
机构
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
关键词
nonnegative matrix factorization; noise suppression; speech recognition; noise-robustness;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel noise robustness method, nonnegative matrix factorization-based noise suppression (NNS), to enhance the magnitude spectrum of speech signals for better speech recognition performance in noise-corrupted environments. In the presented approach, the clean data and noise in the training set are firstly converted to the spectrograms via short-time Fourier transform (STFT), and the basis spectral matrices of the speech data and noise are learned from the corresponding spectrograms accordingly. Then, the magnitude spectrogram of the noise-corrupted testing data is factorized via the basis matrices of the clean data, and the resulting noise components are alleviated from the original magnitude spectrogram. Finally, the new noisereduced magnitude spectrogram is integrated with the original noisy phase spectrogram and then converted back to a time-domain signal, which is subsequently converted to a sequence of MFCC speech features. By using the presented NNS as a pre-processing stage of the speech recognition system, the obtained recognition accuracy can outperform the MFCC baseline especially at median and low SNR cases. Furthermore, performing NNS on the different sub-band spectrograms can further improve the recognition results relative to the original NNS performing on the full-band spectrogram, indicating that sub-band NNS can produce more robust speech features suitable for noisy speech recognition.
引用
收藏
页码:1731 / +
页数:2
相关论文
共 50 条
  • [1] NONNEGATIVE MATRIX FACTORIZATION BASED NOISE ROBUST SPEAKER VERIFICATION
    Liu, S. H.
    Zou, Y. X.
    Ning, H. K.
    [J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 35 - 39
  • [2] UNSUPERVISED BEAMFORMING BASED ON MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR NOISY SPEECH RECOGNITION
    Shimada, Kazuki
    Bando, Yoshiaki
    Mimura, Masato
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5734 - 5738
  • [3] Speaker and Noise Factorization for Robust Speech Recognition
    Wang, Yongqiang
    Gales, Mark J. F.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2149 - 2158
  • [4] Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization
    Wang, Syu-Siang
    Chern, Alan
    Tsao, Yu
    Hung, Jeih-weih
    Lu, Xugang
    Lai, Ying-Hui
    Su, Borching
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (08) : 1101 - 1105
  • [5] Locality Preserved Joint Nonnegative Matrix Factorization for Speech Emotion Recognition
    Mathulaprangsan, Seksan
    Lee, Yuan-Shan
    Wang, Jia-Ching
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (04) : 821 - 825
  • [6] Robust watermarking based on DWT and nonnegative matrix factorization
    Lu, Wei
    Sun, Wei
    Lu, Hongtao
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2009, 35 (01) : 183 - 188
  • [7] Robust Watermarking based on Subsampling and Nonnegative Matrix Factorization
    Lu, Wei
    Lu, Hongtao
    [J]. INFORMATICA, 2008, 19 (04) : 555 - 566
  • [8] Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization
    Weninger, Felix
    Schuller, Bjoern
    Batliner, Anton
    Steidl, Stefan
    Seppi, Dino
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
  • [9] Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization
    Felix Weninger
    Björn Schuller
    Anton Batliner
    Stefan Steidl
    Dino Seppi
    [J]. EURASIP Journal on Advances in Signal Processing, 2011
  • [10] Robust Kernel Nonnegative Matrix Factorization
    Xia, Zhichen
    Ding, Chris
    Chow, Edmond
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 522 - 529