ONLINE SPEECH DEREVERBERATION USING RLS-WPE BASED ON A FULL SPATIAL CORRELATION MATRIX INTEGRATED IN A SPEECH ENHANCEMENT SYSTEM

被引:0
|
作者
Kim, Jun Hyung [1 ]
Park, Jazzson [2 ]
Ahn, Minwook [2 ]
Lee, Yeonbok [2 ]
Kim, Wonsub [2 ]
Park, Hyung-Min [1 ]
机构
[1] Sogang Univ, Dept Elect Engn, Seoul, South Korea
[2] SK Telecom, SW R&D Ctr, Seoul, South Korea
关键词
Dereverberation; weighted prediction error; recursive least squares; online processing; speech enhancement;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reverberation may degrade the performance of automatic speech recognition significantly. Although the weighted prediction error (WPE) provided impressive results, most of the conventional WPE methods were based on batch processing. An efficient WPE method based on frame-by-frame adaptation of the prediction filter in the recursive least squares (RLS) framework was proposed, but the spatial correlation matrix was oversimplified to be a scaled identity matrix. In this paper, we derive a generalized online RLS-WPE method allowing a full spatial correlation matrix. Experimental results showed that the method using a diagonal matrix achieved better performance than that using a scaled identity matrix and employing a full matrix further improved the performance. Furthermore, we also integrated the algorithm into a speech enhancement system, including stereo adaptive echo cancellation and minimum variance distortion response beamforming, implemented onto Xilinx Zynq Ultrascale+MPSoC.
引用
收藏
页码:36 / 40
页数:5
相关论文
共 29 条
  • [21] Preliminary study of mobile device-based speech enhancement system using lip-reading
    Matsunaga, Yuta
    Matsui, Kenji
    Nakatoh, Yoshihisa
    Kato, Yumiko O.
    Lopez-Sanchez, Daniel
    Rodriguez, Sara
    Corchado, Juan Manuel
    [J]. Advances in Intelligent Systems and Computing, 2019, 800 : 308 - 315
  • [22] Improving the Performance of Deep Learning Based Speech Enhancement System Using Fuzzy Restricted Boltzmann Machine
    Samui, Suman
    Chakrabarti, Indrajit
    Ghosh, Soumya K.
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 534 - 542
  • [23] TIME-FREQUENCY MASKING BASED ONLINE SPEECH ENHANCEMENT WITH MULTI-CHANNEL DATA USING CONVOLUTIONAL NEURAL NETWORKS
    Chakrabarty, Soumitro
    Wang, DeLiang
    Habets, Emanuel A. P.
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 476 - 480
  • [24] A Novel Approach for Design of a Speech Enhancement System using NLMS Adaptive Filter and ZCR based Pattern Identification
    Goswami, Sivaranjan
    Deka, Pinky
    Bardoloi, Bijeet
    Dutta, Darathi
    Sarma, Dipjyoti
    [J]. 2013 1ST INTERNATIONAL CONFERENCE ON EMERGING TRENDS AND APPLICATIONS IN COMPUTER SCIENCE (ICETACS), 2013, : 125 - 129
  • [25] Speech Intelligibility Based Enhancement System Using Modified Deep Neural Network and Adaptive Multi-band Spectral Subtraction
    Tusar Kanti Dash
    Sandeep Singh Solanki
    [J]. Wireless Personal Communications, 2020, 111 : 1073 - 1087
  • [26] Speech Intelligibility Based Enhancement System Using Modified Deep Neural Network and Adaptive Multi-band Spectral Subtraction
    Dash, Tusar Kanti
    Solanki, Sandeep Singh
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2020, 111 (02) : 1073 - 1087
  • [27] Pathological voice classification system based on CNN-BiLSTM network using speech enhancement and multi-stream approach
    Belabbas, Soumeya
    Addou, Djamel
    Selouani, Sid Ahmed
    [J]. International Journal of Speech Technology, 2024, 27 (02) : 483 - 502
  • [28] Quality Improvement of Vietnamese HMM-Based Speech Synthesis System Based on Decomposition of Naturalness and Intelligibility Using Non-negative Matrix Factorization
    Anh-Tuan Dinh
    Thanh-Son Phan
    Akagi, Masato
    [J]. ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 538 : 490 - 499
  • [29] Supervised Single Channel Speech Enhancement Based on Dual-Tree Complex Wavelet Transforms and Nonnegative Matrix Factorization Using the Joint Learning Process and Subband Smooth Ratio Mask
    Islam, Md Shohidul
    Al Mahmud, Tarek Hasan
    Khan, Wasim Ullah
    Ye, Zhongfu
    [J]. ELECTRONICS, 2019, 8 (03):