Robust Dereverberation With Kronecker Product Based Multichannel Linear Prediction

被引:24
|
作者
Yang, Wenxing [1 ,2 ]
Huang, Gongping [3 ]
Chen, Jingdong [1 ]
Benesty, Jacob [4 ]
Cohen, Israel [3 ]
Kellermann, Walter [2 ]
机构
[1] Northwestern Polytech Univ, CIAIC, Xian 710072, Peoples R China
[2] Univ Erlangen Nurnberg, LMS, D-91058 Erlangen, Germany
[3] Technion Israel Inst Technol, Andrew & Erna Viterby Fac Elect Engn, Technion City, IL-3200003 Haifa, Israel
[4] Univ Quebec, INRS EMT, Montreal, PQ H5A 1K6, Canada
基金
美国国家科学基金会; 以色列科学基金会;
关键词
Reverberation; Computational complexity; Additive noise; Predictive models; Covariance matrices; Cost function; Array signal processing; Beamforming; dereverberation; Kronecker product filter; noise robustness; speech enhancement; weighted-prediction-error;
D O I
10.1109/LSP.2020.3044796
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Reverberation impairs not only the speech quality, but also intelligibility. The weighted-prediction-error (WPE) method, which estimates the late reverberation component based on a multichannel linear predictor, is by far one of the most effective algorithms for dereverberation. Generally, the WPE prediction filter in every short-time-Fourier-transform (STFT) subband has to be long enough to estimate accurately the late reverberation component. As a consequence, WPE is computationally expensive, which makes it difficult to implement into real-time embedded or edge computing devices. Moreover, WPE is sensitive to additive noise and its performance may suffer from dramatic degradation even in environments where the signal-to-noise ratio (SNR) is high. To address these drawbacks, this letter proposes to decompose the multichannel linear prediction filter as a Kronecker product of a temporal (interframe) prediction filter and a spatial filter. An iterative algorithm is then developed to optimize the two filters. In comparison with the original WPE algorithm, the presented method not only exhibits better performance in terms of dereverberation and robustness to additive noise, as there are fewer parameters to estimate for a given number of observation signal samples, but is also computationally more efficient, since the dimensions of the covariance matrices after Kronecker product decomposition are smaller.
引用
收藏
页码:101 / 105
页数:5
相关论文
共 50 条
  • [1] Kronecker Product Multichannel Linear Filtering for Adaptive Weighted Prediction Error-Based Speech Dereverberation
    Huang, Gongping
    Benesty, Jacob
    Cohen, Israel
    Chen, Jingdong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1277 - 1289
  • [2] Dereverberation and denoising using multichannel linear prediction
    Delcroix, Marc
    Hikichi, Takafumi
    Miyoshi, Masato
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06): : 1791 - 1801
  • [3] Precise dereverberation using multichannel linear prediction
    Delcroix, Marc
    Hikichi, Takafumi
    Miyoshi, Masato
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 430 - 440
  • [4] Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction
    Yang, Jae-Mo
    Kang, Hong-Goo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 608 - 619
  • [5] A tensor decomposition based multichannel linear prediction approach to speech dereverberation
    Zeng, Xiaojin
    He, Hongsen
    Chen, Jingdong
    Benesty, Jacob
    [J]. APPLIED ACOUSTICS, 2023, 214
  • [6] ADAPTIVE MULTICHANNEL LINEAR PREDICTION BASED DEREVERBERATION IN TIME-VARYING ROOM ENVIRONMENTS
    Yang, Jae-Mo
    Kang, Hong-Goo
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [7] Adaptive Speech Dereverberation Using Constrained Sparse Multichannel Linear Prediction
    Jukic, Ante
    van Waterschoot, Toon
    Doclo, Simon
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (01) : 101 - 105
  • [8] Online Speech Dereverberation Using Mixture of Multichannel Linear Prediction Models
    Ikeshita, Rintaro
    Kinoshita, Keisuke
    Kamo, Naoyuki
    Nakatani, Tomohiro
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1580 - 1584
  • [9] Wide-band dereverberation method based on multichannel linear prediction using prewhitening filter
    Okamoto, Takuma
    Iwaya, Yukio
    Suzuki, Yoiti
    [J]. APPLIED ACOUSTICS, 2012, 73 (01) : 50 - 55
  • [10] Robust Multichannel Dereverberation using Relaxed Multichannel Least Squares
    Lim, Felicia
    Zhang, Wancheng
    Habets, Emanuel A. P.
    Naylor, Patrick A.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (09) : 1379 - 1390