Robust Dereverberation With Kronecker Product Based Multichannel Linear Prediction

被引:24
|
作者
Yang, Wenxing [1 ,2 ]
Huang, Gongping [3 ]
Chen, Jingdong [1 ]
Benesty, Jacob [4 ]
Cohen, Israel [3 ]
Kellermann, Walter [2 ]
机构
[1] Northwestern Polytech Univ, CIAIC, Xian 710072, Peoples R China
[2] Univ Erlangen Nurnberg, LMS, D-91058 Erlangen, Germany
[3] Technion Israel Inst Technol, Andrew & Erna Viterby Fac Elect Engn, Technion City, IL-3200003 Haifa, Israel
[4] Univ Quebec, INRS EMT, Montreal, PQ H5A 1K6, Canada
基金
美国国家科学基金会; 以色列科学基金会;
关键词
Reverberation; Computational complexity; Additive noise; Predictive models; Covariance matrices; Cost function; Array signal processing; Beamforming; dereverberation; Kronecker product filter; noise robustness; speech enhancement; weighted-prediction-error;
D O I
10.1109/LSP.2020.3044796
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Reverberation impairs not only the speech quality, but also intelligibility. The weighted-prediction-error (WPE) method, which estimates the late reverberation component based on a multichannel linear predictor, is by far one of the most effective algorithms for dereverberation. Generally, the WPE prediction filter in every short-time-Fourier-transform (STFT) subband has to be long enough to estimate accurately the late reverberation component. As a consequence, WPE is computationally expensive, which makes it difficult to implement into real-time embedded or edge computing devices. Moreover, WPE is sensitive to additive noise and its performance may suffer from dramatic degradation even in environments where the signal-to-noise ratio (SNR) is high. To address these drawbacks, this letter proposes to decompose the multichannel linear prediction filter as a Kronecker product of a temporal (interframe) prediction filter and a spatial filter. An iterative algorithm is then developed to optimize the two filters. In comparison with the original WPE algorithm, the presented method not only exhibits better performance in terms of dereverberation and robustness to additive noise, as there are fewer parameters to estimate for a given number of observation signal samples, but is also computationally more efficient, since the dimensions of the covariance matrices after Kronecker product decomposition are smaller.
引用
收藏
页码:101 / 105
页数:5
相关论文
共 50 条
  • [31] MULTICHANNEL SPEECH DEREVERBERATION AND SEPARATION WITH OPTIMIZED COMBINATION OF LINEAR AND NON-LINEAR FILTERING
    Togami, Masahito
    Kawaguchi, Yohei
    Takeda, Ryu
    Obuchi, Yasunari
    Nukaga, Nobuo
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4057 - 4060
  • [32] SPEECH DEREVERBERATION BASED ON CONVEX OPTIMIZATION ALGORITHMS FOR GROUP SPARSE LINEAR PREDICTION
    Giacobello, Daniele
    Jensen, Tobias Lindstrom
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 446 - 450
  • [33] Multi-Channel Linear Prediction Based on Binaural Coherence for Speech Dereverberation
    Liu, Hong
    Wang, Xiuling
    Sun, Miao
    Pang, Cheng
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1735 - 1739
  • [34] Split Bregman Approach to Linear Prediction Based Dereverberation With Enforced Speech Sparsity
    Witkowski, Marcin
    Kowalczyk, Konrad
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 942 - 946
  • [35] Fractionally spaced equalizer based on multichannel linear prediction
    Zhang, ZJ
    Zhang, LR
    Shui, PL
    [J]. PROCEEDINGS OF THE IEEE 6TH CIRCUITS AND SYSTEMS SYMPOSIUM ON EMERGING TECHNOLOGIES: FRONTIERS OF MOBILE AND WIRELESS COMMUNICATION, VOLS 1 AND 2, 2004, : 393 - 396
  • [36] ROBUST SPEECH DEREVERBERATION USING SUBBAND MULTICHANNEL LEAST SQUARES WITH VARIABLE RELAXATION
    Lim, Felicia
    Naylor, Patrick A.
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [37] Local quasi-linear embedding based on kronecker product expansion of vectors
    Niu, Guo
    Ma, Zhengming
    [J]. Journal of Intelligent and Fuzzy Systems, 2021, 41 (01): : 2195 - 2205
  • [38] Nearest Kronecker Product Decomposition Based Linear-in-The-Parameters Nonlinear Filters
    Bhattacharjee, Sankha Subhra
    George, Nithin V.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2111 - 2122
  • [39] Robust Speech Dereverberation Based on Adaptive Weighted Prediction Error Algorithm with Eigenvector Extraction
    Chen, Yitong
    Zhang, Wen
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1227 - 1231
  • [40] Local quasi-linear embedding based on kronecker product expansion of vectors
    Niu, Guo
    Ma, Zhengming
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (01) : 2195 - 2205