A tensor decomposition based multichannel linear prediction approach to speech dereverberation

被引:0
|
作者
Zeng, Xiaojin [1 ]
He, Hongsen [1 ]
Chen, Jingdong [2 ,3 ]
Benesty, Jacob [4 ]
机构
[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621010, Peoples R China
[2] Northwestern Polytech Univ, CIAIC, 127 Youyi West Rd, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Shaanxi Prov Key Lab Artificial Intelligence, 127 Youyi West Rd, Xian 710072, Peoples R China
[4] Univ Quebec, INRS EMT, 800 Gauchetiere Ouest,Suite 6900, Montreal, PQ H5A 1K6, Canada
基金
美国国家科学基金会;
关键词
Speech dereverberation; Multichannel linear prediction; Weighted-prediction-error (WPE); Tensor and Kronecker product decompositions; NOISE-REDUCTION; ALGORITHM;
D O I
10.1016/j.apacoust.2023.109690
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Dereverberation technology is needed in a wide range of speech applications as reverberation often greatly degrades the quality and intelligibility of the speech signal of interest captured by microphones. The commonly used weighted-prediction-error method generally requires long prediction-error filters to remove the reverberation components, which makes it computationally expensive. To deal with this issue, this paper proposes a computationally efficient dereverberation algorithm based on tensor decomposition in which the long prediction-error filter is decomposed into a group of short sub-filters through multiple Kronecker products. Consequently, the high dimensional cross-correlation matrix that needs to be inverted in the dereverberation algorithm is then converted into a set of low dimensional matrices, which leads to significant reduction in the computational complexity. Simulation results demonstrate the properties of the proposed algorithm.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction
    Yang, Jae-Mo
    Kang, Hong-Goo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 608 - 619
  • [2] Adaptive Speech Dereverberation Using Constrained Sparse Multichannel Linear Prediction
    Jukic, Ante
    van Waterschoot, Toon
    Doclo, Simon
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (01) : 101 - 105
  • [3] Online Speech Dereverberation Using Mixture of Multichannel Linear Prediction Models
    Ikeshita, Rintaro
    Kinoshita, Keisuke
    Kamo, Naoyuki
    Nakatani, Tomohiro
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1580 - 1584
  • [4] SPEECH DEREVERBERATION BASED ON LINEAR PREDICTION: AN ACOUSTIC VECTOR SENSOR APPROACH
    Shujau, M.
    Ritz, C. H.
    Burnett, I. S.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 639 - 643
  • [5] Robust Dereverberation With Kronecker Product Based Multichannel Linear Prediction
    Yang, Wenxing
    Huang, Gongping
    Chen, Jingdong
    Benesty, Jacob
    Cohen, Israel
    Kellermann, Walter
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 101 - 105
  • [6] Split Bregman Approach to Linear Prediction Based Dereverberation With Enforced Speech Sparsity
    Witkowski, Marcin
    Kowalczyk, Konrad
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 942 - 946
  • [7] Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors
    Wang, Taihui
    Yang, Feiran
    Yang, Jun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1724 - 1735
  • [8] Kronecker Product Multichannel Linear Filtering for Adaptive Weighted Prediction Error-Based Speech Dereverberation
    Huang, Gongping
    Benesty, Jacob
    Cohen, Israel
    Chen, Jingdong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1277 - 1289
  • [9] Dereverberation and denoising using multichannel linear prediction
    Delcroix, Marc
    Hikichi, Takafumi
    Miyoshi, Masato
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06): : 1791 - 1801
  • [10] Precise dereverberation using multichannel linear prediction
    Delcroix, Marc
    Hikichi, Takafumi
    Miyoshi, Masato
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 430 - 440