SPEECH DEREVERBERATION BASED ON CONVEX OPTIMIZATION ALGORITHMS FOR GROUP SPARSE LINEAR PREDICTION

被引:0
|
作者
Giacobello, Daniele [1 ]
Jensen, Tobias Lindstrom [2 ]
机构
[1] Sonos Inc, Santa Barbara, CA 93101 USA
[2] Dept Elect Syst, Signal & Informat Proc, Aalborg, Denmark
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we consider methods for improving far-field speech recognition using dereverberation based on sparse multi-channel linear prediction. In particular, we extend successful methods based on nonconvex iteratively reweighted least squares, that look for a sparse desired speech signal in the short-term Fourier transform domain, by proposing sparsity promoting convex functions. Furthermore, we show how to improve performance by applying regularization into both the reweighted least squares and convex methods. We evaluate the methods using large scale simulations by mimicking the application scenarios of interest. The experiments show that the proposed convex formulations and regularization offer improvements over existing methods with added robustness and flexibility in fairly different acoustic scenarios.
引用
收藏
页码:446 / 450
页数:5
相关论文
共 50 条
  • [1] Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors
    Jukic, Ante
    van Waterschoot, Toon
    Gerkmann, Timo
    Doclo, Simon
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (09) : 1509 - 1520
  • [2] Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors
    Department of Medical Physics and Acoustics, University of Oldenburg, Oldenburg
    26111, Germany
    不详
    3000, Belgium
    [J]. IEEE Trans. Audio Speech Lang. Process., 9 (1509-1520):
  • [3] Adaptive Speech Dereverberation Using Constrained Sparse Multichannel Linear Prediction
    Jukic, Ante
    van Waterschoot, Toon
    Doclo, Simon
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (01) : 101 - 105
  • [4] Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors
    Wang, Taihui
    Yang, Feiran
    Yang, Jun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1724 - 1735
  • [5] Blind speech dereverberation using sparse decomposition and multi-channel linear prediction
    Mousavi, Leila
    Razzazi, Farbod
    Haghbin, Afrooz
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 729 - 738
  • [6] Blind speech dereverberation using sparse decomposition and multi-channel linear prediction
    Leila Mousavi
    Farbod Razzazi
    Afrooz Haghbin
    [J]. International Journal of Speech Technology, 2019, 22 : 729 - 738
  • [7] SPEECH DEREVERBERATION WITH MULTI-CHANNEL LINEAR PREDICTION AND SPARSE PRIORS FOR THE DESIRED SIGNAL
    Jukic, Ante
    van Waterschoot, Toon
    Gerkmann, Timo
    Doclo, Simon
    [J]. 2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 23 - 26
  • [8] Speech Dereverberation Based on Sparse Matrix Decomposition
    Fan, Miao
    Liu, Liyang
    Li, Weifeng
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT INNOVATION, 2015, 28 : 1169 - 1173
  • [9] DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation
    Feng, Xinyang
    Li, Nuo
    He, Zunwen
    Zhang, Yan
    Zhang, Wancheng
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 541 - 545
  • [10] Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction
    Yang, Jae-Mo
    Kang, Hong-Goo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 608 - 619