Sparse Linear Prediction-based Dereverberation for Signal Enhancement in Distant Speaker Verification

被引:0
|
作者
Witkowski, Marcin [1 ]
Rybicka, Magdalena [1 ]
Kowalczyk, Konrad [1 ]
机构
[1] AGH Univ Sci & Technol, Inst Elect, Krakow, Poland
关键词
speaker recognition; dereverberation; linear prediction; sparse optimization; DNN-based speaker embedding; SPEECH DEREVERBERATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we investigate several sparse dereverberation methods which provide signal enhancement in reverberant environments with the aim to apply them as a preprocessing step for the distant speaker verification (SV) task. First, we present multichannel linear prediction (LP) based techniques which promote sparsity of the dereverberated speech, whose performance has never been verified in the context of speaker recognition. In particular, we describe two existing sparse LP-based methods and present a novel LP-based method in which speech sparsity is enforced by adopting the so-called split Bregman approach.We then study the performance of both sparse and nonsparse dereverberation approaches for signal enhancement, and investigate the gain offered by these methods when applied as a preprocessing step for two different speaker verification systems based on DNN-based speaker embedding extraction. The results of performed experiments indicate that the proposed sparse approach and one of the existing methods consistently achieve significant improvements in distant speaker verification in reverberant environments, and that the SV results are well in line with signal enhancement achieved by the compared techniques.
引用
收藏
页码:461 / 465
页数:5
相关论文
共 50 条
  • [1] Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors
    Department of Medical Physics and Acoustics, University of Oldenburg, Oldenburg
    26111, Germany
    不详
    3000, Belgium
    [J]. IEEE Trans. Audio Speech Lang. Process., 9 (1509-1520):
  • [2] Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors
    Jukic, Ante
    van Waterschoot, Toon
    Gerkmann, Timo
    Doclo, Simon
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (09) : 1509 - 1520
  • [3] Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors
    Wang, Taihui
    Yang, Feiran
    Yang, Jun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1724 - 1735
  • [4] Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification
    Yang, Joon-Young
    Chang, Joon-Hyuk
    [J]. IEEE/ACM Transactions on Audio Speech and Language Processing, 2022, 30 : 3144 - 3159
  • [5] Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification
    Yang, Joon-Young
    Chang, Joon-Hyuk
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 3144 - 3159
  • [6] SPEECH DEREVERBERATION WITH MULTI-CHANNEL LINEAR PREDICTION AND SPARSE PRIORS FOR THE DESIRED SIGNAL
    Jukic, Ante
    van Waterschoot, Toon
    Gerkmann, Timo
    Doclo, Simon
    [J]. 2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 23 - 26
  • [7] Linear Prediction-Based Online Dereverberation and Noise Reduction Using Alternating Kalman Filters
    Braun, Sebastian
    Habets, Emanuel A. P.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1115 - 1125
  • [8] SPEECH DEREVERBERATION BASED ON CONVEX OPTIMIZATION ALGORITHMS FOR GROUP SPARSE LINEAR PREDICTION
    Giacobello, Daniele
    Jensen, Tobias Lindstrom
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 446 - 450
  • [9] DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation
    Feng, Xinyang
    Li, Nuo
    He, Zunwen
    Zhang, Yan
    Zhang, Wancheng
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 541 - 545
  • [10] Linear Prediction-based Dereverberation with Very Deep Convolutional Neural Networks for Reverberant Speech Recognition
    Park, Sunchan
    Jeong, Yongwon
    Kim, Min Sik
    Kim, Hyung Soon
    [J]. 2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, : 310 - 311