An efficient solution to sparse linear prediction analysis of speech

Cited by: 12
Authors
Khanagha, Vahid [1]
Daoudi, Khalid [1]
Affiliations
[1] INRIA Bordeaux Sud Ouest, GeoStat Team, F-33405 Talence, France
Keywords
GENERALIZED METHODS; NOISE REMOVAL; SOLVERS;
DOI
10.1186/1687-4722-2013-3
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
We propose an efficient solution to the problem of sparse linear prediction analysis of the speech signal. Our method is based on minimization of a weighted ℓ2-norm of the prediction error. The weighting function is constructed such that less emphasis is given to the error around the points where we expect the largest prediction errors to occur (the glottal closure instants), and hence the resulting cost function approaches the ideal ℓ0-norm cost function for sparse residual recovery. We show that the efficient minimization of this objective function (by solving the normal equations of a linear least squares problem) yields sparser residuals than the ℓ1-norm minimization approach, which relies on computationally demanding convex optimization methods. Indeed, the computational complexity of the proposed method is roughly the same as that of the classic minimum variance linear prediction analysis approach. Moreover, to show a potential application of such a sparse representation, we use the resulting linear prediction coefficients inside a multi-pulse synthesizer and show that the corresponding multi-pulse estimate of the excitation source results in slightly better synthesis quality than the classical technique, which uses the traditional non-sparse minimum variance synthesizer.
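The weighted least squares idea described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the weighting function, so the Gaussian-dip weights in `gci_weights` (and its `width` and `floor` parameters) are assumptions, and the glottal closure instants are assumed to be known in advance. The function names are hypothetical.

```python
import numpy as np

def weighted_lp(x, order, weights):
    """Minimise sum_n w[n] * (x[n] - sum_k a[k] * x[n-k])**2 via normal equations."""
    x = np.asarray(x, dtype=float)
    n = np.arange(order, len(x))
    # Row for sample n holds the past samples x[n-1], ..., x[n-order].
    X = np.column_stack([x[n - k] for k in range(1, order + 1)])
    y = x[n]
    w = np.asarray(weights, dtype=float)[n]
    # Weighted normal equations: (X^T W X) a = X^T W y.
    A = X.T @ (w[:, None] * X)
    b = X.T @ (w * y)
    a = np.linalg.solve(A, b)
    residual = y - X @ a
    return a, residual

def gci_weights(length, gci_positions, width=4.0, floor=1e-3):
    """Assumed weighting: 1 everywhere, with a Gaussian dip centred on each GCI."""
    t = np.arange(length)
    w = np.ones(length)
    for g in gci_positions:
        w -= (1.0 - floor) * np.exp(-0.5 * ((t - g) / width) ** 2)
    # Keep weights strictly positive so the normal equations stay well posed.
    return np.clip(w, floor, 1.0)
```

With all weights set to one, `weighted_lp` reduces to ordinary least squares linear prediction, which is why the cost of the weighted variant stays comparable to the classical approach: only a single `order` by `order` linear system is solved.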
Pages: 9