An efficient solution to sparse linear prediction analysis of speech

被引:12
|
作者
Khanagha, Vahid [1 ]
Daoudi, Khalid [1 ]
机构
[1] INRIA Bordeaux Sud Ouest, GeoStat Team, F-33405 Talence, France
关键词
GENERALIZED METHODS; NOISE REMOVAL; SOLVERS;
D O I
10.1186/1687-4722-2013-3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose an efficient solution to the problem of sparse linear prediction analysis of the speech signal. Our method is based on minimization of a weighted l (2)-norm of the prediction error. The weighting function is constructed such that less emphasis is given to the error around the points where we expect the largest prediction errors to occur (the glottal closure instants) and hence the resulting cost function approaches the ideal l (0)-norm cost function for sparse residual recovery. We show that the efficient minimization of this objective function (by solving normal equations of linear least squares problem) provides enhanced sparsity level of residuals compared to the l (1)-norm minimization approach which uses the computationally demanding convex optimization methods. Indeed, the computational complexity of the proposed method is roughly the same as the classic minimum variance linear prediction analysis approach. Moreover, to show a potential application of such sparse representation, we use the resulting linear prediction coefficients inside a multi-pulse synthesizer and show that the corresponding multi-pulse estimate of the excitation source results in slightly better synthesis quality when compared to the classical technique which uses the traditional non-sparse minimum variance synthesizer.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] An efficient solution to sparse linear prediction analysis of speech
    Vahid Khanagha
    Khalid Daoudi
    EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [2] Speech Reconstruction by Sparse Linear Prediction
    Koloda, Jan
    Peinado, Antonio M.
    Sanchez, Victoria
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 247 - 256
  • [3] Sparse Linear Prediction Coefficients for Isolated Speech Recognition
    Ramitha, R. S.
    Baburaj, M.
    George, Sudhish N.
    2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 534 - 538
  • [4] Sparse Linear Prediction and Its Applications to Speech Processing
    Giacobello, Daniele
    Christensen, Mads Graesboll
    Murthi, Manohar N.
    Jensen, Soren Holdt
    Moonen, Marc
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1644 - 1657
  • [5] Maximum Phase Modeling for Sparse Linear Prediction of Speech
    Drugman, Thomas
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (02) : 185 - 189
  • [6] Efficient approximate solution of sparse linear systems
    Reif, JH
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1998, 36 (09) : 37 - 58
  • [7] Efficient approximate solution of sparse linear systems
    Reif, J.H.
    Computers and Mathematics with Applications, 1998, 36 (09): : 37 - 58
  • [8] SPECTRAL ANALYSIS OF SPEECH BY LINEAR PREDICTION
    MAKHOUL, J
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1973, AU21 (03): : 140 - 148
  • [9] SPEECH ANALYSIS AND SYNTHESIS BY LINEAR PREDICTION OF SPEECH WAVE
    ATAL, BS
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1970, 47 (1P1): : 65 - &
  • [10] WEIGHTED LINEAR PREDICTION ANALYSIS OF SPEECH
    YANAGIDA, M
    KAKUSHO, O
    JOURNAL OF THE RADIO RESEARCH LABORATORY, 1988, 35 (144): : 85 - 105