An efficient solution to sparse linear prediction analysis of speech

被引:12
|
作者
Khanagha, Vahid [1 ]
Daoudi, Khalid [1 ]
机构
[1] INRIA Bordeaux Sud Ouest, GeoStat Team, F-33405 Talence, France
关键词
GENERALIZED METHODS; NOISE REMOVAL; SOLVERS;
D O I
10.1186/1687-4722-2013-3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose an efficient solution to the problem of sparse linear prediction analysis of the speech signal. Our method is based on minimization of a weighted l (2)-norm of the prediction error. The weighting function is constructed such that less emphasis is given to the error around the points where we expect the largest prediction errors to occur (the glottal closure instants) and hence the resulting cost function approaches the ideal l (0)-norm cost function for sparse residual recovery. We show that the efficient minimization of this objective function (by solving normal equations of linear least squares problem) provides enhanced sparsity level of residuals compared to the l (1)-norm minimization approach which uses the computationally demanding convex optimization methods. Indeed, the computational complexity of the proposed method is roughly the same as the classic minimum variance linear prediction analysis approach. Moreover, to show a potential application of such sparse representation, we use the resulting linear prediction coefficients inside a multi-pulse synthesizer and show that the corresponding multi-pulse estimate of the excitation source results in slightly better synthesis quality when compared to the classical technique which uses the traditional non-sparse minimum variance synthesizer.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Weighted Linear Prediction for Speech Analysis in Noisy Conditions
    Pohjalainen, Jouni
    Kallasjoki, Heikki
    Palomaki, Kalle J.
    Kurimo, Mikko
    Alku, Paavo
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1347 - +
  • [42] AMGCL -A C++ library for efficient solution of large sparse linear systems
    Demidov, Denis
    SOFTWARE IMPACTS, 2020, 6
  • [43] Efficient and Robust Sparse Linear Discriminant Analysis for Data Classification
    Liu, Jingjing
    Feng, Manlong
    Xiu, Xianchao
    Liu, Wanquan
    Zeng, Xiaoyang
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 617 - 629
  • [44] Communication-efficient Distributed Sparse Linear Discriminant Analysis
    Tian, Lu
    Gu, Quanquan
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 1178 - 1187
  • [45] Regularized linear prediction of speech
    Ekman, L. Anders
    Kleijn, W. Bastiaan
    Murthi, Manohar N.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01): : 65 - 73
  • [46] ON ROBUST LINEAR PREDICTION OF SPEECH
    LEE, CH
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (05): : 642 - 650
  • [47] ROBUST SPEECH ANALYSIS BY LAG-WEIGHTED LINEAR PREDICTION
    Pohjalainen, Jouni
    Alku, Paavo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4453 - 4456
  • [48] Time-varying linear prediction for speech analysis and synthesis
    Schnell, Karl
    Lacroix, Arild
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3941 - 3944
  • [49] THEORETICAL JUSTIFICATION FOR ASYNCHRONOUS APPLICATION OF LINEAR PREDICTION TO SPEECH ANALYSIS
    ALLEN, JB
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (02): : 396 - 396
  • [50] ROBUST SIGNAL SELECTION FOR LINEAR PREDICTION ANALYSIS OF VOICED SPEECH
    MA, CX
    KAMP, Y
    WILLEMS, LF
    SPEECH COMMUNICATION, 1993, 12 (01) : 69 - 81