Restoration of Click Degraded Speech and Music Based on High Order Sparse Linear Prediction

Cited by: 0
Authors
Dufera, Bisrat Derebssa [1, 3]
Adugna, Eneyew [3]
Eneman, Koen [1, 2]
van Waterschoot, Toon [1, 2]
Affiliations
[1] KU Leuven Grp T Leuven Campus, ESAT ETC, Leuven, Belgium
[2] Katholieke Univ Leuven, ESAT STADIUS, Leuven, Belgium
[3] Addis Ababa Univ, Addis Ababa Inst Technol, Addis Ababa, Ethiopia
Source
Funding
European Research Council;
Keywords
Click degradation; Missing sample estimation; High-order sparse linear prediction; Linear prediction; SIGNALS;
DOI
10.1109/africon46755.2019.9133792
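Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808; 0809;
Abstract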
Clicks are localized degradations that affect most archived audio media. They are objectionable to the listener and should be suppressed to make the audio acceptable. The use of linear prediction (LP) modeling for the restoration of audio signals corrupted by click degradation has been extensively researched. However, it is hampered by the need for a pitch predictor and by its poor performance for voiced speech and music. High-order sparse linear prediction has been shown to offer a better representation of voiced speech and music than conventional linear prediction. In this paper, l1-norm and l0-norm regularized high-order sparse linear prediction is proposed for the restoration of audio signals corrupted by click degradation; the approach can work equally well for speech and music without a priori information about the type of signal. High-order sparse linear prediction is used to obtain a better model of the spectral envelope and harmonics in the presence of click degradation and background noise. Evaluation with clean speech and music shows that the proposed method achieves an SNR improvement of 3 dB to 5 dB over the conventional LP approach for a wide range of click durations. Tests with speech and music corrupted by background noise in addition to click degradation show that the proposed method achieves a better SNR than conventional LP achieves when restoring click-degraded speech and music that is free of background noise. Perceptual evaluation of audio quality (PEAQ), used to estimate the subjective quality of audio, shows that the proposed method performs better than conventional LP methods in terms of the quality of the restored audio as perceived by a listener. A computational requirement analysis shows that, even though the proposed method is not real-time, it takes only 2 to 3 times the duration of the frame being restored on a present-day general-purpose processor.
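The restoration described in the abstract builds on LP-based missing sample estimation. As a minimal sketch of that general idea, and not the authors' implementation, the Python snippet below fills a gap at known positions by minimizing the prediction error energy for a fixed coefficient vector. The function name `interpolate_missing`, the toy sinusoid, and the assumption that both the click positions and the coefficients are already known are illustrative choices; in the proposed method the coefficients would instead come from an l1-norm or l0-norm regularized high-order sparse LP estimate, which is not shown here.

```python
import numpy as np

def interpolate_missing(x, missing, a):
    """Least-squares fill of the samples at `missing` (hypothetical helper).

    `a` holds LP coefficients such that x[n] ~ sum_k a[k-1] * x[n-k].
    Values of x at the missing positions are ignored.
    """
    x = np.asarray(x, dtype=float)
    n, p = len(x), len(a)
    # Prediction error filter: e[n] = x[n] - sum_k a[k-1] * x[n-k]
    b = np.concatenate(([1.0], -np.asarray(a, dtype=float)))
    # Row r of B computes the prediction error at time r + p
    B = np.zeros((n - p, n))
    for r in range(n - p):
        B[r, r:r + p + 1] = b[::-1]
    unknown = np.zeros(n, dtype=bool)
    unknown[missing] = True
    # Split e = B_u x_u + B_k x_k and minimize ||e||^2 over x_u
    B_u, B_k = B[:, unknown], B[:, ~unknown]
    x_u, *_ = np.linalg.lstsq(B_u, -B_k @ x[~unknown], rcond=None)
    restored = x.copy()
    restored[unknown] = x_u
    return restored

# Toy example: a sinusoid obeys x[n] = 2*cos(w)*x[n-1] - x[n-2] exactly,
# so an order-2 predictor with these coefficients can bridge the gap.
w = 0.3
clean = np.cos(w * np.arange(64))
a = [2 * np.cos(w), -1.0]
missing = np.arange(30, 36)      # assumed click location (6 samples)
corrupted = clean.copy()
corrupted[missing] = 0.0
restored = interpolate_missing(corrupted, missing, a)
```

On real audio the signal does not follow the model exactly, so the quality of the fill depends on how well the coefficient vector captures both the spectral envelope and the harmonic structure, which is the motivation the abstract gives for a high-order sparse predictor.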
Pages: 6