Restoration of Click Degraded Speech and Music Based on High Order Sparse Linear Prediction

被引:0
|
作者
Dufera, Bisrat Derebssa [1 ,3 ]
Adugna, Eneyew [3 ]
Eneman, Koen [1 ,2 ]
van Waterschoot, Toon [1 ,2 ]
机构
[1] KU Leuven Grp T Leuven Campus, ESAT ETC, Leuven, Belgium
[2] Katholieke Univ Leuven, ESAT STADIUS, Leuven, Belgium
[3] Addis Ababa Univ, Addis Ababa Inst Technol, Addis Ababa, Ethiopia
来源
基金
欧洲研究理事会;
关键词
Click degradation; Missing sample estimation; High-order sparse linear prediction; Linear prediction; SIGNALS;
D O I
10.1109/africon46755.2019.9133792
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Clicks are localized degradation that affect most archived audio media. Click degradation are objectionable to the listener and should be suppressed to make the audio acceptable. The use of linear prediction (LP) modeling for the restoration of audio signal that has been corrupted by click degradation has been extensively researched. However, it is hampered by the need of a pitch predictor and by its poor performance for voiced speech and music. High-order sparse linear prediction has been shown to offer better representation of voiced speech and music over conventional linear prediction. In this paper, the use of l1-norm and l0-norm regularized high-order sparse linear prediction is proposed for restoration of audio signal that is corrupted by click degradation that can work equally well for speech and music without a priori information of the type of signal. High-order sparse linear prediction is used to obtain a better model of the spectral envelope and harmonics in the presence of click degradation and background noise. Evaluation with clean speech and music shows that the proposed method achieves SNR improvement from 3dB to 5dB over conventional LP approach for a wide range of click durations. Tests with speech and music corrupted by background noise in addition to click degradation show that the proposed method achieves a better SNR than the restoration of click degraded speech and music that is not corrupted by background noise using conventional LP. Perceptual evaluation of audio quality (PEAQ), used to estimate the subjective quality audio, shows that the proposed method performs better than conventional LP methods in terms of perceived quality of the restored audio by a listener. A computational requirement analysis shows that even though the proposed method is not real-time, it only takes 2 to 3 times the duration of the frame being restored on a present day general-purpose processor.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Improvement of Speech Preprocessing Model Based on Linear Prediction
    Gao, Yubin
    Xin, Duqiang
    Tian, Huaigu
    Xu, Xiaomin
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5581 - 5586
  • [32] Linear prediction based on improved solution of speech parameters
    Chen, Shu-Zhen
    Zhang, Chen-Guang
    Liu, Huai-Lin
    Zhang, Yu
    2003, Wuhan University (49):
  • [33] On the modeling of linear prediction based speech coders in MATLAB
    Baulin, Alexander A.
    EDM 2006: 7th Annual International Workshop and Tutorials on Electron Devices and Materials, Proceedings, 2006, : 138 - 139
  • [34] SPEECH ENHANCEMENT FOR BONE-CONDUCTED SPEECH BASED ON LOW-ORDER CEPSTRUM RESTORATION
    Watanabe, Daiki
    Sugiura, Yosuke
    Shimamura, Tetsuya
    Makinae, Hisanori
    2017 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2017), 2017, : 212 - 216
  • [35] SINGLE CHANNEL REVERBERATION SUPPRESSION BASED ON SPARSE LINEAR PREDICTION
    Lopez, Nicolas
    Grenier, Yves
    Richard, Gael
    Bourmeyster, Ivan
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [36] Click-Through Rate Prediction Algorithm Based on Modeling of Implicit High-Order Feature Importance
    Yang, Qing
    Li, Ning
    Hu, Shiyan
    Li, Heyong
    Zhang, Jingwei
    JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (05): : 1077 - 1086
  • [37] Effect of Linear Prediction Order to Modify Formant Locations for Children Speech Recognition
    Kumar, Udara Laxman
    Kurimo, Mikko
    Kathania, Hemant Kumar
    SPEECH AND COMPUTER, SPECOM 2023, PT I, 2023, 14338 : 483 - 493
  • [38] Order-Variable Multi-Pulses Linear Prediction Speech Coding
    Li, Jianlei
    Ma, Zhen
    Wu, Mingzhao
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 3672 - 3676
  • [39] Empirical Priors for Prediction in Sparse High-dimensional Linear Regression
    Martin, Ryan
    Tang, Yiqi
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [40] High Resolution Range Profile Generation via Sparse Linear Prediction
    Ozen, Bahar
    Erer, Isin
    Kent, Sedef
    11TH EUROPEAN CONFERENCE ON SYNTHETIC APERTURE RADAR (EUSAR 2016), 2016, : 467 - 470