Restoration of Click Degraded Speech and Music Based on High Order Sparse Linear Prediction

Citations: 0

Authors:

Dufera, Bisrat Derebssa [1,3]

Adugna, Eneyew [3]

Eneman, Koen [1,2]

van Waterschoot, Toon [1,2]

Affiliations:

[1] KU Leuven Grp T Leuven Campus, ESAT ETC, Leuven, Belgium

[2] Katholieke Univ Leuven, ESAT STADIUS, Leuven, Belgium

[3] Addis Ababa Univ, Addis Ababa Inst Technol, Addis Ababa, Ethiopia

Source:

2019 IEEE AFRICON | 2019

Funding:

European Research Council

Keywords:

Click degradation; Missing sample estimation; High-order sparse linear prediction; Linear prediction; SIGNALS;

DOI:

10.1109/africon46755.2019.9133792

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Clicks are localized degradations that affect most archived audio media. Click degradations are objectionable to the listener and should be suppressed to make the audio acceptable. The use of linear prediction (LP) modeling for the restoration of audio signals corrupted by click degradation has been extensively researched. However, it is hampered by the need for a pitch predictor and by its poor performance for voiced speech and music. High-order sparse linear prediction has been shown to offer a better representation of voiced speech and music than conventional linear prediction. In this paper, the use of l1-norm and l0-norm regularized high-order sparse linear prediction is proposed for the restoration of audio signals corrupted by click degradation; the approach works equally well for speech and music without a priori information about the type of signal. High-order sparse linear prediction is used to obtain a better model of the spectral envelope and harmonics in the presence of click degradation and background noise. Evaluation with clean speech and music shows that the proposed method achieves an SNR improvement of 3 dB to 5 dB over the conventional LP approach for a wide range of click durations. Tests with speech and music corrupted by background noise in addition to click degradation show that the proposed method achieves a better SNR than conventional LP achieves when restoring click-degraded speech and music that is not corrupted by background noise. Perceptual evaluation of audio quality (PEAQ), used to estimate subjective audio quality, shows that the proposed method performs better than conventional LP methods in terms of the quality of the restored audio as perceived by a listener. A computational requirement analysis shows that, even though the proposed method is not real-time, it takes only 2 to 3 times the duration of the frame being restored on a present-day general-purpose processor.
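As a rough illustration of the l1-norm regularized high-order sparse linear prediction mentioned in the abstract, the sketch below estimates a sparse predictor by minimizing 0.5*||t - X a||^2 + lam*||a||_1 with iterative soft thresholding (ISTA). This is not the authors' implementation: the function name `sparse_lp`, the parameters `order`, `lam` and `n_iter`, and the choice of ISTA as solver are all assumptions made for the example, and the click detection and missing-sample estimation steps are not shown.

```python
import numpy as np


def sparse_lp(x, order, lam, n_iter=500):
    """Hypothetical helper: l1-regularized linear prediction solved with ISTA.

    Minimizes 0.5 * ||t - X a||^2 + lam * ||a||_1, where row n of X holds
    the past samples x[n-1], ..., x[n-order] and t holds x[n].
    """
    x = np.asarray(x, dtype=float)
    # Column k-1 contains the signal delayed by k samples.
    X = np.column_stack([x[order - k: len(x) - k] for k in range(1, order + 1)])
    t = x[order:]
    # Step size 1/L, with L the largest eigenvalue of X^T X.
    step = 1.0 / np.linalg.norm(X, 2) ** 2
    a = np.zeros(order)
    for _ in range(n_iter):
        grad = X.T @ (X @ a - t)          # gradient of the quadratic term
        z = a - step * grad               # gradient step
        # Soft thresholding enforces sparsity of the coefficients.
        a = np.sign(z) * np.maximum(np.abs(z) - step * lam, 0.0)
    return a
```

For a signal with strong periodicity, a high-order predictor estimated this way tends to place its dominant coefficient near the period lag, which is the sense in which a single sparse predictor can capture harmonic structure without a separate pitch predictor. The value of `lam` controls the trade-off between prediction error and sparsity and would need tuning per signal.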


Pages: 6

