An improved algorithm for noise-robust sparse linear prediction of speech

被引:1
|
作者
ZHOU Bin
ZOU Xia
ZHANG Xiongwei
机构
[1] The 63rd Research Institute of PLA General Staff Headquarters
[2] PLA University of Science and Technology
关键词
An improved algorithm for noise-robust sparse linear prediction of speech; PESQ; LP;
D O I
10.15949/j.cnki.0217-9776.2015.01.008
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
The performance of linear prediction analysis of speech deteriorates rapidly under noisy environments.To tackle this issue,an improved noise-robust sparse linear prediction algorithm is proposed.First,the linear prediction residual of speech is modeled as Student-t distribution,and the additive noise is incorporated explicitly to increase the robustness,thus a probabilistic model for sparse linear prediction of speech is built.Furthermore,variational Bayesian inference is utilized to approximate the intractable posterior distributions of the model parameters,and then the optimal linear prediction parameters are estimated robustly.The experimental results demonstrate the advantage of the developed algorithm in terms of several different metrics compared with the traditional algorithm and the l1 norm minimization based sparse linear prediction algorithm proposed in recent years.Finally it draws to a conclusion that the proposed algorithm is more robust to noise and is able to increase the speech quality in applications.
引用
下载
收藏
页码:84 / 95
页数:12
相关论文
共 50 条
  • [41] INCORPORATING MASK MODELLING FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION
    Koekueer, Muenevver
    Jancovic, Peter
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3929 - 3932
  • [42] Sparse Kernel Cepstral Coefficients (SKCC): Inner-product based Features for Noise-Robust Speech Recognition
    Fazel, Amin
    Chakrabartty, Shantanu
    2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 2401 - 2404
  • [43] Noise-robust speech feature processing with empirical mode decomposition
    Wu, Kuo-Hau
    Chen, Chia-Ping
    Yeh, Bing-Feng
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9
  • [44] Unsupervised modulation filter learning for noise-robust speech recognition
    Agrawal, Purvi
    Ganapathy, Sriram
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692
  • [45] A noise-robust speech input interface for information kiosk terminals
    Ida, M
    Mori, H
    Nakamura, S
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (12): : 51 - 61
  • [46] Modeling human auditory perception for noise-robust speech recognition
    Lee, SY
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : PL72 - PL74
  • [47] Deep maxout networks applied to noise-robust speech recognition
    de-la-Calle-Silos, F. (fsilos@tsc.uc3m.es), 1600, Springer Verlag (8854):
  • [48] Noise-robust speech analysis using system identification methods
    Arima, Y
    Shimamura, T
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2003, 86 (03): : 20 - 32
  • [49] A companding front end for noise-robust automatic speech recognition
    Guinness, J
    Raj, B
    Schmidt-Nielsen, B
    Turicchia, L
    Sarpeshkar, R
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 249 - 252
  • [50] Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2080 - 2091