Iterative speech enhancement using a non-linear dynamic state model of speech and its parameters

被引:0
|
作者
Windmann, Stefan [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, D-33098 Paderborn, Germany
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A marginalized particle filter is proposed for performing single channel speech enhancement with a non-linear dynamic state model. The system consists of a particle filter for tracking line spectral pair (LSP) parameters and a Kalman filter per particle for speech enhancement. The state model for the LSPs has been learnt on clean speech training data. In our approach parameters and speech samples are processed at different time scales by assuming the parameters to be constant for small blocks of data. Further enhancement is obtained by an iteration which can be applied on these small blocks. The experiments show that similar SNR gains are obtained as with the Kalman-EM-iterative algorithm. However better values of the noise level and the log-spectral distance are achieved.
引用
收藏
页码:465 / 468
页数:4
相关论文
共 50 条
  • [1] Tamil Speech Enhancement Using Non-Linear Spectral Subtraction
    Prabhakaran, G.
    Indra, J.
    Kasthuri, N.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [2] Non-Linear Filtering for Feature Enhancement of Reverberant Speech
    Verma, Amit Kumar
    Tomar, Hemendra
    Chetupalli, Srikanth Raj
    Sreenivas, T. V.
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1800 - 1805
  • [3] Speech enhancement using a constrained iterative sinusoidal model
    Jensen, J
    Hansen, JHL
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (07): : 731 - 740
  • [4] Recognition of Emotion Using Non-Linear Dynamics of Speech
    Harimi, Ali
    Shalizadi, Ali
    Ahmadyfard, Alireza
    2014 7th International Symposium on Telecommunications (IST), 2014, : 446 - 451
  • [5] A speech enhancement algorithm based on non-linear filtering and noise masking
    Zhang, JJ
    Cao, ZG
    CHINESE JOURNAL OF ELECTRONICS, 2000, 9 (03): : 296 - 300
  • [6] Comparing linear and non-linear transformation of speech
    Mesbahi, Larbi
    Barreaud, Vincent
    Boeffard, Olivier
    PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON SIGNALS, SPEECH AND IMAGE PROCESSING/9TH WSEAS INTERNATIONAL CONFERENCE ON MULTIMEDIA, INTERNET & VIDEO TECHNOLOGIES, 2009, : 68 - 73
  • [7] Non-linear speech transition visualization
    Reinhard, K
    Niranjan, M
    FIFTH INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1997, (440): : 257 - 261
  • [8] Speech Recognition in Noisy Environments using a Switching Linear Dynamic Model for Feature Enhancement
    Schuller, Bjoern
    Woellmer, Martin
    Moosmayr, Tobias
    Rigoll, Gerhard
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1789 - +
  • [9] NOISE-REDUCTION USING FREQUENCY-DOMAIN NON-LINEAR PROCESSING FOR THE ENHANCEMENT OF SPEECH
    MUNDAY, E
    BRITISH TELECOM TECHNOLOGY JOURNAL, 1988, 6 (02): : 71 - 83
  • [10] Noisy speech segmentation using non-linear observation switching state space model and unscented Kalman filtering
    Jinachitra, Pamornpol
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1209 - 1212