Iterative speech enhancement using a non-linear dynamic state model of speech and its parameters

被引:0
|
作者
Windmann, Stefan [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, D-33098 Paderborn, Germany
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A marginalized particle filter is proposed for performing single channel speech enhancement with a non-linear dynamic state model. The system consists of a particle filter for tracking line spectral pair (LSP) parameters and a Kalman filter per particle for speech enhancement. The state model for the LSPs has been learnt on clean speech training data. In our approach parameters and speech samples are processed at different time scales by assuming the parameters to be constant for small blocks of data. Further enhancement is obtained by an iteration which can be applied on these small blocks. The experiments show that similar SNR gains are obtained as with the Kalman-EM-iterative algorithm. However better values of the noise level and the log-spectral distance are achieved.
引用
收藏
页码:465 / 468
页数:4
相关论文
共 50 条
  • [21] Non-linear prediction of speech signal using artificial neural nets
    Ashouri, K
    Amini, M
    Savoji, MH
    EURASIA-ICT 2002: INFORMATION AND COMMUNICATION TECHNOLOGY, PROCEEDINGS, 2002, 2510 : 212 - 219
  • [22] Constrained Iterative Speech Enhancement Using Phonetic Classes
    Das, Amit
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1869 - 1883
  • [23] Non-linear sub-band processing for binaural adaptive speech-enhancement
    Hussain, A
    NINTH INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS (ICANN99), VOLS 1 AND 2, 1999, (470): : 121 - 125
  • [24] Insights Into Deep Non-Linear Filters for Improved Multi-Channel Speech Enhancement
    Tesch, Kristina
    Gerkmann, Timo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 563 - 575
  • [25] Beyond Linear Transforms: Efficient Non-linear Dynamic Adaptation for Noise Robust Speech Recognition
    Rennie, Steven J.
    Dognin, Pierre L.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1305 - 1308
  • [26] Analysis of the SNR Estimator for Speech Enhancement Using a Cascaded Linear Model
    Kaur, Harjeet
    Talwar, Rajneesh
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (01) : 178 - 182
  • [27] Speech enhancement using linear prediction residual
    Yegnanarayana, B
    Avendano, C
    Hermansky, H
    Murthy, PS
    SPEECH COMMUNICATION, 1999, 28 (01) : 25 - 42
  • [28] Improved modelling of speech dynamics using non-linear formant trajectories for HMM-based speech synthesis
    Hu, Hongwei
    Russell, Martin J.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 821 - 824
  • [29] SPEECH ENHANCEMENT USING ARCH MODEL
    Atkins, Aviva
    Cohen, Israel
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [30] Speech character analysis based on non-linear method
    Li, MD
    Zhang, H
    Tong, QY
    ISTM/2003: 5TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, CONFERENCE PROCEEDINGS, 2003, : 1907 - 1910