Iterative speech enhancement using a non-linear dynamic state model of speech and its parameters

被引:0
|
作者
Windmann, Stefan [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, D-33098 Paderborn, Germany
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A marginalized particle filter is proposed for performing single channel speech enhancement with a non-linear dynamic state model. The system consists of a particle filter for tracking line spectral pair (LSP) parameters and a Kalman filter per particle for speech enhancement. The state model for the LSPs has been learnt on clean speech training data. In our approach parameters and speech samples are processed at different time scales by assuming the parameters to be constant for small blocks of data. Further enhancement is obtained by an iteration which can be applied on these small blocks. The experiments show that similar SNR gains are obtained as with the Kalman-EM-iterative algorithm. However better values of the noise level and the log-spectral distance are achieved.
引用
收藏
页码:465 / 468
页数:4
相关论文
共 50 条
  • [41] Linear and Non-linear Speech Features for Detection of Parkinson's Disease
    Shahbakhti, Mohammad
    Taherifar, Danial
    Sorouri, Atefeh
    6TH BIOMEDICAL ENGINEERING INTERNATIONAL CONFERENCE (BMEICON 2013), 2013,
  • [42] Speech enhancement based on AR model parameters estimation
    Deng, Feng
    Bao, Changchun
    SPEECH COMMUNICATION, 2016, 79 : 30 - 46
  • [43] Speech recognition using linear dynamic models
    Frankel, Joe
    King, Simon
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 246 - 256
  • [44] LPCs Enhancement in Iterative Kalman Filtering for Speech Enhancement using Overlapped Frames
    Mellahi, Tarek
    Hamdi, Rachid
    2014 INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2014,
  • [45] Enhancement of performance parameters of speech signal using model order reduction approach
    Arif, Mohammad
    Anand, R.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (04) : 369 - 375
  • [46] Special issue on non-linear and non-conventional speech processing
    Chetouani, Mohamed
    Faundez-Zanuy, Marcos
    Hussain, Amir
    Gas, Bruno
    Zarader, Jean-Luc
    Paliwal, Kuldip
    SPEECH COMMUNICATION, 2009, 51 (09) : 713 - 713
  • [47] Non-Linear and Non-Conventional Speech Processing: Alternative Techniques
    Sole-Casals, Jordi
    Zaiats, Vladimir
    Monte-Moreno, Enric
    COGNITIVE COMPUTATION, 2010, 2 (03) : 133 - 134
  • [48] NON-LINEAR MAPPING FOR MUTLI-CHANNEL SPEECH SEPARATION AND ROBUST OVERLAPPING SPEECH RECOGNITION
    Li, Weifeng
    Dines, John
    Magimai-Doss, Mathew
    Bourlard, Herve
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3921 - 3924
  • [49] Subspace state space model identification for speech enhancement
    Grivel, E
    Gabrea, M
    Najim, M
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 781 - 784
  • [50] Subspace state space model identification for speech enhancement
    Grivel, Eric
    Gabrea, Marcel
    Najim, Mohamel
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 2 : 781 - 784