Low-dimensional recurrent neural network-based Kalman filter for speech enhancement

被引:51
|
作者
Xia, Youshen [1 ]
Wang, Jun [2 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Peoples R China
[2] Chinese Univ Hong Kong, Dept Mech & Automat Engn, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Recurrent neural network; Speech enhancement; Non-Gaussian noise; Noise-constrained estimation; EM ALGORITHM; NOISE; STABILITY; SYSTEMS; SINGLE; ERROR;
D O I
10.1016/j.neunet.2015.03.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new recurrent neural network-based Kalman filter for speech enhancement, based on a noise-constrained least squares estimate. The parameters of speech signal modeled as autoregressive process are first estimated by using the proposed recurrent neural network and the speech signal is then recovered from Kalman filtering. The proposed recurrent neural network is globally asymptomatically stable to the noise-constrained estimate. Because the noise-constrained estimate has a robust performance against non-Gaussian noise, the proposed recurrent neural network-based speech enhancement algorithm can minimize the estimation error of Kalman filter parameters in non-Gaussian noise. Furthermore, having a low-dimensional model feature, the proposed neural network-based speech enhancement algorithm has a much faster speed than two existing recurrent neural networks-based speech enhancement algorithms. Simulation results show that the proposed recurrent neural network-based speech enhancement algorithm can produce a good performance with fast computation and noise reduction. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:131 / 139
页数:9
相关论文
共 50 条
  • [1] Deep Residual Network-Based Augmented Kalman Filter for Speech Enhancement
    Roy, Sujan Kumar
    Paliwal, Kuldip K.
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 667 - 673
  • [2] A Deep Neural Network Based Kalman Filter for Time Domain Speech Enhancement
    Yu, Hongjiang
    Ouyang, Zhiheng
    Zhu, Wei-Ping
    Champagne, Benoit
    Ji, Yunyun
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [3] Integrating Uncertainty Into Neural Network-Based Speech Enhancement
    Fang, Huajian
    Becker, Dennis
    Wermter, Stefan
    Gerkmann, Timo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1587 - 1600
  • [4] A neural network-based nonlinear filter for image enhancement
    Zhang, S
    Salari, E
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2002, 12 (02) : 56 - 62
  • [5] Convolutional Neural Network-Based Adaptive Localization for an Ensemble Kalman Filter
    Wang, Zhongrui
    Lei, Lili
    Anderson, Jeffrey L.
    Tan, Zhe-Min
    Zhang, Yi
    [J]. JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2023, 15 (10)
  • [6] INTEGRATING STATISTICAL UNCERTAINTY INTO NEURAL NETWORK-BASED SPEECH ENHANCEMENT
    Fang, Huajian
    Peer, Tal
    Wermter, Stefan
    Gerkmann, Timo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 386 - 390
  • [7] Subjective intelligibility of deep neural network-based speech enhancement
    Gelderblom, Femke B.
    Tronstad, Tron V.
    Viggen, Erlend Magnus
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1968 - 1972
  • [8] Recurrent neural network training with the Kalman Filter-based techniques
    Trebaticky, P
    [J]. NEURAL NETWORK WORLD, 2005, 15 (05) : 471 - 488
  • [9] Analysis of a low-dimensional bottleneck neural network representation of speech for modelling speech dynamics
    Bai, Linxue
    Jancovic, Peter
    Russell, Martin
    Weber, Philip
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 583 - 587
  • [10] Gammatone Filter Bank-Deep Neural Network-based Monaural speech enhancement for unseen conditions
    Sivapatham, Shoba
    Kar, Asutosh
    Christensen, Mads Graesboll
    [J]. APPLIED ACOUSTICS, 2022, 194