A Deep Neural Network Based Kalman Filter for Time Domain Speech Enhancement

被引:0
|
作者
Yu, Hongjiang [1 ]
Ouyang, Zhiheng [1 ]
Zhu, Wei-Ping [1 ]
Champagne, Benoit [2 ]
Ji, Yunyun [1 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
基金
中国国家自然科学基金; 加拿大自然科学与工程研究理事会;
关键词
speech enhancement; Kalman filter; deep neural network; NOISE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a novel deep neural network (DNN) based Kalman filter (KF) algorithm for speech enhancement, where DNN is applied for estimating key parameters in the KF, namely, the linear prediction coefficients (LPCs). By training the DNN with a large database and making use of the powerful learning ability of DNN, our proposed DNN-KF algorithm is able to estimate LPCs from noisy speech more accurately and robustly, leading to an improved performance as compared to traditional KF based approaches in speech enhancement. Experimental results demonstrate that our DNN-KF method outperforms two existing KF based speech enhancement methods in terms of both speech quality and intelligibility.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Deep Residual Network-Based Augmented Kalman Filter for Speech Enhancement
    Roy, Sujan Kumar
    Paliwal, Kuldip K.
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 667 - 673
  • [2] A Deep Learning-based Kalman Filter for Speech Enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    [J]. INTERSPEECH 2020, 2020, : 2692 - 2696
  • [3] A FLOW-BASED NEURAL NETWORK FOR TIME DOMAIN SPEECH ENHANCEMENT
    Strauss, Martin
    Edler, Bernd
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5754 - 5758
  • [4] A NOISE PREDICTION AND TIME-DOMAIN SUBTRACTION APPROACH TO DEEP NEURAL NETWORK BASED SPEECH ENHANCEMENT
    Odelowo, Babafemi O.
    Anderson, David V.
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 372 - 377
  • [5] Low-dimensional recurrent neural network-based Kalman filter for speech enhancement
    Xia, Youshen
    Wang, Jun
    [J]. NEURAL NETWORKS, 2015, 67 : 131 - 139
  • [6] Speech Enhancement based on Deep Convolutional Neural Network
    Nuthakki, Ramesh
    Masanta, Payel
    Yukta, T. N.
    [J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 770 - 775
  • [7] Supervised speech enhancement based on deep neural network
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Qazi, Abdul Baser
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5187 - 5201
  • [8] Frequency and time domain auditory masking threshold constrained Kalman filter for speech enhancement
    Ma, N
    Bouchard, M
    Goubran, RA
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 2659 - 2662
  • [9] A Novel Single Channel Speech Enhancement Based on Joint Deep Neural Network and Wiener Filter
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Zhou, Xingyu
    [J]. PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATCS AND COMPUTING (IEEE PIC), 2015, : 163 - 167
  • [10] An optimization method for speech enhancement based on deep neural network
    Sun, Haixia
    Li, Sikun
    [J]. 3RD INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING, 2017, 69