On supervised LPC estimation training targets for augmented Kalman filter-based speech enhancement

Cited by: 3
Authors
Roy, Sujan Kumar [1]
Nicolson, Aaron [2]
Paliwal, Kuldip K. [1]
Affiliations
[1] Griffith Univ, Signal Proc Lab, Nathan, Qld 4111, Australia
[2] CSIRO, Australian eHlth Res Ctr, Herston, Qld 4006, Australia
Keywords
Speech enhancement; Augmented Kalman filter; Linear prediction coefficients; Training targets; Temporal convolutional network; Multi-head attention network; DEEP LEARNING APPROACH; HEAD SELF-ATTENTION; COLORED-NOISE; QUALITY
DOI
10.1016/j.specom.2022.06.004
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
The performance of speech coding, speech recognition, and speech enhancement systems that rely on the augmented Kalman filter (AKF) largely depends upon the accuracy of clean speech and noise linear prediction coefficient (LPC) estimation. The formulation of clean speech and noise LPC estimation as a supervised learning task has shown considerable promise as of late. Generally, a deep neural network (DNN) learns to map noisy speech features to a training target that can be used for clean speech and noise LPC estimation. Such training targets fall into four categories: line spectrum frequency (LSF), LPC power spectrum (LPC-PS), power spectrum (PS), and magnitude spectrum (MS) training targets. The choice of training target can have a significant impact on LPC estimation accuracy. Motivated by this, we perform a comprehensive study of the training targets with the aim of determining which is best for LPC estimation. To this end, we evaluate each training target using a temporal convolutional network (TCN) and a multi-head attention-based network. A large training set constructed from a wide variety of conditions, including real-world non-stationary and coloured noise sources over a range of signal-to-noise ratio (SNR) levels, is used for training. Testing on the NOIZEUS corpus demonstrates that the LPC-PS training target produces the lowest clean speech LPC spectral distortion (SD) level. We also construct the AKF with the estimated speech and noise LPC parameters of each training target. Subjective AB listening tests and seven objective quality and intelligibility evaluation measures (CSIG, CBAK, COVL, PESQ, STOI, SegSNR, and SI-SDR) revealed that the LPC-PS training target produced enhanced speech with the highest quality and intelligibility amongst the training targets.
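To make the LPC-PS and SD terms in the abstract concrete, the following is a minimal sketch of how an LPC power spectrum and a spectral distortion level could be computed with NumPy. It is an illustration, not the paper's implementation: the function names, the sign convention A(z) = 1 + a_1 z^-1 + ... + a_p z^-p, the unit gain, the FFT size, and the root-mean-square form of SD in dB are all assumptions made here.

```python
import numpy as np


def lpc_power_spectrum(a, gain=1.0, n_fft=512):
    """All-pole power spectrum gain / |A(e^{jw})|^2 on n_fft // 2 + 1 bins.

    `a` holds a_1..a_p of A(z) = 1 + a_1 z^-1 + ... + a_p z^-p.
    The sign convention and the unit default gain are assumptions.
    """
    poly = np.concatenate(([1.0], np.asarray(a, dtype=float)))
    # Zero-padded FFT of the coefficient vector samples A(z) on the unit circle.
    A = np.fft.rfft(poly, n_fft)
    return gain / np.abs(A) ** 2


def spectral_distortion_db(a_ref, a_est, n_fft=512):
    """RMS difference in dB between two LPC power spectra (assumed SD form)."""
    ps_ref = lpc_power_spectrum(a_ref, n_fft=n_fft)
    ps_est = lpc_power_spectrum(a_est, n_fft=n_fft)
    diff_db = 10.0 * np.log10(ps_ref) - 10.0 * np.log10(ps_est)
    return float(np.sqrt(np.mean(diff_db ** 2)))


# Hypothetical first-order example: a reference model and a slightly off estimate.
a_clean = [-0.9]
a_estimated = [-0.8]
sd_same = spectral_distortion_db(a_clean, a_clean)
sd_diff = spectral_distortion_db(a_clean, a_estimated)
```

Under these assumptions, identical coefficient sets give an SD of zero, and the SD grows as the estimated coefficients move away from the reference.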
Pages: 49-60
Number of pages: 12