Robustness and sensitivity metrics-based tuning of the augmented Kalman filter for single-channel speech enhancement

被引:2
|
作者
Roy, Sujan Kumar [1 ]
Paliwal, Kuldip K. [1 ]
机构
[1] Griffith Univ, Signal Proc Lab, Nathan Campus, Brisbane, Qld 4111, Australia
关键词
Speech enhancement; Kalman filter; Augmented Kalman filter; Robustness metric; Sensitivity metric; LPC; NOISE;
D O I
10.1016/j.apacoust.2021.108355
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The inaccurate estimates of the speech and noise linear prediction coefficients (LPCs) introduce bias in augmented Kalman filter (AKF) gain, which impacts the quality and intelligibility of enhanced speech. Although current tuning methods offset the bias in AKF gain, particularly in colored noise conditions, they do not adequately address nonstationary noise conditions. This paper introduces a new tuning algorithm of the AKF gain for speech enhancement in real-life noise conditions. Due to this purpose, a speech presence probability (SPP) method first estimates the noise power spectral density (PSD) from each noisy speech frame to compute the noise LPC parameters. A whitening filter is constructed with the noise LPCs to pre-whiten each noisy speech frame prior to computing the speech LPC parameters. The AKF is then constructed with the estimated speech and noise LPC parameters. To achieve better noise reduction, the robustness metric is employed to dynamically offset the bias in AKF gain during speech absence of the noisy speech to that of the sensitivity metric during speech presence. The speech activity is obtained through adopting the speech and noise production model parameters. It is shown that the reduced-biased AKF gain achieved by the proposed tuning algorithm addresses speech enhancement in real-life noise conditions. Objective and subjective scores on the NOIZEUS corpus demonstrate that the proposed method produces enhanced speech with higher quality and intelligibility than the competing methods in real-life noise conditions for a wide range of signal-to-noise ratio (SNR) levels. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Deep Learning with Augmented Kalman Filter for Single-Channel Speech Enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [2] Robustness and Sensitivity Tuning of the Kalman Filter for Speech Enhancement
    Roy, Sujan Kumar
    Paliwal, Kuldip K.
    [J]. SIGNALS, 2021, 2 (03): : 434 - 455
  • [3] DeepLPC: A Deep Learning Approach to Augmented Kalman Filter-Based Single-Channel Speech Enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    [J]. IEEE ACCESS, 2021, 9 : 64524 - 64538
  • [4] Robustness and Sensitivity Metrics for Tuning the Extended Kalman Filter
    Saha, Manika
    Ghosh, Ratna
    Goswami, Bhaswati
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2014, 63 (04) : 964 - 971
  • [5] Robustness metric-based tuning of the augmented Kalman filter for the enhancement of speech corrupted with coloured noise
    George, Aidan E. W.
    So, Stephen
    Ghosh, Ratna
    Paliwal, Kuldip K.
    [J]. SPEECH COMMUNICATION, 2018, 105 : 62 - 76
  • [6] Single-channel speech enhancement using Kalman filtering in the modulation domain
    So, Stephen
    Wojcicki, Kamil K.
    Paliwal, Kuldip K.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 993 - 996
  • [7] Modulation-domain Kalman filtering for single-channel speech enhancement
    So, Stephen
    Paliwal, Kuldip K.
    [J]. SPEECH COMMUNICATION, 2011, 53 (06) : 818 - 829
  • [8] Single-Channel Speech Enhancement Based on Psychoacoustic Masking
    Zhou, Tingting
    Zeng, Yumin
    Wang, Rongrong
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): : 272 - 284
  • [9] Single Channel Speech Enhancement Using Subband Iterative Kalman Filter
    Roy, Sujan Kumar
    Zhu, Wei-Ping
    Champagne, Benoit
    [J]. 2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 762 - 765
  • [10] INCORPORATING MULTI-CHANNEL WIENER FILTER WITH SINGLE-CHANNEL SPEECH ENHANCEMENT ALGORITHM
    Yong, Pei Chee
    Nordholm, Sven
    Dam, Hai Huyen
    Leung, Yee Hong
    Lai, Chiong Ching
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7284 - 7288