Robustness and sensitivity metrics-based tuning of the augmented Kalman filter for single-channel speech enhancement

Cited by: 2

Authors:

Roy, Sujan Kumar [1]

Paliwal, Kuldip K. [1]

Affiliations:

[1] Griffith Univ, Signal Proc Lab, Nathan Campus, Brisbane, Qld 4111, Australia

Source:

APPLIED ACOUSTICS | 2022 / Vol. 185

Keywords:

Speech enhancement; Kalman filter; Augmented Kalman filter; Robustness metric; Sensitivity metric; LPC; NOISE;

DOI:

10.1016/j.apacoust.2021.108355

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Inaccurate estimates of the speech and noise linear prediction coefficients (LPCs) introduce bias in the augmented Kalman filter (AKF) gain, which degrades the quality and intelligibility of the enhanced speech. Although current tuning methods offset the bias in the AKF gain, particularly in colored noise conditions, they do not adequately address nonstationary noise conditions. This paper introduces a new tuning algorithm for the AKF gain for speech enhancement in real-life noise conditions. To this end, a speech presence probability (SPP) method first estimates the noise power spectral density (PSD) from each noisy speech frame to compute the noise LPC parameters. A whitening filter is constructed from the noise LPCs to pre-whiten each noisy speech frame prior to computing the speech LPC parameters. The AKF is then constructed with the estimated speech and noise LPC parameters. To achieve better noise reduction, the bias in the AKF gain is offset dynamically, using the robustness metric during speech absence and the sensitivity metric during speech presence. The speech activity is obtained from the speech and noise production model parameters. It is shown that the reduced-bias AKF gain achieved by the proposed tuning algorithm addresses speech enhancement in real-life noise conditions. Objective and subjective scores on the NOIZEUS corpus demonstrate that the proposed method produces enhanced speech with higher quality and intelligibility than the competing methods in real-life noise conditions over a wide range of signal-to-noise ratio (SNR) levels. © 2021 Elsevier Ltd. All rights reserved.
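
The noise-LPC and pre-whitening steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the SPP-based noise PSD estimator is not shown, the PSD is assumed to be given as a one-sided array, the LPCs are obtained with a standard autocorrelation plus Levinson-Durbin approach (an assumption about the method), and the function names `lpc_from_psd` and `prewhiten` are hypothetical.

```python
import numpy as np


def lpc_from_psd(psd, order):
    """Return (a, err): LPC polynomial a = [1, a_1, ..., a_p] and the
    prediction-error power, from a one-sided PSD of length n_fft // 2 + 1."""
    # Wiener-Khinchin: the autocorrelation is the inverse DFT of the PSD.
    r = np.fft.irfft(np.asarray(psd, dtype=float))[: order + 1]
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    # Levinson-Durbin recursion on the autocorrelation lags.
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def prewhiten(frame, noise_lpc):
    """Filter a frame with the FIR whitening filter A(z) built from noise LPCs."""
    return np.convolve(frame, noise_lpc)[: len(frame)]


# Toy check with a synthetic AR(1) "noise" PSD (an assumption, not SPP output).
n_fft = 512
omega = 2 * np.pi * np.arange(n_fft // 2 + 1) / n_fft
noise_psd = 1.0 / np.abs(1.0 - 0.9 * np.exp(-1j * omega)) ** 2
noise_lpc, noise_var = lpc_from_psd(noise_psd, order=2)
```

In this toy case the recovered polynomial should be close to A(z) = 1 - 0.9 z^-1, so filtering AR(1) noise with `prewhiten` returns an approximately white sequence. In the method described above, the speech LPCs would then be computed from the pre-whitened frame.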

Pages: 14

50 records in total

[1] Deep Learning with Augmented Kalman Filter for Single-Channel Speech Enhancement
Roy, Sujan Kumar
Nicolson, Aaron
Paliwal, Kuldip K.
[J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
[2] Robustness and Sensitivity Tuning of the Kalman Filter for Speech Enhancement
Roy, Sujan Kumar
Paliwal, Kuldip K.
[J]. SIGNALS, 2021, 2 (03) : 434 - 455
[3] DeepLPC: A Deep Learning Approach to Augmented Kalman Filter-Based Single-Channel Speech Enhancement
Roy, Sujan Kumar
Nicolson, Aaron
Paliwal, Kuldip K.
[J]. IEEE ACCESS, 2021, 9 : 64524 - 64538
[4] Robustness and Sensitivity Metrics for Tuning the Extended Kalman Filter
Saha, Manika
Ghosh, Ratna
Goswami, Bhaswati
[J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2014, 63 (04) : 964 - 971
[5] Robustness metric-based tuning of the augmented Kalman filter for the enhancement of speech corrupted with coloured noise
George, Aidan E. W.
So, Stephen
Ghosh, Ratna
Paliwal, Kuldip K.
[J]. SPEECH COMMUNICATION, 2018, 105 : 62 - 76
[6] Single-channel speech enhancement using Kalman filtering in the modulation domain
So, Stephen
Wojcicki, Kamil K.
Paliwal, Kuldip K.
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 993 - 996
[7] Modulation-domain Kalman filtering for single-channel speech enhancement
So, Stephen
Paliwal, Kuldip K.
[J]. SPEECH COMMUNICATION, 2011, 53 (06) : 818 - 829
[8] Single-Channel Speech Enhancement Based on Psychoacoustic Masking
Zhou, Tingting
Zeng, Yumin
Wang, Rongrong
[J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04) : 272 - 284
[9] Single Channel Speech Enhancement Using Subband Iterative Kalman Filter
Roy, Sujan Kumar
Zhu, Wei-Ping
Champagne, Benoit
[J]. 2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 762 - 765
[10] INCORPORATING MULTI-CHANNEL WIENER FILTER WITH SINGLE-CHANNEL SPEECH ENHANCEMENT ALGORITHM
Yong, Pei Chee
Nordholm, Sven
Dam, Hai Huyen
Leung, Yee Hong
Lai, Chiong Ching
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7284 - 7288