Robustness and sensitivity metrics-based tuning of the augmented Kalman filter for single-channel speech enhancement

被引:2
|
作者
Roy, Sujan Kumar [1 ]
Paliwal, Kuldip K. [1 ]
机构
[1] Griffith Univ, Signal Proc Lab, Nathan Campus, Brisbane, Qld 4111, Australia
关键词
Speech enhancement; Kalman filter; Augmented Kalman filter; Robustness metric; Sensitivity metric; LPC; NOISE;
D O I
10.1016/j.apacoust.2021.108355
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The inaccurate estimates of the speech and noise linear prediction coefficients (LPCs) introduce bias in augmented Kalman filter (AKF) gain, which impacts the quality and intelligibility of enhanced speech. Although current tuning methods offset the bias in AKF gain, particularly in colored noise conditions, they do not adequately address nonstationary noise conditions. This paper introduces a new tuning algorithm of the AKF gain for speech enhancement in real-life noise conditions. Due to this purpose, a speech presence probability (SPP) method first estimates the noise power spectral density (PSD) from each noisy speech frame to compute the noise LPC parameters. A whitening filter is constructed with the noise LPCs to pre-whiten each noisy speech frame prior to computing the speech LPC parameters. The AKF is then constructed with the estimated speech and noise LPC parameters. To achieve better noise reduction, the robustness metric is employed to dynamically offset the bias in AKF gain during speech absence of the noisy speech to that of the sensitivity metric during speech presence. The speech activity is obtained through adopting the speech and noise production model parameters. It is shown that the reduced-biased AKF gain achieved by the proposed tuning algorithm addresses speech enhancement in real-life noise conditions. Objective and subjective scores on the NOIZEUS corpus demonstrate that the proposed method produces enhanced speech with higher quality and intelligibility than the competing methods in real-life noise conditions for a wide range of signal-to-noise ratio (SNR) levels. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Comparative Studies of Single-Channel Speech Enhancement Techniques
    Kumar, Bittu
    Kumar, Neeraj
    Kumar, Manoj
    Prasad, S. V. S.
    Varma, Ashwini Kumar
    Ravi, Banoth
    [J]. IETE JOURNAL OF RESEARCH, 2024, 70 (06) : 5704 - 5720
  • [32] Single-Channel Speech Enhancement Using Double Spectrum
    Blass, Martin
    Mowlaee, Pejman
    Kleijn, W. Bastiaan
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1740 - 1744
  • [33] UltraSE: Single-Channel Speech Enhancement Using Ultrasound
    Sun, Ke
    Zhang, Xinyu
    [J]. PROCEEDINGS OF THE 27TH ACM ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING (ACM MOBICOM '21), 2021, : 160 - 173
  • [34] Phase-Aware Single-channel Speech Enhancement
    Mowlaee, Pejman
    Watanabe, Mario Kaoru
    Saeidi, Rahim
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1871 - 1873
  • [35] A spectral conversion approach to single-channel speech enhancement
    Mouchtaris, Athanasios
    Van der Spiegel, Jan
    Mueller, Paul
    Tsakalides, Panagiotis
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1180 - 1193
  • [36] On supervised LPC estimation training targets for augmented Kalman filter-based speech enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    [J]. SPEECH COMMUNICATION, 2022, 142 : 49 - 60
  • [37] Smartphone-based single-channel speech enhancement application for hearing aids
    Shankar, Nikhil
    Bhat, Gautam Shreedhar
    Panahi, Issa M. S.
    Tittle, Stephanie
    Thibodeau, Linda M.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (03): : 1663 - 1673
  • [38] Single-Channel Speech Enhancement With Phase Reconstruction Based on Phase Distortion Averaging
    Wakabayashi, Yukoh
    Fukumori, Takahiro
    Nakayama, Masato
    Nishiura, Takanobu
    Yamashita, Yoichi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1559 - 1569
  • [39] SINGLE-CHANNEL ENHANCEMENT OF CONVOLUTIVE NOISY SPEECH BASED ON A DISCRIMINATIVE NMF ALGORITHM
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2302 - 2306
  • [40] Single-Channel Speech Enhancement Based on Sub-Band Spectral Entropy
    Wei, Yi
    Zeng, Yumin
    Li, Chen
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2018, 66 (03): : 100 - 113