A data-driven approach to optimizing spectral speech enhancement methods for various error criteria

被引:46
|
作者
Erkelens, Jan [1 ]
Jensen, Jesper [1 ]
Heusdens, Richard [1 ]
机构
[1] Delft Univ Technol, Theory Grp, Dept Med Informat & Commun, NL-2628 CD Delft, Netherlands
关键词
speech enhancement; spectral distortion measures; speech model;
D O I
10.1016/j.specom.2006.06.012
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Gain functions for spectral noise suppression have been derived in literature for some error criteria and statistical models. These gain functions are only optimal when the statistical model is correct and the speech and noise spectral variances are known. Unfortunately, the speech distributions are unknown and can at best be determined conditionally on the estimated spectral variance. We show that the "decision-directed" approach for speech spectral variance estimation can have an important bias at low SNRs, which generally leads to too much speech suppression. To correct for such estimation inaccuracies and adapt to the unknown speech statistics, we propose a general optimization procedure, with two gain functions applied in parallel. A conventional algorithm is run in the background and is used for a priori SNR estimation only. For the final reconstruction a different gain function is used, optimized for a wide range of signal-to-noise ratios. The gain function providing for the reconstruction is trained on a speech database, by minimizing a relevant error criterion. The procedure is illustrated for several error criteria. The method compares favorably to current state-of-the-art methods, and needs less smoothing in the decision-directed spectral variance estimator. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:530 / 541
页数:12
相关论文
共 50 条
  • [1] A data-driven approach to optimizing clinical study eligibility criteria
    Fang, Yilu
    Liu, Hao
    Idnay, Betina
    Ta, Casey
    Marder, Karen
    Weng, Chunhua
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 142
  • [2] An experiment on an automated literature survey of data-driven speech enhancement methods
    Dos Santos, Arthur
    Pereira, Jayr
    Nogueira, Rodrigo
    Masiero, Bruno
    Tavallaey, Shiva Sander
    Zea, Elias
    [J]. Acta Acustica, 2024, 8
  • [3] An experiment on an automated literature survey of data-driven speech enhancement methods
    dos Santos, Arthur
    Pereira, Jayr
    Nogueira, Rodrigo
    Masiero, Bruno
    Tavallaey, Shiva Sander
    Zea, Elias
    [J]. ACTA ACUSTICA, 2024, 8
  • [4] A DATA-DRIVEN RESIDUAL GAIN APPROACH FOR TWO-STAGE SPEECH ENHANCEMENT
    Jin, Yu Gwang
    Lee, Chul Min
    Cho, Kiho
    Kim, Nam Soo
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4752 - 4755
  • [5] A novel data-driven approach to optimizing replacement policy
    Ahmadi, Reza
    Wu, Shaomin
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2017, 167 : 506 - 516
  • [6] Data-driven spectral basis functions for automatic speech recognition
    Malayath, N
    Hermansky, H
    [J]. SPEECH COMMUNICATION, 2003, 40 (04) : 449 - 466
  • [7] Speech Enhancement Based on Data-Driven Residual Gain Estimation
    Jin, Yu Gwang
    Kim, Nam Soo
    Chang, Joon-Hyuk
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (12): : 2537 - 2540
  • [8] CLOSE-A Data-Driven Approach to Speech Separation
    Ming, Ji
    Srinivasan, Ramji
    Crookes, Danny
    Jafari, Ayeh
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1355 - 1368
  • [9] Automatic meter error detection with a data-driven approach
    Chu, Ruimin
    Chik, Li
    Chan, Jeffrey
    Gutzmann, Kurt
    Li, Xiaodong
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [10] Unified approach to data-driven quantum error mitigation
    Lowe, Angus
    Gordon, Max Hunter
    Czarnik, Piotr
    Arrasmith, Andrew
    Coles, Patrick J.
    Cincio, Lukasz
    [J]. PHYSICAL REVIEW RESEARCH, 2021, 3 (03):