Speech Enhancement With Inventory Style Speech Resynthesis

被引:25
|
作者
Xiao, X. [1 ]
Nickel, R. M. [2 ]
机构
[1] Penn State Univ, Dept Elect Engn, University Pk, PA 16802 USA
[2] Bucknell Univ, Dept Elect Engn, Lewisburg, PA 17837 USA
关键词
Harmonic tunnelling; hidden Markov models (HMMs); inventory style speech synthesis; nonstationary noise; sinusoidal speech modeling; speaker dependent denoising; speech enhancement; STATISTICAL-MODEL; NOISE; SUPPRESSION; HMM;
D O I
10.1109/TASL.2009.2031793
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a new method for the enhancement of speech. The method is designed for scenarios in which targeted speaker enrollment as well as system training within the typical noise environment are feasible. The proposed procedure is fundamentally different from most conventional and state-of-the-art denoising approaches. Instead of filtering a distorted signal we are resynthesizing a new "clean" signal based on its likely characteristics. These characteristics are estimated from the distorted signal. A successful implementation of the proposed method is presented. Experiments were performed in a scenario with roughly one hour of clean speech training data. Our results show that the proposed method compares very favorably to other state-of-the-art systems in both objective and subjective speech quality assessments. Potential applications for the proposed method include jet cockpit communication systems and offline methods for the restoration of audio recordings.
引用
收藏
页码:1243 / 1257
页数:15
相关论文
共 50 条
  • [1] INVENTORY-STYLE SPEECH ENHANCEMENT WITH UNCERTAINTY-OF-OBSERVATION TECHNIQUES
    Nickel, R. M.
    Astudillo, R. F.
    Kolossa, D.
    Zeiler, S.
    Martin, R.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4645 - 4648
  • [2] MEMORY AND COMPLEXITY REDUCTION FOR INVENTORY-STYLE SPEECH ENHANCEMENT SYSTEMS
    Nickel, Robert M.
    Martin, Rainer
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 196 - 200
  • [3] Concatenative Resynthesis with Improved Training Signals for Speech Enhancement
    Syed, Ali Raza
    Trinh Viet Anh
    Mandel, Michael I.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1195 - 1199
  • [4] INVENTORY BASED SPEECH ENHANCEMENT FOR SPEAKER DEDICATED SPEECH COMMUNICATION SYSTEMS
    Xiao, Xiaoqiang
    Lee, Peng
    Nickel, Robert M.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3877 - +
  • [5] SPEAKER INDEPENDENCE OF NEURAL VOCODERS AND THEIR EFFECT ON PARAMETRIC RESYNTHESIS SPEECH ENHANCEMENT
    Maiti, Soumi
    Mandel, Michael, I
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 206 - 210
  • [6] Speech Spectral Envelope Enhancement by HMM-Based Analysis/Resynthesis
    Carmona, Jose L.
    Barker, Jon
    Gomez, Angel M.
    Ma, Ning
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (06) : 563 - 566
  • [7] SPEECH DENOISING BY PARAMETRIC RESYNTHESIS
    Maiti, Soumi
    Mandel, Michael I.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6995 - 6999
  • [8] Speech Inventory Based Discriminative Training for Joint Speech Enhancement and Low-Rate Speech Coding
    Xiao, Xiaoqiang
    Nickel, Robert M.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2398 - +
  • [9] SPEECH STYLE AND ACTIVATION IN A FREE SPEECH
    WALSCHBURGER, P
    BRODA, M
    [J]. PSYCHOLOGISCHE BEITRAGE, 1980, 22 (02): : 304 - 321
  • [10] Speech enhancement for bandlimited speech
    Heide, DA
    Kang, GS
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 393 - 396