DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL

Authors
Huang, Qizheng [1]
Bao, Changchun [1]
Wang, Xianyun [1]
Xiang, Yang [1]
Affiliations
[1] Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China
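Funding
National Natural Science Foundation of China
Keywords
Speech enhancement; MBE model; DNN; acoustic features; analysis-with-synthesis; noise estimation
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract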
This paper proposes a novel deep neural network (DNN) based speech enhancement method that uses the multi-band excitation (MBE) model. The proposed system has two stages: a training stage and an enhancement stage. In the training stage, two DNNs with different targets are trained: one predicts the harmonic magnitudes of clean speech, and the other predicts its band difference function. Both DNNs take the log-power spectra (LPS) of noisy speech as input. In the enhancement stage, the enhanced speech is obtained by MBE speech synthesis from the DNN outputs and an online estimate of the pitch period. With the proposed method, the MBE model parameters can be estimated accurately, so the enhanced speech is synthesized with high quality while the noise between the harmonics is effectively eliminated. Experiments show that the proposed method outperforms the reference methods in both speech quality and intelligibility.
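The two signal-processing ends of the pipeline described in the abstract can be illustrated with a minimal sketch. It computes LPS features from a noisy waveform and synthesizes a single voiced frame as a sum of harmonics of the pitch frequency. The function names, the 512-sample frame, the 256-sample hop, the Hann window, and the 16 kHz sampling rate are all assumptions chosen for illustration; the record does not state the paper's settings. The DNNs, the band difference function, the pitch estimator, and the unvoiced (noise-excited) bands of MBE synthesis are not reproduced here.

```python
import numpy as np


def log_power_spectra(signal, frame_len=512, hop=256, eps=1e-10):
    """Return LPS features of shape (n_frames, frame_len // 2 + 1).

    Assumes len(signal) >= frame_len. Frame length, hop, and window
    are illustrative choices, not values taken from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop:i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    spectrum = np.fft.rfft(frames, axis=1)
    # eps keeps the logarithm finite for silent bins.
    return np.log(np.abs(spectrum) ** 2 + eps)


def synthesize_voiced_frame(f0_hz, harmonic_mags, fs=16000, n_samples=512):
    """Sum of zero-phase cosines at integer multiples of f0_hz.

    A simplified stand-in for the voiced part of MBE synthesis:
    frame[n] = sum_k A_k * cos(2 * pi * k * f0 * n / fs).
    """
    t = np.arange(n_samples) / fs
    mags = np.asarray(harmonic_mags, dtype=float)
    k = np.arange(1, len(mags) + 1)
    phases = 2.0 * np.pi * k[:, None] * f0_hz * t[None, :]
    return (mags[:, None] * np.cos(phases)).sum(axis=0)


# Usage: one second of synthetic noise stands in for noisy speech.
noisy = np.random.default_rng(0).standard_normal(16000)
features = log_power_spectra(noisy)
frame = synthesize_voiced_frame(200.0, [1.0, 0.5])
```

In a full system, features like these would be fed to the two trained networks, and the predicted harmonic magnitudes would replace the hand-picked `[1.0, 0.5]` used above.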
Pages: 196-200
Page count: 5