Denoising Recurrent Neural Network for Deep Bidirectional LSTM based Voice Conversion

Cited by: 7
Authors
Wu, Jie [1]
Huang, Dongyan [2]
Xie, Lei [1]
Li, Haizhou [2, 3]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China
[2] ASTAR, Inst Infocomm Res, Singapore, Singapore
[3] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
Funding
National Natural Science Foundation of China
Keywords
residual error; Gaussian noise; denoising; recurrent neural network; voice conversion;
DOI
10.21437/Interspeech.2017-694
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper studies post-processing in deep bidirectional Long Short-Term Memory (DBLSTM) based voice conversion, where the statistical parameters are optimized to generate speech that exhibits properties similar to those of the target speech. However, a residual error always remains between the converted speech and the target speech. We reformulate the residual error problem as speech restoration, which aims to recover the target speech samples from the converted ones. Specifically, we propose a denoising recurrent neural network (DeRNN) that introduces regularization during training to shape the distribution of the converted data in latent space. We compare the proposed approach with global variance (GV), modulation spectrum (MS) and recurrent neural network (RNN) based postfilters, which serve a similar purpose. The subjective test results show that the proposed approach significantly outperforms these conventional approaches in terms of quality and similarity.
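The restoration framing in the abstract can be illustrated with a rough sketch of how denoising-style training pairs might be built. This is not the authors' implementation. The abstract does not say where or how noise is injected, so adding zero-mean Gaussian noise to the converted features is an assumption suggested only by the "Gaussian noise" keyword. The names `make_denoising_pairs`, `residual_mse` and `noise_std` are hypothetical, and the recurrent network itself is omitted.

```python
import numpy as np


def make_denoising_pairs(converted, target, noise_std=0.1, seed=0):
    """Build one (noisy_input, clean_target) pair for denoising-style training.

    converted, target: arrays of shape (frames, dims), assumed time-aligned.
    noise_std: hypothetical noise level; not a value from the paper.
    """
    converted = np.asarray(converted, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    if converted.shape != target.shape:
        raise ValueError("converted and target must have the same shape")
    rng = np.random.default_rng(seed)
    # Assumption: corrupt the converted features with zero-mean Gaussian noise.
    noisy = converted + rng.normal(0.0, noise_std, size=converted.shape)
    return noisy, target


def residual_mse(estimate, target):
    """Mean squared residual error between an estimate and the target features."""
    diff = np.asarray(estimate, dtype=np.float64) - np.asarray(target, dtype=np.float64)
    return float(np.mean(diff ** 2))


if __name__ == "__main__":
    # Toy stand-ins for aligned converted/target feature sequences.
    rng = np.random.default_rng(1)
    target = rng.normal(size=(100, 4))
    converted = target + 0.2 * rng.normal(size=(100, 4))
    noisy, clean = make_denoising_pairs(converted, target, noise_std=0.1)
    print("residual before noise:", residual_mse(converted, target))
    print("residual of noisy input:", residual_mse(noisy, clean))
```

A postfilter trained on such pairs would learn to map corrupted converted features back toward the target features; `residual_mse` only measures the gap that such a model would try to reduce.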
Pages: 3379-3383
Page count: 5
Related papers
50 in total
  • [1] Modulation Recognition Based on Denoising Bidirectional Recurrent Neural Network
    Ruiyan Du
    Fulai Liu
    Lijie Zhang
    Yahui Ji
    Jialiang Xu
    Fan Gao
    [J]. Wireless Personal Communications, 2023, 132 : 2437 - 2455
  • [2] Modulation Recognition Based on Denoising Bidirectional Recurrent Neural Network
    Du, Ruiyan
    Liu, Fulai
    Zhang, Lijie
    Ji, Yahui
    Xu, Jialiang
    Gao, Fan
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (04) : 2437 - 2455
  • [3] Deep Convolutional Bidirectional LSTM Recurrent Neural Network for Epileptic Seizure Detection
    Abdelhameed, Ahmed M.
    Daoud, Hisham G.
    Bayoumi, Magdy
    [J]. 2018 16TH IEEE INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2018, : 139 - 143
  • [4] Bidirectional LSTM Recurrent Neural Network for Keyphrase Extraction
    Basaldella, Marco
    Antolli, Elisa
    Serra, Giuseppe
    Tasso, Carlo
    [J]. DIGITAL LIBRARIES AND MULTIMEDIA ARCHIVES, IRCDL 2018, 2018, 806 : 180 - 187
  • [5] Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network
    Kantipudi, M. V. V. Prasad
    Kumar, Sandeep
    Jha, Ashish Kumar
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [6] Deep Bidirectional LSTM Modeling of Timbre and Prosody for Emotional Voice Conversion
    Ming, Huaiping
    Huang, Dongyan
    Xie, Lei
    Wu, Jie
    Dong, Minghui
    Li, Haizhou
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2453 - 2457
  • [7] VOICE CONVERSION USING DEEP BIDIRECTIONAL LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS
    Sun, Lifa
    Kang, Shiyin
    Li, Kun
    Meng, Helen
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4869 - 4873
  • [8] Korean Singing Voice Synthesis System based on an LSTM Recurrent Neural Network
    Kim, Juntae
    Choi, Heejin
    Park, Jinuk
    Hahn, Minsoo
    Kim, Sangjin
    Kim, Jong-Jin
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1551 - 1555
  • [9] Bidirectional LSTM Malicious webpages detection algorithm based on convolutional neural network and independent recurrent neural network
    Huan-huan Wang
    Long Yu
    Sheng-wei Tian
    Yong-fang Peng
    Xin-jun Pei
    [J]. Applied Intelligence, 2019, 49 : 3016 - 3026
  • [10] Bidirectional LSTM Malicious webpages detection algorithm based on convolutional neural network and independent recurrent neural network
    Wang, Huan-huan
    Yu, Long
    Tian, Sheng-wei
    Peng, Yong-fang
    Pei, Xin-jun
    [J]. APPLIED INTELLIGENCE, 2019, 49 (08) : 3016 - 3026