High-quality voice conversion system based on GMM statistical parameters and RBF neural network

被引:0
|
作者
CHEN Xian-tong [1 ]
ZHANG Ling-hua [1 ]
机构
[1] College of Telecommunications and Information Engineering,Nanjing University of Posts and Telecommunications
关键词
VC system; STRAIGHT; vocal tract spectrum; GMM; RBF;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
A voice conversion(VC) system was designed based on Gaussian mixture model(GMM) and radial basis function(RBF) neural network. As a voice conversion model, RBF network needs quantities of training data to improve its performance. For one speech, the networks trained by different segments of data have different transformation effects. Since trying segment by segment to obtain the best conversion effect is complex, a conversion method was proposed, that uses GMM for statistics before training RBF network to aim at the problem. The speech transformation and representation using adaptive interpolation of weighted spectrum(STRAIGHT) model is used for accurate extraction of vocal tract spectrum. Then GMM is used to classify the numerous spectral parameters. The obtained mean parameters were trained in RBF network. Experiment reveals that, the soft classification ability of GMM can promptly realize the reduction and classification of training data under the premise of ensuring the training effect. The selection complexity is decreased thereafter. Compared to the conventional RBF network training methods, this method can make the transformation of spectral parameters more effective and improve the quality of converted speech.
引用
收藏
页码:68 / 75 +93
页数:9
相关论文
共 50 条
  • [1] High-quality voice conversion system based on GMM statistical parameters and RBF neural network
    CHEN Xian-tong
    ZHANG Ling-hua
    [J]. TheJournalofChinaUniversitiesofPostsandTelecommunications., 2014, 21 (05) - 75+93
  • [2] Cepstrum Liftering based Voice Conversion using RBF and GMM
    Nirmal, Jagannath
    Kachare, Pramod
    Patnaik, Suprava
    Zaveri, Mukesh
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 570 - 575
  • [3] On combining statistical methods and frequency warping for high-quality voice conversion
    Erro, Daniel
    Polyakova, Tatyana
    Moreno, Asuncion
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4665 - 4668
  • [4] HIGH-QUALITY NONPARALLEL VOICE CONVERSION BASED ON CYCLE-CONSISTENT ADVERSARIAL NETWORK
    Fang, Fuming
    Yamagishi, Junichi
    Echizen, Isao
    Lorenzo-Trueba, Jaime
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5279 - 5283
  • [5] FAST AND HIGH-QUALITY SINGING VOICE SYNTHESIS SYSTEM BASED ON CONVOLUTIONAL NEURAL NETWORKS
    Nakamura, Kazuhiro
    Takaki, Shinji
    Hashimoto, Kei
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7239 - 7243
  • [6] Design and Implementation of Voice Conversion System Based on GMM and ANN
    Yang, Man
    Que, Dashun
    Li, Bei
    [J]. MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 624 - 631
  • [7] High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder
    Chen, Kuan
    Chen, Bo
    Lai, Jiahao
    Yu, Kai
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1993 - 1997
  • [8] A Revisit to Feature Handling for High-quality Voice Conversion Based on Gaussian Mixture Model
    Suda, Hitoshi
    Kotani, Gaku
    Takamichi, Shinnosuke
    Saito, Daisuke
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 816 - 822
  • [9] Voice Conversion System Based on Deep Neural Network Capable of Parallel Computation
    Sato, Kunihiko
    Rekimoto, Jun
    [J]. 25TH 2018 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2018, : 677 - 678
  • [10] Study on Color Space Conversion Based on RBF Neural Network
    Cao Congjun
    Liu Qiangjun
    [J]. PRINTING AND PACKAGING STUDY, 2011, 174 : 28 - 31