High-quality voice conversion system based on GMM statistical parameters and RBF neural network

Authors
CHEN Xian-tong [1]
ZHANG Ling-hua [1]
Affiliation
[1] College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications
Keywords
VC system; STRAIGHT; vocal tract spectrum; GMM; RBF
DOI
Not available
Chinese Library Classification number
TN912.3 [Speech signal processing]
Discipline classification code
0711
Abstract
A voice conversion (VC) system was designed based on a Gaussian mixture model (GMM) and a radial basis function (RBF) neural network. As a voice conversion model, an RBF network needs a large amount of training data to perform well. For a given speech sample, networks trained on different segments of the data yield different conversion quality. Because searching segment by segment for the best conversion is laborious, a method is proposed that uses a GMM to summarize the data statistically before the RBF network is trained. The speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT) model is used to extract the vocal tract spectrum accurately. A GMM is then used to classify the numerous spectral parameters, and the resulting mean parameters are used to train the RBF network. Experiments show that the soft classification ability of the GMM quickly reduces and classifies the training data while preserving the training effect, which in turn lowers the complexity of data selection. Compared with conventional RBF network training methods, this method transforms spectral parameters more effectively and improves the quality of the converted speech.
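The pipeline described in the abstract (GMM statistics first, RBF training on the resulting means) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the features are scalars rather than STRAIGHT spectral vectors, the GMM is fitted on joint (source, target) pairs with diagonal covariances, and the RBF network is a plain Gaussian-kernel interpolator whose centers are the source halves of the component means. All function names (`fit_diag_gmm`, `train_rbf`, `build_converter`, `make_toy_pairs`) are hypothetical.

```python
import math
import random


def fit_diag_gmm(data, k, iters=30):
    """Fit a diagonal-covariance GMM with EM; return the k component means."""
    n, dim = len(data), len(data[0])
    ordered = sorted(data)
    # Deterministic start: means taken at evenly spaced quantiles of the data.
    means = [list(ordered[(2 * j + 1) * n // (2 * k)]) for j in range(k)]
    variances = [[1.0] * dim for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: soft-assign every frame to every component.
        resp = []
        for x in data:
            logp = []
            for j in range(k):
                ll = math.log(weights[j])
                for d in range(dim):
                    v = variances[j][d]
                    ll -= 0.5 * (math.log(2 * math.pi * v) + (x[d] - means[j][d]) ** 2 / v)
                logp.append(ll)
            top = max(logp)
            p = [math.exp(v - top) for v in logp]
            total = sum(p)
            resp.append([v / total for v in p])
        # M-step: re-estimate weights, means and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12
            weights[j] = nj / n
            for d in range(dim):
                mu = sum(r[j] * x[d] for r, x in zip(resp, data)) / nj
                var = sum(r[j] * (x[d] - mu) ** 2 for r, x in zip(resp, data)) / nj
                means[j][d], variances[j][d] = mu, max(var, 1e-6)
    return means


def solve_linear(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (m[i][n] - sum(m[i][c] * x[c] for c in range(i + 1, n))) / m[i][i]
    return x


def train_rbf(centers, targets, width, ridge=1e-6):
    """Solve for output weights of a Gaussian RBF layer centered on `centers`."""
    phi = [[math.exp(-((ci - cj) ** 2) / (2 * width ** 2)) for cj in centers]
           for ci in centers]
    for i in range(len(centers)):
        phi[i][i] += ridge  # small ridge term keeps the system well conditioned
    return solve_linear(phi, targets)


def rbf_predict(x, centers, out_w, width):
    """Convert one source value with the trained RBF layer."""
    return sum(w * math.exp(-((x - c) ** 2) / (2 * width ** 2))
               for w, c in zip(out_w, centers))


def build_converter(pairs, k, width):
    """Reduce paired frames to k GMM means, then train the RBF layer on them."""
    means = sorted(fit_diag_gmm(pairs, k))
    centers = [m[0] for m in means]   # source half of each joint mean
    targets = [m[1] for m in means]   # target half of each joint mean
    return centers, targets, train_rbf(centers, targets, width)


def make_toy_pairs(seed=0):
    """Synthetic aligned (source, target) scalars in four clusters (toy data)."""
    rng = random.Random(seed)
    pairs = []
    for c in (0.0, 2.0, 4.0, 6.0):
        for _ in range(50):
            x = rng.gauss(c, 0.2)
            pairs.append((x, 1.5 * x + 1.0 + rng.gauss(0.0, 0.1)))
    return pairs


if __name__ == "__main__":
    pairs = make_toy_pairs()
    centers, targets, out_w = build_converter(pairs, k=4, width=1.0)
    print(len(pairs), "frame pairs reduced to", len(centers), "training pairs")
    print("converted value for source 2.0:", rbf_predict(2.0, centers, out_w, 1.0))
```

In this sketch the RBF layer sees only as many training pairs as there are GMM components, which mirrors the data reduction the abstract attributes to the GMM stage. A realistic system would use multi-dimensional spectral features and a tuned number of components and kernel width.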
Pages: 68-75, 93
Number of pages: 9