High-quality voice conversion system based on GMM statistical parameters and RBF neural network

被引:0
|
作者
CHEN Xian-tong [1 ]
ZHANG Ling-hua [1 ]
机构
[1] College of Telecommunications and Information Engineering,Nanjing University of Posts and Telecommunications
关键词
VC system; STRAIGHT; vocal tract spectrum; GMM; RBF;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
A voice conversion(VC) system was designed based on Gaussian mixture model(GMM) and radial basis function(RBF) neural network. As a voice conversion model, RBF network needs quantities of training data to improve its performance. For one speech, the networks trained by different segments of data have different transformation effects. Since trying segment by segment to obtain the best conversion effect is complex, a conversion method was proposed, that uses GMM for statistics before training RBF network to aim at the problem. The speech transformation and representation using adaptive interpolation of weighted spectrum(STRAIGHT) model is used for accurate extraction of vocal tract spectrum. Then GMM is used to classify the numerous spectral parameters. The obtained mean parameters were trained in RBF network. Experiment reveals that, the soft classification ability of GMM can promptly realize the reduction and classification of training data under the premise of ensuring the training effect. The selection complexity is decreased thereafter. Compared to the conventional RBF network training methods, this method can make the transformation of spectral parameters more effective and improve the quality of converted speech.
引用
收藏
页码:68 / 75 +93
页数:9
相关论文
共 50 条
  • [31] Risk assessment of power system network security based on RBF neural network
    Yu, Yunhao
    Di, Chameiling
    Guo, Xiang
    [J]. International Journal of Power and Energy Conversion, 2023, 14 (2-3) : 148 - 158
  • [32] Neural network-based voice quality measurement technique
    Tarraf, A
    Meyers, M
    [J]. IEEE INTERNATIONAL SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, PROCEEDINGS, 1999, : 375 - 381
  • [33] Mandarin-Tibetan Cross-Lingual Voice Conversion System Based on Deep Neural Network
    Gan, Zhenye
    Xing, Xiaotian
    Yang, Hongwu
    Zhao, Guangying
    [J]. PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 67 - 71
  • [34] A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models
    Takamichi, Shinnosuke
    Toda, Tomoki
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2490 - 2498
  • [35] High Quality Voice Conversion based on ISODATA Clustering Algorithm
    Li, Yanping
    Zuo, Yutao
    Yang, Zhen
    Shao, Xi
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [36] The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0 Conversion
    Chen, Ling-Hui
    Liu, Li-Juan
    Ling, Zhen-Hua
    Jiang, Yuan
    Dai, Li-Rong
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1642 - 1646
  • [37] Prediction of the high-quality development level of inbound tourism based on adaptive neural network technology
    Zhang, Hongxi
    Wei, Wei
    Liu, Qiong
    [J]. JOURNAL OF CONTROL AND DECISION, 2023, 10 (01) : 112 - 125
  • [38] High-quality direct ghost imaging of random dynamic targets based on convolutional neural network
    Liu, Qing
    Yin, LongFei
    Zhan, HaoDi
    Lu, YiQi
    Zhu, LingYun
    Long, XueWen
    Wu, GuoHua
    [J]. OPTICS AND LASER TECHNOLOGY, 2024, 179
  • [39] A Novel RBF Neural Network Design Based On Immune Algorithm System
    Li, Fei
    Yang, Cuili
    Qiao, Junfei
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 4598 - 4603
  • [40] Power System's Damping Analysis Based on RBF Neural Network
    Bu Jing
    Jiang Ning-qiang
    [J]. INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT B, 2012, 24 : 1018 - 1023