An Improved ANN Method Based on Clustering Optimization for Voice Conversion

Cited by: 0
Authors
Chen Xiantong [1]
Zhang Linghua [1]
Affiliation
[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
Keywords
voice conversion; STRAIGHT; RBF; K-means; PSO; TRANSFORMATION
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Artificial neural networks (ANNs) are commonly used as the conversion model in voice conversion systems; among them, the radial basis function (RBF) network is known for its efficient convergence and fast learning. This article focuses on optimizing the centers of the RBF network: the K-means algorithm clusters the data to form the centers, and the particle swarm optimization (PSO) algorithm optimizes the number of clusters, which improves the RBF network and, in turn, the transformation of speech parameters. First, the STRAIGHT model is used to extract linear prediction coefficients and pitch frequencies. These parameters are then fed to the RBF network, and the K-means and PSO algorithms optimize the network's centers until the fitness value reaches its minimum. Experiments show that this method not only removes the need to test candidate cluster numbers one by one, but also effectively improves the performance of the neural network, and that the converted speech is closer to the target speech.
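The pipeline in the abstract (K-means forms the RBF centers, PSO searches over the number of clusters, and the fitness value is minimized) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not define the fitness function, the RBF width, or the PSO settings, so the validation mean squared error, the width heuristic, the inertia and acceleration constants, and all function names here are assumptions. Random synthetic vectors stand in for the STRAIGHT-derived source and target parameters.

```python
import numpy as np

def kmeans(X, k, rng, iters=50):
    """Plain Lloyd's K-means; returns k cluster centers."""
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(iters):
        # Assign each sample to its nearest center.
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        new = centers.copy()
        for j in range(k):
            members = X[labels == j]
            if len(members):  # keep the old center if a cluster empties
                new[j] = members.mean(axis=0)
        if np.allclose(new, centers):
            break
        centers = new
    return centers

def rbf_design(X, centers, sigma):
    """Gaussian RBF activations, one column per center."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def fit_rbf(X, Y, k, rng):
    """K-means centers, shared width, least-squares output weights."""
    centers = kmeans(X, k, rng)
    # Assumed width heuristic: mean pairwise distance between centers.
    dc = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=2)
    sigma = max(dc[np.triu_indices(k, 1)].mean(), 1e-6) if k > 1 else 1.0
    W, *_ = np.linalg.lstsq(rbf_design(X, centers, sigma), Y, rcond=None)
    return centers, sigma, W

def fitness(k, Xtr, Ytr, Xval, Yval, seed):
    """Assumed fitness: validation MSE of an RBF net with k centers."""
    centers, sigma, W = fit_rbf(Xtr, Ytr, k, np.random.default_rng(seed))
    pred = rbf_design(Xval, centers, sigma) @ W
    return float(np.mean((pred - Yval) ** 2))

def pso_cluster_count(Xtr, Ytr, Xval, Yval, k_min=2, k_max=30,
                      n_particles=6, n_iter=10, seed=0):
    """1-D PSO over the cluster count; positions are rounded to integers."""
    rng = np.random.default_rng(seed)
    to_k = lambda p: int(np.clip(np.rint(p), k_min, k_max))
    score = lambda p: fitness(to_k(p), Xtr, Ytr, Xval, Yval, seed)
    pos = rng.uniform(k_min, k_max, n_particles)
    vel = np.zeros(n_particles)
    pbest, pbest_f = pos.copy(), np.array([score(p) for p in pos])
    g = pbest_f.argmin()
    gbest, gbest_f = pbest[g], pbest_f[g]
    for _ in range(n_iter):
        r1, r2 = rng.random(n_particles), rng.random(n_particles)
        # Standard velocity update with assumed constants 0.7, 1.5, 1.5.
        vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
        pos = np.clip(pos + vel, k_min, k_max)
        f = np.array([score(p) for p in pos])
        better = f < pbest_f
        pbest[better], pbest_f[better] = pos[better], f[better]
        g = pbest_f.argmin()
        gbest, gbest_f = pbest[g], pbest_f[g]
    return to_k(gbest), float(gbest_f)

# Synthetic stand-in for aligned source/target feature frames.
data_rng = np.random.default_rng(1)
X = data_rng.normal(size=(240, 4))
Y = np.tanh(X @ data_rng.normal(size=(4, 4)))
best_k, best_mse = pso_cluster_count(X[:200], Y[:200], X[200:], Y[200:])
print(best_k, best_mse)
```

Rounding a continuous particle position to an integer is one simple way to apply PSO to a discrete quantity such as a cluster count; the paper may encode the search differently.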
Pages: 464-469
Number of pages: 6