An Improved ANN Method Based on Clustering Optimization for Voice Conversion

Cited by: 0
Authors
Chen Xiantong [1]
Zhang Linghua [1]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
Keywords
voice conversion; STRAIGHT; RBF; K-means; PSO; TRANSFORMATION
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Artificial neural networks (ANNs) are commonly used as conversion models in voice conversion systems; among them, the radial basis function (RBF) network is known for its simple convergence and fast learning. To optimize the centers of the RBF network, this article presents a method that uses the K-means algorithm to cluster the data and form the centers, and the particle swarm optimization (PSO) algorithm to optimize the number of clusters, improving the RBF network and thereby the transformation of speech parameters. First, the STRAIGHT model is used to extract linear prediction coefficients and pitch frequencies. These parameters are then fed to the RBF network, and the K-means and PSO algorithms optimize its centers until the fitness value reaches its minimum. Experiments show that this method not only removes the need to search for the best number of clusters one by one, but also effectively improves the performance of the neural network, and that the converted speech is closer to the target speech.
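The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the synthetic arrays stand in for STRAIGHT-derived features, the fitness is assumed to be validation mean squared error, the RBF width heuristic and the PSO constants (0.7, 1.5, 1.5) are assumptions, and the names `kmeans`, `fitness`, and `pso_cluster_count` are hypothetical.

```python
import numpy as np

def kmeans(X, k, rng, iters=20):
    """Plain K-means; returns k cluster centers used as RBF centers."""
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d2.argmin(1)
        for j in range(k):
            pts = X[labels == j]
            if len(pts):                      # keep old center if cluster is empty
                centers[j] = pts.mean(0)
    return centers

def rbf_design(X, centers, width):
    """Gaussian RBF activations of every sample for every center."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2.0 * width ** 2))

def fitness(k, Xtr, Ytr, Xva, Yva, seed=0):
    """Assumed fitness: validation MSE of an RBF net with k K-means centers."""
    rng = np.random.default_rng(seed)
    centers = kmeans(Xtr, k, rng)
    # Assumed width heuristic: mean distance between distinct centers (k >= 2).
    dist = np.sqrt(((centers[:, None, :] - centers[None, :, :]) ** 2).sum(-1))
    width = max(dist.sum() / (k * (k - 1)), 1e-8)
    W, *_ = np.linalg.lstsq(rbf_design(Xtr, centers, width), Ytr, rcond=None)
    pred = rbf_design(Xva, centers, width) @ W
    return float(np.mean((pred - Yva) ** 2))

def pso_cluster_count(fit, k_min, k_max, n_particles=6, iters=10, seed=0):
    """1-D PSO over the cluster count; positions are rounded to integers."""
    rng = np.random.default_rng(seed)
    pos = rng.uniform(k_min, k_max, n_particles)
    vel = np.zeros(n_particles)
    cache = {}

    def evaluate(p):
        k = int(round(float(p)))
        if k not in cache:                    # each k is trained at most once
            cache[k] = fit(k)
        return cache[k]

    pbest = pos.copy()
    pbest_f = np.array([evaluate(p) for p in pos])
    gbest = pbest[pbest_f.argmin()]
    for _ in range(iters):
        r1, r2 = rng.random(n_particles), rng.random(n_particles)
        vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
        pos = np.clip(pos + vel, k_min, k_max)
        f = np.array([evaluate(p) for p in pos])
        better = f < pbest_f
        pbest[better], pbest_f[better] = pos[better], f[better]
        gbest = pbest[pbest_f.argmin()]
    return int(round(float(gbest))), float(pbest_f.min())

# Synthetic stand-in for aligned source/target speech parameters (assumption).
data_rng = np.random.default_rng(42)
X = data_rng.normal(size=(200, 4))
Y = np.tanh(X @ data_rng.normal(size=(4, 4))) + 0.05 * data_rng.normal(size=(200, 4))
Xtr, Ytr, Xva, Yva = X[:150], Y[:150], X[150:], Y[150:]

K_MIN, K_MAX = 2, 20
best_k, best_f = pso_cluster_count(
    lambda k: fitness(k, Xtr, Ytr, Xva, Yva), K_MIN, K_MAX)
print("selected cluster count:", best_k, "validation MSE:", best_f)
```

Caching the fitness per integer cluster count means each candidate network is trained at most once, which reflects the abstract's point that the swarm replaces a one-by-one search over cluster numbers.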
Pages: 464-469
Page count: 6