High Quality Voice Conversion based on ISODATA Clustering Algorithm

被引:0
|
作者
Li, Yanping [1 ]
Zuo, Yutao [1 ]
Yang, Zhen [1 ]
Shao, Xi [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
关键词
voice conversion; ISODATA; similarity; quality; bilinear frequency; Gaussian mixture model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two main challenges introduced in current voice conversion are the dependence on parallel training data and the trade-off between speaker similarity and speech quality. To tackle the latter problem, this paper proposes a novel conversion method based on Iterative Self-organizing DATA Analysis Techniques Algorithm (ISODATA) clustering algorithm. Specially, we use ISODATA during the training of Gaussian mixture model, the optimized mixture number can guarantee the validity and accuracy of the GMM model, which can acquire speaker's identity effectively related to speaker similarity between original target speech and converted speech, Next, we combine improved GMM and bilinear frequency warping for the conversion stage, which can get a good balance between speaker similarity and speech quality. Theory analysis and experimental results demonstrate that the proposed algorithm can achieve higher quality and similarity compared with other two methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Palmprint recognition based on isodata clustering algorithm
    Liu, Fu
    Lin, Cai-Xia
    Cui, Ping-Yuan
    Dong, Tian
    [J]. 2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 1129 - +
  • [2] Clustering analysis for fMRI dataset based on ISODATA algorithm
    Zheng, X
    Cao, ZT
    Shao, B
    Fang, JZ
    He, GG
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1373 - 1377
  • [3] A fast implementation of the ISODATA clustering algorithm
    Memarsadeghi, Nargess
    Mount, David M.
    Netanyahu, Nathan S.
    Le Moigne, Jacqueline
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2007, 17 (01) : 71 - 103
  • [4] A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION
    Chen, Z.
    Zhang, L. H.
    [J]. 2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [5] Improved ISODATA Clustering Method with Parameter Estimation based on Genetic Algorithm
    Arai, Kohei
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 187 - 193
  • [6] Power Consumption Portrait of Users Based on Improved ISODATA Clustering Algorithm
    Yang, HuiXuan
    Su, Ming
    Li, Xin
    Liu, JinHui
    Zhang, RuiZhao
    [J]. 2022 9TH INTERNATIONAL FORUM ON ELECTRICAL ENGINEERING AND AUTOMATION, IFEEA, 2022, : 1060 - 1064
  • [7] THRESHOLDING USING THE ISODATA CLUSTERING-ALGORITHM
    DIASVELASCO, FR
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1980, 10 (11): : 771 - 774
  • [8] An adaptive isodata fuzzy clustering algorithm with partial supervision
    Macario, Valmir
    de Carvalho, Francisco de A. T.
    [J]. PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1978 - 1983
  • [9] ON THE CONVERGENCE OF THE FUZZY CLUSTERING-ALGORITHM FUZZY ISODATA
    VONTRZEBIATOWSKI, G
    BANK, B
    [J]. ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1986, 66 (06): : 201 - 208
  • [10] ON THE LOCAL OPTIMALITY OF THE FUZZY ISODATA CLUSTERING-ALGORITHM
    SELIM, SZ
    ISMAIL, MA
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (02) : 284 - 288