Vocal Tract Spectrum Transformation Based on Clustering in Voice Conversion System

被引:0
|
作者
Xie Weichao [1 ]
Zhang Linghua [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Jiangsu, Peoples R China
关键词
Voice Conversion; Spectrum Transformation; Cluster; K-Means algorithm; Gaussian Mixture Model (GMM);
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
By the conventional vocal tract spectrum transformation based on Gaussian Mixture Model (GMM), the transformation rule is not very accurate because of the large amount of voice data which is time-varying and non-stationary. This paper mainly studies a method of spectrum transformation based on clustering algorithm. First of all, the training data are classified into several clusters and each cluster is trained relatively to get a more accurate transformation rule. And in the stage of transformation, the source parameters of each frame are classified into one cluster, and then are converted by the transformation rule of that cluster. In this paper, K-means algorithm is used as the clustering method to classified data. Experiment results show that proposed method based on clustering is better than the transformation by conventional GMM, especially the one by K-Means algorithm with 20 centers is the best one.
引用
收藏
页码:236 / 240
页数:5
相关论文
共 50 条
  • [1] Voice conversion by prosody and vocal tract modification
    Rao, K. Sreenivasa
    Yegnanarayana, B.
    [J]. ICIT 2006: 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2006, : 111 - +
  • [2] On the transformation of the speech spectrum for voice conversion
    Baudoin, G
    Stylianou, Y
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1405 - 1408
  • [3] Voice Conversion System using SVM for Vocal Tract Modification and Codebook based Model for Pitch Contour Modification
    Laskar, R. H.
    Talukdar, F. A.
    Bhattacharjee, Rajib
    Das, Saugat
    [J]. 2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 2205 - 2210
  • [4] A preliminary study of voice quality transformation based on modifications to the neutral vocal tract area function
    Story, BH
    Titze, IR
    [J]. JOURNAL OF PHONETICS, 2002, 30 (03) : 485 - 509
  • [5] Subdividing of Pathological Voice Based on Vocal Tract Area
    Shao, Yating
    Xiao, Zhongzhe
    Zhang, Xiaojun
    Wu, Di
    Tao, Zhi
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1126 - 1130
  • [6] Mapping Articulatory-Features to Vocal-Tract Parameters for Voice Conversion
    Ariwardhani, Narpendyah Wisjnu
    Kimura, Masashi
    Iribe, Yurie
    Katsurada, Kouichi
    Nitta, Tsuneo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (04): : 911 - 918
  • [7] Iterative MMSE Estimation of Vocal Tract Length Normalization Factors for Voice Transformation
    Erro, Daniel
    Navas, Eva
    Hernaez, Inma
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 86 - 89
  • [8] Voice Pathology Detection Based on the Vocal Fold Signal and the Vocal Tract Signal Separation
    Zhang, Xuehui
    Hu, Weiping
    Zeng, Ying
    [J]. 2018 IEEE 3RD INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC), 2018, : 845 - 849
  • [9] Language Identification Based on Vocal Tract Spectrum Parameters
    Shao, Yu-Bin
    Liu, Jing
    Long, Hua
    Du, Qing-Zhi
    Li, Yi-Min
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2021, 44 (03): : 112 - 119
  • [10] Vocal Tract Imaging System for Post-Laryngectomy Voice Replacement
    Cai, Jun
    Hueber, Thomas
    Manitsaris, Sotiris
    Roussel, Pierre
    Crevier-Buchman, Lise
    Stone, Maureen
    Pillot-Loiseau, Claire
    Chollet, Gerard
    Dreyfus, Gerard
    Denby, Bruce
    [J]. 2013 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2013, : 676 - 680