Vocal Tract Spectrum Transformation Based on Clustering in Voice Conversion System

Cited by: 0
Authors
Xie Weichao [1]
Zhang Linghua [1]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Jiangsu, Peoples R China
Keywords
Voice Conversion; Spectrum Transformation; Cluster; K-Means algorithm; Gaussian Mixture Model (GMM)
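DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract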
In conventional vocal tract spectrum transformation based on the Gaussian Mixture Model (GMM), the transformation rule is not very accurate because the large volume of voice data is time-varying and non-stationary. This paper studies a spectrum transformation method based on a clustering algorithm. First, the training data are partitioned into several clusters, and each cluster is trained separately to obtain a more accurate transformation rule. In the transformation stage, the source parameters of each frame are assigned to one cluster and then converted by that cluster's transformation rule. The K-means algorithm is used as the clustering method to classify the data. Experimental results show that the proposed clustering-based method outperforms transformation by the conventional GMM, and that the K-means variant with 20 centers performs best.
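The two stages described in the abstract (cluster the training frames and fit one rule per cluster, then route each source frame to its cluster's rule) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the per-cluster transformation rule, so an affine least-squares map stands in for it here. The function names (`kmeans`, `train_cluster_rules`, `convert_frames`) are hypothetical, and the source and target frames are assumed to be already time-aligned feature vectors.

```python
import numpy as np

def nearest(x, centers):
    """Index of the closest center (squared Euclidean) for each row of x."""
    d = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def kmeans(x, k, iters=50, seed=0):
    """Plain Lloyd's K-means; returns (centers, labels)."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = nearest(x, centers)
        for j in range(k):
            members = x[labels == j]
            if len(members):  # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers, nearest(x, centers)

def fit_affine(xs, ys):
    """Least-squares affine map ys ~ [xs, 1] @ w (assumed stand-in rule)."""
    a = np.hstack([xs, np.ones((len(xs), 1))])
    w, *_ = np.linalg.lstsq(a, ys, rcond=None)
    return w

def train_cluster_rules(src, tgt, k):
    """Training stage: cluster source frames, fit one rule per cluster."""
    centers, labels = kmeans(src, k)
    global_rule = fit_affine(src, tgt)
    rules = []
    for j in range(k):
        mask = labels == j
        # Assumption: fall back to the global rule for empty clusters.
        rules.append(fit_affine(src[mask], tgt[mask]) if mask.any() else global_rule)
    return centers, rules

def convert_frames(src, centers, rules):
    """Transformation stage: assign each frame to a cluster, apply its rule."""
    labels = nearest(src, centers)
    a = np.hstack([src, np.ones((len(src), 1))])
    out = np.empty((len(src), rules[0].shape[1]))
    for j, w in enumerate(rules):
        mask = labels == j
        out[mask] = a[mask] @ w
    return out
```

With aligned arrays `src` and `tgt` of shape `(frames, dims)`, calling `train_cluster_rules(src, tgt, 20)` followed by `convert_frames(new_src, centers, rules)` would mirror the 20-center configuration that the abstract reports as best; the number of centers is the only setting taken from the source.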
Pages: 236-240
Page count: 5