Voice conversion based on Gaussian processes by using kernels modeling the spectral density with Gaussian mixture models

被引:0
|
作者
Bao, Jingyi [1 ]
Xu, Ning [2 ,3 ]
机构
[1] Changzhou Inst Technol, Sch Elect Informat & Elect Engn, Liaohe Rd 666, Changzhou City 213032, Peoples R China
[2] Hohai Univ, Coll IoT Engn, Dept Commun Engn, North Jinling Rd 200, Changzhou City 213022, Peoples R China
[3] Hohai Univ, Coll IoT Engn, Changzhou Key Lab Robot & Intelligent Technol, North Jinling Rd 200, Changzhou City 213022, Peoples R China
来源
MODERN PHYSICS LETTERS B | 2018年 / 32卷 / 34-36期
关键词
Voice conversion; Gaussian process; flexible kernel; Gaussian mixture model;
D O I
10.1142/S0217984918400961
中图分类号
O59 [应用物理学];
学科分类号
摘要
Voice conversion (VC) is a technique that aims to transform the individuality of a source speech so as to mimic that of a target speech while keeping the message unaltered. In our previous work, Gaussian process (GP) was introduced into the literature of VC for the first time, for the sake of overcoming the "over-fitting" problem inherent in the state-of-the-art VC methods, which gives very promising results. However, standard GP usually acts as somewhat a smoothing device more than a universal approximator. In this paper, we further attempt to improve the flexibility of GP-based VC by resorting to the expressive kernels that are derived to model the spectral density with Gaussian mixture model (GMM). Our new method benefits from the expressiveness of the new kernel while the inference of GP remains simple and analytic as usual. Experiments demonstrate both objectively and subjectively that the individualities of the converted speech are much more closer to those of the target while speech quality obtained is comparable to the standard GP-based method.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Eigenvoice Conversion Based on Gaussian Mixture Model
    Toda, Tomoki
    Ohtani, Yamato
    Shikano, Kiyohiro
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2446 - 2449
  • [42] Stratified multi-density spectral clustering using Gaussian mixture model
    Yue, Guanli
    Deng, Ansheng
    Qu, Yanpeng
    Cui, Hui
    Wang, Xueying
    [J]. INFORMATION SCIENCES, 2023, 633 : 182 - 203
  • [43] Stratified Multi-Density Spectral Clustering Using Gaussian Mixture Model
    Yue, Guanli
    Deng, Ansheng
    Qu, Yanpeng
    Cui, Hui
    Wang, Xueying
    [J]. SSRN, 2023,
  • [44] Variance Matrix Priors for Dirichlet Process Mixture Models With Gaussian Kernels
    Jing, Wei
    Papathomas, Michail
    Liverani, Silvia
    [J]. INTERNATIONAL STATISTICAL REVIEW, 2024,
  • [45] Gaussian Processes Spectral Kernels Recover Brain Metastable Oscillatory Modes
    Prieur-Coloma, Yunier
    Torres, Felipe
    Guevara, Pamela
    Contreras-Reyes, Javier E.
    El-Deredy, Wael
    [J]. 2023 19TH INTERNATIONAL SYMPOSIUM ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, SIPAIM, 2023,
  • [46] Nucleus Segmentation Using Gaussian Mixture based Shape Models
    Lee, Hyun-Gyu
    Lee, Sang-Chul
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (01) : 235 - 243
  • [47] Real Life Emotion Classification using Spectral Features and Gaussian Mixture Models
    Koolagudi, Shashidhar G.
    Barthwal, Anurag
    Devliyal, Swati
    Rao, K. Sreenivasa
    [J]. INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3892 - 3899
  • [48] Ensemble Gaussian mixture models for probability density estimation
    Glodek, Michael
    Schels, Martin
    Schwenker, Friedhelm
    [J]. COMPUTATIONAL STATISTICS, 2013, 28 (01) : 127 - 138
  • [49] Edgeworth-expanded Gaussian mixture density modeling
    Van Hulle, MM
    [J]. NEURAL COMPUTATION, 2005, 17 (08) : 1706 - 1714
  • [50] Ensemble Gaussian mixture models for probability density estimation
    Michael Glodek
    Martin Schels
    Friedhelm Schwenker
    [J]. Computational Statistics, 2013, 28 : 127 - 138