Voice conversion based on Gaussian processes by using kernels modeling the spectral density with Gaussian mixture models

被引:0
|
作者
Bao, Jingyi [1 ]
Xu, Ning [2 ,3 ]
机构
[1] Changzhou Inst Technol, Sch Elect Informat & Elect Engn, Liaohe Rd 666, Changzhou City 213032, Peoples R China
[2] Hohai Univ, Coll IoT Engn, Dept Commun Engn, North Jinling Rd 200, Changzhou City 213022, Peoples R China
[3] Hohai Univ, Coll IoT Engn, Changzhou Key Lab Robot & Intelligent Technol, North Jinling Rd 200, Changzhou City 213022, Peoples R China
来源
MODERN PHYSICS LETTERS B | 2018年 / 32卷 / 34-36期
关键词
Voice conversion; Gaussian process; flexible kernel; Gaussian mixture model;
D O I
10.1142/S0217984918400961
中图分类号
O59 [应用物理学];
学科分类号
摘要
Voice conversion (VC) is a technique that aims to transform the individuality of a source speech so as to mimic that of a target speech while keeping the message unaltered. In our previous work, Gaussian process (GP) was introduced into the literature of VC for the first time, for the sake of overcoming the "over-fitting" problem inherent in the state-of-the-art VC methods, which gives very promising results. However, standard GP usually acts as somewhat a smoothing device more than a universal approximator. In this paper, we further attempt to improve the flexibility of GP-based VC by resorting to the expressive kernels that are derived to model the spectral density with Gaussian mixture model (GMM). Our new method benefits from the expressiveness of the new kernel while the inference of GP remains simple and analytic as usual. Experiments demonstrate both objectively and subjectively that the individualities of the converted speech are much more closer to those of the target while speech quality obtained is comparable to the standard GP-based method.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Voice Conversion Using Gaussian Mixture Models
    D'souza, Kevin
    Talele, K. T. V.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMMUNICATION, INFORMATION & COMPUTING TECHNOLOGY (ICCICT), 2015,
  • [2] STORYTELLING VOICE CONVERSION: EVALUATION EXPERIMENT USING GAUSSIAN MIXTURE MODELS
    Pribil, Jiri
    Pribilova, Anna
    Durackova, Daniela
    [J]. JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2015, 66 (04): : 194 - 202
  • [3] Spectral Mixture Kernels for Multi-Output Gaussian Processes
    Parra, Gabriel
    Tobar, Felipe
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [4] Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model
    Nguyen, Binh Phu
    Akagi, Masato
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 222 - 227
  • [5] Voice Conversion Based on Gaussian Mixture Modules with Minimum Distance Spectral Mapping
    Jin, Gui
    Johnson, Michael T.
    Liu, Jia
    Lin, Xiaokang
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2015, : 356 - 359
  • [6] Voice conversion using Viterbi algorithm based on Gaussian mixture model
    Jian Zhi-Hua
    Yang Zhen
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2007, : 40 - 43
  • [7] Voice Conversion Using Structrued Gaussian Mixture Model
    Zeng, Daojian
    Yu, Yibiao
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 541 - 544
  • [8] Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models
    Doi, Hironori
    Nakamura, Keigo
    Toda, Tomoki
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2472 - 2482
  • [9] Compressible spectral mixture kernels with sparse dependency structures for Gaussian processes
    Chen, Kai
    Yin, Feng
    Cui, Shuguang
    [J]. SIGNAL PROCESSING, 2023, 213
  • [10] Voice conversion using canonical correlation analysis based on Gaussian mixture model
    Jian, ZhiHua
    Yang, Zhen
    [J]. SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 1, PROCEEDINGS, 2007, : 210 - +