Voice Conversion Based on Hybrid SVR and GMM

Cited by: 1
Authors
Song, Peng [1]
Jin, Yun [2, 3]
Zhao, Li [1]
Zou, Cairong [1]
Affiliations
[1] Southeast Univ, Minist Educ, Key Lab Underwater Acoust Signal Proc, Nanjing 210096, Jiangsu, Peoples R China
[2] Xuzhou Normal Univ, Sch Phys & Elect Engn, Xuzhou 221116, Peoples R China
[3] Southeast Univ, Minist Educ, Key Lab Child Dev & Learning Sci, Nanjing 210096, Jiangsu, Peoples R China
Keywords
voice conversion; support vector regression; Gaussian mixture model; F0; prediction; speaker-specific information;
DOI
10.2478/v10168-012-0020-9
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
This paper presents a novel voice conversion (VC) method based on a hybrid of support vector regression (SVR) and a Gaussian mixture model (GMM). The mapping abilities of SVR and GMM are exploited to map the spectral features of the source speaker to those of the target speaker. A new F0 transformation strategy is also presented: the F0s are modeled together with the spectral features in a joint GMM and predicted from the converted spectral features using the SVR method. Subjective and objective tests are carried out to evaluate VC performance; the experimental results show that speech converted with the proposed method achieves better quality than speech converted with the state-of-the-art GMM method. In addition, a VC method based on non-parallel data is proposed, in which the speaker-specific information is investigated using the SVR method. Preliminary subjective experiments demonstrate that this method is feasible when a parallel corpus is not available.
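For orientation, the conventional joint-density GMM mapping that serves as the baseline in the abstract is commonly written as below. This is the standard textbook form and not the hybrid SVR-GMM formulation of the paper, which the abstract does not spell out. The symbols are notation assumed here: x and y are source and target spectral feature vectors, M is the number of mixture components, and \alpha_m, \mu_m, \Sigma_m are the weight, mean, and covariance blocks of component m of the joint GMM.

```latex
\hat{y} = F(x) = \sum_{m=1}^{M} p(m \mid x)
  \left[ \mu_m^{y} + \Sigma_m^{yx} \left( \Sigma_m^{xx} \right)^{-1} \left( x - \mu_m^{x} \right) \right],
\qquad
p(m \mid x) = \frac{\alpha_m \, \mathcal{N}\!\left(x;\, \mu_m^{x},\, \Sigma_m^{xx}\right)}
                   {\sum_{k=1}^{M} \alpha_k \, \mathcal{N}\!\left(x;\, \mu_k^{x},\, \Sigma_k^{xx}\right)}
```

Each component contributes a linear regression from x to y, weighted by the posterior probability that x belongs to that component.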
Pages: 143-149
Number of pages: 7