Voice Conversion Based on State Space Model and Considering Global Variance

被引:0
|
作者
Ahangar, Mohsen [1 ]
Ghorbandoost, Mostafa [1 ]
Sheikhzadeh, Hamid [1 ]
Raahemifar, Kaamran [2 ]
Shahrebabaki, Abdoreza Sabzi [1 ]
Amini, Jamal [1 ]
机构
[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran
[2] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
关键词
State space model; global variance; voice conversion;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Voice conversion based on State Space Model (SSM) has been recently proposed to address the discontinuity problem in the traditional frame-based voice conversion by considering the spectral envelope evolutions. However, the results are over-smoothed. To resolve this problem, in this paper we propose a new procedure for integrating the global variance constraint into the SSM-based voice conversion. Moreover, unlike the SSM-based method, we allow the state-vector order to be higher than the feature-vector order. Experimental results verify that the proposed method significantly improves the performance of the SSM-based voice conversion in terms of speaker individuality and speech quality. Our experiments also show that the proposed method outperforms the well-known Maximum Likelihood estimation method that considers the Global Variance in terms of speech quality.
引用
收藏
页码:416 / 421
页数:6
相关论文
共 50 条
  • [1] Voice conversion based on trajectory model training of neural networks considering global variance
    Hosaka, Naoki
    Hashimoto, Kei
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 307 - 311
  • [2] Analysis of State-Space Model based Voice Conversion
    Sun, Jian
    Zhang, Xiongwei
    PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (10): : 373 - 376
  • [3] Voice conversion based on state-space model for modelling spectral trajectory
    Xu, N.
    Yang, Z.
    Zhang, L. H.
    Zhu, W. P.
    Bao, J. Y.
    ELECTRONICS LETTERS, 2009, 45 (14) : 763 - U73
  • [4] MODULAR GLOBAL VARIANCE ENHANCEMENT FOR VOICE CONVERSION SYSTEMS
    Benisty, H.
    Malah, D.
    Crammer, K.
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 370 - 374
  • [5] Voice Conversion using GMAT with Enhanced Global Variance
    Benisty, Hadas
    Malah, David
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 676 - 679
  • [6] Statistical Singing Voice Conversion based on Direct Waveform Modification with Global Variance
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2754 - 2758
  • [7] Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion
    Hwang, Hsin-Te
    Tsao, Yu
    Wang, Hsin-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [8] Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
    Toda, T
    Black, AW
    Tokuda, K
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 9 - 12
  • [9] Voice conversion towards modeling dynamic characteristics using switching state space model
    Ning Xu
    JingYi Bao
    XiaoFeng Liu
    AiMing Jiang
    YiBing Tang
    Science China Information Sciences, 2013, 56 : 1 - 15
  • [10] Voice Conversion with a Strategy for Separating Speaker Individuality Using State-Space Model
    Xu, Ning
    Yang, Zhen
    Guo, Haiyan
    2010 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND INFORMATION SECURITY (WCNIS), VOL 1, 2010, : 298 - 301