Voice Conversion Based on State Space Model and Considering Global Variance

被引:0
|
作者
Ahangar, Mohsen [1 ]
Ghorbandoost, Mostafa [1 ]
Sheikhzadeh, Hamid [1 ]
Raahemifar, Kaamran [2 ]
Shahrebabaki, Abdoreza Sabzi [1 ]
Amini, Jamal [1 ]
机构
[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran
[2] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
关键词
State space model; global variance; voice conversion;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Voice conversion based on State Space Model (SSM) has been recently proposed to address the discontinuity problem in the traditional frame-based voice conversion by considering the spectral envelope evolutions. However, the results are over-smoothed. To resolve this problem, in this paper we propose a new procedure for integrating the global variance constraint into the SSM-based voice conversion. Moreover, unlike the SSM-based method, we allow the state-vector order to be higher than the feature-vector order. Experimental results verify that the proposed method significantly improves the performance of the SSM-based voice conversion in terms of speaker individuality and speech quality. Our experiments also show that the proposed method outperforms the well-known Maximum Likelihood estimation method that considers the Global Variance in terms of speech quality.
引用
收藏
页码:416 / 421
页数:6
相关论文
共 50 条
  • [41] A Bayesian Approach to Voice Conversion Based on GMMs Using Multiple Model Structures
    Li, Lei
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 668 - 671
  • [42] A system for voice conversion based on probabilistic classification and a harmonic plus noise model
    Stylianou, Y
    Cappe, O
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 281 - 284
  • [43] On state space model based predictive control
    Di Ruscio, D
    Foss, B
    DYNAMICS & CONTROL OF PROCESS SYSTEMS 1998, VOLUMES 1 AND 2, 1999, : 301 - 306
  • [44] Speaker identification based on state space model
    Xu L.
    Yang Z.
    International Journal of Speech Technology, 2016, 19 (2) : 407 - 414
  • [45] Voice conversion using canonical correlation analysis based on Gaussian mixture model
    Jian, ZhiHua
    Yang, Zhen
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 1, PROCEEDINGS, 2007, : 210 - +
  • [46] Online Model Adaptation for Voice Conversion using Model-based Speech Synthesis Techniques
    Wu, Dalei
    Li, Baojie
    Jiang, Hui
    Fu, Qian-Jie
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1611 - +
  • [47] Space-based observation modeling method considering resident space object state uncertainty
    Cai, Yifan
    Colombo, Camilla
    AEROSPACE SCIENCE AND ENGINEERING, IV AEROSPACE PHD-DAYS 2024, 2024, 42 : 183 - 187
  • [48] Dynamic Model Selection for Spectral Voice Conversion
    Lanchantin, Pierre
    Rodet, Xavier
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1720 - 1723
  • [49] Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
    Wu, Yi-Jian
    Zen, Heiga
    Nankaku, Yoshilliko
    Tokuda, Keiichi
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4621 - 4624
  • [50] A Color to Grayscale Conversion Considering Local and Global Contrast
    Kuk, Jung Gap
    Ahn, Jae Hyun
    Cho, Nam Ik
    COMPUTER VISION - ACCV 2010, PT IV, 2011, 6495 : 513 - 524