Voice Conversion Based on State Space Model and Considering Global Variance

被引：0

作者：

Ahangar, Mohsen ^{[1
]}

Ghorbandoost, Mostafa ^{[1
]}

Sheikhzadeh, Hamid ^{[1
]}

Raahemifar, Kaamran ^{[2
]}

Shahrebabaki, Abdoreza Sabzi ^{[1
]}

Amini, Jamal ^{[1
]}

机构：

[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran

[2] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada

来源：

2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013) | 2013年

关键词：

State space model; global variance; voice conversion;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Voice conversion based on State Space Model (SSM) has been recently proposed to address the discontinuity problem in the traditional frame-based voice conversion by considering the spectral envelope evolutions. However, the results are over-smoothed. To resolve this problem, in this paper we propose a new procedure for integrating the global variance constraint into the SSM-based voice conversion. Moreover, unlike the SSM-based method, we allow the state-vector order to be higher than the feature-vector order. Experimental results verify that the proposed method significantly improves the performance of the SSM-based voice conversion in terms of speaker individuality and speech quality. Our experiments also show that the proposed method outperforms the well-known Maximum Likelihood estimation method that considers the Global Variance in terms of speech quality.

引用

页码：416 / 421

页数：6

共 50 条

[41] A Bayesian Approach to Voice Conversion Based on GMMs Using Multiple Model Structures
Li, Lei
Nankaku, Yoshihiko
Tokuda, Keiichi
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 668 - 671
[42] A system for voice conversion based on probabilistic classification and a harmonic plus noise model
Stylianou, Y
Cappe, O
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 281 - 284
[43] On state space model based predictive control
Di Ruscio, D
Foss, B
DYNAMICS & CONTROL OF PROCESS SYSTEMS 1998, VOLUMES 1 AND 2, 1999, : 301 - 306
[44] Speaker identification based on state space model
Xu L.
Yang Z.
International Journal of Speech Technology, 2016, 19 (2) : 407 - 414
[45] Voice conversion using canonical correlation analysis based on Gaussian mixture model
Jian, ZhiHua
Yang, Zhen
SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 1, PROCEEDINGS, 2007, : 210 - +
[46] Online Model Adaptation for Voice Conversion using Model-based Speech Synthesis Techniques
Wu, Dalei
Li, Baojie
Jiang, Hui
Fu, Qian-Jie
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1611 - +
[47] Space-based observation modeling method considering resident space object state uncertainty
Cai, Yifan
Colombo, Camilla
AEROSPACE SCIENCE AND ENGINEERING, IV AEROSPACE PHD-DAYS 2024, 2024, 42 : 183 - 187
[48] Dynamic Model Selection for Spectral Voice Conversion
Lanchantin, Pierre
Rodet, Xavier
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1720 - 1723
[49] Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
Wu, Yi-Jian
Zen, Heiga
Nankaku, Yoshilliko
Tokuda, Keiichi
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4621 - 4624
[50] A Color to Grayscale Conversion Considering Local and Global Contrast
Kuk, Jung Gap
Ahn, Jae Hyun
Cho, Nam Ik
COMPUTER VISION - ACCV 2010, PT IV, 2011, 6495 : 513 - 524

← 1 2 3 4 5 →