Voice Conversion Based on State Space Model and Considering Global Variance

被引：0

作者：

Ahangar, Mohsen ^{[1
]}

Ghorbandoost, Mostafa ^{[1
]}

Sheikhzadeh, Hamid ^{[1
]}

Raahemifar, Kaamran ^{[2
]}

Shahrebabaki, Abdoreza Sabzi ^{[1
]}

Amini, Jamal ^{[1
]}

机构：

[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran

[2] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada

来源：

2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013) | 2013年

关键词：

State space model; global variance; voice conversion;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Voice conversion based on State Space Model (SSM) has been recently proposed to address the discontinuity problem in the traditional frame-based voice conversion by considering the spectral envelope evolutions. However, the results are over-smoothed. To resolve this problem, in this paper we propose a new procedure for integrating the global variance constraint into the SSM-based voice conversion. Moreover, unlike the SSM-based method, we allow the state-vector order to be higher than the feature-vector order. Experimental results verify that the proposed method significantly improves the performance of the SSM-based voice conversion in terms of speaker individuality and speech quality. Our experiments also show that the proposed method outperforms the well-known Maximum Likelihood estimation method that considers the Global Variance in terms of speech quality.

引用

页码：416 / 421

页数：6

共 50 条

[1] Voice conversion based on trajectory model training of neural networks considering global variance
Hosaka, Naoki
Hashimoto, Kei
Oura, Keiichiro
Nankaku, Yoshihiko
Tokuda, Keiichi
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 307 - 311
[2] Analysis of State-Space Model based Voice Conversion
Sun, Jian
Zhang, Xiongwei
PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (10): : 373 - 376
[3] Voice conversion based on state-space model for modelling spectral trajectory
Xu, N.
Yang, Z.
Zhang, L. H.
Zhu, W. P.
Bao, J. Y.
ELECTRONICS LETTERS, 2009, 45 (14) : 763 - U73
[4] MODULAR GLOBAL VARIANCE ENHANCEMENT FOR VOICE CONVERSION SYSTEMS
Benisty, H.
Malah, D.
Crammer, K.
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 370 - 374
[5] Voice Conversion using GMAT with Enhanced Global Variance
Benisty, Hadas
Malah, David
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 676 - 679
[6] Statistical Singing Voice Conversion based on Direct Waveform Modification with Global Variance
Kobayashi, Kazuhiro
Toda, Tomoki
Neubig, Graham
Sakti, Sakriani
Nakamura, Satoshi
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2754 - 2758
[7] Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion
Hwang, Hsin-Te
Tsao, Yu
Wang, Hsin-Min
Wang, Yih-Ru
Chen, Sin-Horng
2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
[8] Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
Toda, T
Black, AW
Tokuda, K
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 9 - 12
[9] Voice conversion towards modeling dynamic characteristics using switching state space model
Ning Xu
JingYi Bao
XiaoFeng Liu
AiMing Jiang
YiBing Tang
Science China Information Sciences, 2013, 56 : 1 - 15
[10] Voice Conversion with a Strategy for Separating Speaker Individuality Using State-Space Model
Xu, Ning
Yang, Zhen
Guo, Haiyan
2010 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND INFORMATION SECURITY (WCNIS), VOL 1, 2010, : 298 - 301

← 1 2 3 4 5 →