Voice Conversion Based on State Space Model and Considering Global Variance

被引:0
|
作者
Ahangar, Mohsen [1 ]
Ghorbandoost, Mostafa [1 ]
Sheikhzadeh, Hamid [1 ]
Raahemifar, Kaamran [2 ]
Shahrebabaki, Abdoreza Sabzi [1 ]
Amini, Jamal [1 ]
机构
[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran
[2] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
关键词
State space model; global variance; voice conversion;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Voice conversion based on State Space Model (SSM) has been recently proposed to address the discontinuity problem in the traditional frame-based voice conversion by considering the spectral envelope evolutions. However, the results are over-smoothed. To resolve this problem, in this paper we propose a new procedure for integrating the global variance constraint into the SSM-based voice conversion. Moreover, unlike the SSM-based method, we allow the state-vector order to be higher than the feature-vector order. Experimental results verify that the proposed method significantly improves the performance of the SSM-based voice conversion in terms of speaker individuality and speech quality. Our experiments also show that the proposed method outperforms the well-known Maximum Likelihood estimation method that considers the Global Variance in terms of speech quality.
引用
收藏
页码:416 / 421
页数:6
相关论文
共 50 条
  • [31] Voice conversion using Viterbi algorithm based on Gaussian mixture model
    Jian Zhi-Hua
    Yang Zhen
    2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2007, : 40 - 43
  • [32] A Physiologically-Based Pharmacokinetic Model of the Brain Considering Regional Lipid Variance
    Heitman, Andrew McPherson
    Bies, Robert R.
    Schwartz, Sorell L.
    JOURNAL OF PHARMACOLOGY AND EXPERIMENTAL THERAPEUTICS, 2022, 383 (03): : 217 - 226
  • [33] Voice conversion using partitions of spectral feature space
    Verhelst, W
    Mertens, J
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 365 - 368
  • [34] A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    Toda, Tomoki
    Tokuda, Keiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (05): : 816 - 824
  • [35] State space models with a common Stochastic variance
    Koopman, SJ
    Bos, CS
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2004, 22 (03) : 346 - 357
  • [36] Introduction: Space, Conversion, and Global History
    Marcocci, Giuseppe
    de Boer, Wietse
    Maldavsky, Aliocha
    Pavan, Ilaria
    SPACE AND CONVERSION IN GLOBAL PERSPECTIVE, 2015, 35 : 1 - 11
  • [37] Prediction-variance relation in a state-space fish stock assessment model
    Breivik, Olav Nikolai
    Nielsen, Anders
    Berg, Casper W.
    ICES JOURNAL OF MARINE SCIENCE, 2021, 78 (10) : 3650 - 3657
  • [38] STATISTICAL VOICE CONVERSION BASED ON WAVENET
    Niwa, Jumpei
    Yoshimura, Takenori
    Hashimoto, Kei
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5289 - 5293
  • [39] VTLN-based voice conversion
    Sündermann, D
    Ney, H
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 556 - 559
  • [40] Controllable voice conversion based on quantization of voice factor scores
    Isako, Takumi
    Onishi, Kotaro
    Kishida, Takuya
    Nakashika, Toru
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1444 - 1448