Dynamic Model Selection for Spectral Voice Conversion

被引:0
|
作者
Lanchantin, Pierre [1 ]
Rodet, Xavier [1 ]
机构
[1] Anal Synth Team, STMS, IRCAM, CNRS,UMR9912, F-75004 Paris, France
关键词
Voice conversion; model selection; TRANSFORMATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Statistical methods for voice conversion are usually based on a single model selected in order to represent a tradeoff between goodness of fit and complexity. In this paper we assume that the best model may change over time, depending on the source acoustic features. We present a new method for spectral voice conversion(1) called Dynamic Model Selection (DMS), in which a set of potential best models with increasing complexity - including a mixture of Gaussian and probabilistic principal component analyzers - are considered during the conversion of a source speech signal into a target speech signal. This set is built during the learning phase, according to the Bayes information criterion (BIC). During the conversion, the best model is dynamically selected among the models in the set, according to the acoustical features of each source frame. Subjective tests show that the method improves the conversion in terms of proximity to the target and quality.
引用
收藏
页码:1720 / 1723
页数:4
相关论文
共 50 条
  • [1] OBJECTIVE EVALUATION OF THE DYNAMIC MODEL SELECTION METHOD FOR SPECTRAL VOICE CONVERSION
    Lanchantin, Pierre
    Rodet, Xavier
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5132 - 5135
  • [2] Conversion function clustering and selection using linguistic and spectral information for emotional voice conversion
    Hsia, Chi-Chun
    Wu, Chung-Hsien
    Wu, Jian-Qi
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2007, 56 (09) : 1245 - 1254
  • [3] Conversion function clustering and selection for expressive voice conversion
    Hsia, Chi-Chun
    Wu, Chung-Hsien
    Wu, Jian-Qi
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 689 - +
  • [4] EXEMPLAR SELECTION METHODS IN VOICE CONVERSION
    Zhao, Guanlong
    Gutierrez-Osuna, Ricardo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5525 - 5529
  • [5] Voice conversion based on state-space model for modelling spectral trajectory
    Xu, N.
    Yang, Z.
    Zhang, L. H.
    Zhu, W. P.
    Bao, J. Y.
    [J]. ELECTRONICS LETTERS, 2009, 45 (14) : 763 - U73
  • [6] A DYNAMIC GAUSSIAN PROCESS FOR VOICE CONVERSION
    Huang, Dong-Yan
    Dong, Minghui
    Li, Haizhou
    [J]. ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [7] Parametric model for voice conversion
    Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
    [J]. Shengxue Xuebao, 2006, 6 (542-548):
  • [8] Voice conversion: Wavelet based residual selection
    Kachare, Pramod
    Cheeran, Alice
    Nirmal, Jagganath
    Zaveri, Mukesh
    [J]. 2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1513 - 1518
  • [9] Automatic source speaker selection for voice conversion
    Turk, Oytun
    Arslan, Levent M.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (01): : 480 - 491
  • [10] OBSERVATION-MODEL ERROR COMPENSATION FOR ENHANCED SPECTRAL ENVELOPE TRANSFORMATION IN VOICE CONVERSION
    Villavicencio, Fernando
    Bonada, Jordi
    Hisaminato, Yuji
    [J]. 2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,