Efficient Gaussian Mixture Model Evaluation in Voice Conversion

Cited by: 0
Authors
Tian, Jilei [1]
Nurminen, Jani [1]
Popa, Victor [1]
Affiliations
[1] Nokia Res Ctr, Multimedia Technol Lab, Tampere, Finland
Keywords
voice conversion; speech subjective evaluation; Gaussian mixture model
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Voice conversion refers to the adaptation of the characteristics of a source speaker's voice to those of a target speaker. Gaussian mixture models (GMMs) have been found to be efficient in the voice conversion task. The GMM parameters are estimated from a training set with the goal of minimizing the mean squared error (MSE) between the transformed and target vectors. The quality of the GMM therefore plays an important role in achieving better voice conversion quality. This paper presents a very efficient approach for evaluating GMMs directly from the model parameters without using any test data, which facilitates improving the transformation performance, especially in embedded implementations. Although the proposed approach can be used in any application that utilizes GMM-based transformation, we take voice conversion as an example application throughout the paper. The proposed approach is tested in this context and evaluated against an MSE-based evaluation method. The results show that the proposed method is in line with all subjective observations and MSE results.
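For orientation, the MSE-based baseline mentioned in the abstract can be sketched as follows. This record does not give the paper's conversion function or its parameter-only evaluation measure, so the sketch assumes the commonly used joint-density GMM regression (the conditional mean of the target vector given the source vector) and measures MSE on held-out test pairs. The function names `gaussian_pdf`, `convert`, and `mse`, and the parameter names `weights`, `mu_x`, `mu_y`, `sigma_xx`, and `sigma_yx`, are hypothetical and chosen only for illustration.

```python
import numpy as np

def gaussian_pdf(x, mean, cov):
    """Multivariate normal density N(x; mean, cov) for a single vector x."""
    d = x.shape[0]
    diff = x - mean
    maha = diff @ np.linalg.solve(cov, diff)      # squared Mahalanobis distance
    log_det = np.linalg.slogdet(cov)[1]           # log|cov|
    return np.exp(-0.5 * (maha + log_det + d * np.log(2 * np.pi)))

def convert(x, weights, mu_x, mu_y, sigma_xx, sigma_yx):
    """Assumed conversion function: E[y | x] under a joint-density GMM."""
    # Posterior probability of each mixture component given the source vector.
    lik = np.array([w * gaussian_pdf(x, m, s)
                    for w, m, s in zip(weights, mu_x, sigma_xx)])
    post = lik / lik.sum()
    # Posterior-weighted sum of the per-component linear regressions.
    y = np.zeros(len(mu_y[0]))
    for p, mx, my, sxx, syx in zip(post, mu_x, mu_y, sigma_xx, sigma_yx):
        y += p * (my + syx @ np.linalg.solve(sxx, x - mx))
    return y

def mse(source, target, **gmm):
    """Mean squared error between converted source vectors and target vectors."""
    converted = np.array([convert(x, **gmm) for x in source])
    return float(np.mean(np.sum((converted - target) ** 2, axis=1)))
```

Note that this baseline needs aligned source and target test vectors, which is the dependency the abstract says the proposed method avoids by working from the model parameters alone.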
Pages: 2282 - 2285
Page count: 4
Related papers
50 in total
  • [31] Robust voice activity detection algorithm based on complex Gaussian mixture model
    Lei, Jian-Jun
    Yang, Zhen
    Liu, Gang
    Guo, Jun
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2009, 42 (04) : 353 - 356
  • [32] An efficient approach for shadow detection based on Gaussian mixture model
    Yan-xiang Han
    Zhi-sheng Zhang
    Fang Chen
    Kai Chen
    Journal of Central South University, 2014, 21 : 1385 - 1395
  • [33] An efficient approach for shadow detection based on Gaussian mixture model
    Han Yan-xiang
    Zhang Zhi-sheng
    Chen Fang
    Chen Kai
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2014, 21 (04) : 1385 - 1395
  • [34] An efficient mixture sampling model for gaussian estimation of distribution algorithm
    Dang, Qianlong
    Gao, Weifeng
    Gong, Maoguo
    Information Sciences, 2022, 608 : 1157 - 1182
  • [35] An efficient mixture sampling model for gaussian estimation of distribution algorithm
    Dang, Qianlong
    Gao, Weifeng
    Gong, Maoguo
    INFORMATION SCIENCES, 2022, 608 : 1157 - 1182
  • [36] IMAGE RESTORATION VIA EFFICIENT GAUSSIAN MIXTURE MODEL LEARNING
    Feng, Jianzhou
    Song, Li
    Huo, Xiaoming
    Yang, Xiaokang
    Zhang, Wenjun
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013 : 1056 - 1060
  • [37] An efficient approach for shadow detection based on Gaussian mixture model
    韩延祥
    张志胜
    陈芳
    陈恺
    Journal of Central South University, 2014, 21 (04) : 1385 - 1395
  • [38] AN ESTIMATION METHOD OF VOICE TIMBRE EVALUATION VALUES USING FEATURE EXTRACTION WITH GAUSSIAN MIXTURE MODEL BASED ON REFERENCE SINGER
    Yamane, Soichi
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Nakano, Tomoyasu
    Goto, Masataka
    Nakamura, Satoshi
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016 : 5265 - 5269
  • [39] Voice Activity Detection Based on Sequential Gaussian Mixture Model with Maximum Likelihood Criterion
    Shen, Zhan
    Wei, Jianguo
    Lu, Wenhuan
    Dang, Jianwu
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [40] VOICE CONVERSION BASED ON A MIXTURE DENSITY NETWORK
    Ahangar, Mohsen
    Ghorbandoost, Mostafa
    Sharma, Sudhendu
    Smith, Mark J. T.
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017 : 329 - 333