Voice Conversion Using Structrued Gaussian Mixture Model

被引:0
|
作者
Zeng, Daojian [1 ]
Yu, Yibiao [1 ]
机构
[1] Soochow Univ, Sch Elect & Informat Engn, Suzhou, Peoples R China
关键词
voice conversion; SGMM; AUS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Gaussian Mixture Model (GMM) is commonly used in voice conversion.. However, traditional GMM based voice conversion usually extracts a conversion function from parallel corpus, which greatly limits the application of the technology. In an attempt to overcome this drawback, structured Gaussian Mixture Model (SGMM) is applied to model the speaker's acoustic feature distribution. In particular, two speakers' isolated SGMMs are aligned based on Acoustic Universal Structure (AUS) theory. Then the conversion function is extracted from two aligned SGMMs in a manner similar to conventional method. The subjective listening tests indicate that the proposed method achieves equivalent: speech quality and speaker individuality compared with conventional method.
引用
收藏
页码:541 / 544
页数:4
相关论文
共 50 条
  • [21] Speech Analysis/Synthesis by Gaussian Mixture Approximation of the Speech Spectrum for Voice Conversion
    Amini, Jamal
    Shahrebabaki, Abdoreza Sabzi
    Shokouhi, Navid
    Sheikhzadeh, Hamid
    Raahemifa, Kaamran
    Eslami, Mehdi
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 428 - 433
  • [22] Voice Conversion Based on Gaussian Mixture Modules with Minimum Distance Spectral Mapping
    Jin, Gui
    Johnson, Michael T.
    Liu, Jia
    Lin, Xiaokang
    [J]. 2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2015, : 356 - 359
  • [23] Voice Conversion Using A Two-Factor Gaussian Process Latent Variable Model
    Sun, Xinjian
    Zhang, Xiongwei
    Cao, Tieyong
    Yang, Jibin
    Sun, Jian
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (12A): : 318 - 324
  • [24] A Voice Morphing Model Based on the Gaussian Mixture Model and Generative Topographic Mapping
    Rassam, Murad A.
    Almekhlafi, Rasha
    Alosaily, Eman
    Hassan, Haneen
    Hassan, Reem
    Saeed, Eman
    Alqershi, Elham
    [J]. EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 396 - 406
  • [25] Interpretable parametric voice conversion functions based on Gaussian mixture models and constrained transformations
    Erro, Daniel
    Alonso, Agustin
    Serrano, Luis
    Navas, Eva
    Hernaez, Inma
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 30 (01): : 3 - 15
  • [26] Voice biometrics using linear Gaussian model
    Yang, Hai
    Xu, Yunfei
    Huang, Houjun
    Zhou, Ruohua
    Yan, Yonghong
    [J]. IET BIOMETRICS, 2014, 3 (01) : 9 - 15
  • [27] Voice Interaction Using Gaussian Mixture Models for Augmented Reality Applications
    Hamidia, Mahfoud
    Zenati, Nadia
    Belghit, Hayet
    Guetiteni, Kamila
    Achour, Nouara
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 387 - +
  • [28] Gender based Voice Authentication Using Gaussian Mixture Model and Mel-Frequency Cepstrum Coefficients
    Rajeh, Wahid
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01): : 539 - 545
  • [29] Gaussian Process Experts for Voice Conversion
    Pilkington, Nicholas C. V.
    Zen, Heiga
    Gales, Mark J. F.
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2772 - +
  • [30] A DYNAMIC GAUSSIAN PROCESS FOR VOICE CONVERSION
    Huang, Dong-Yan
    Dong, Minghui
    Li, Haizhou
    [J]. ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,