A novel voice morphing system using Bi-GMM for high quality transformation

Cited by: 0
Authors
Xu, Ning [1]
Shao, Xi [1]
Yang, Zhen [1]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Inst Signal Proc & Transmiss, Nanjing, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a novel voice morphing system that reproduces high-quality speech while preserving most of the target speaker's characteristics. The name Bi-GMM reflects the two roles of the GMM technique in the system: it is used to estimate the mapping functions and also to generate a codebook. In contrast to the traditional GMM technique, a maximum likelihood estimation framework combined with a codebook compensation technique is proposed to overcome the over-smoothing problem of conventional GMM-based conversion. Furthermore, a time-domain median filter is applied to alleviate discontinuities between frames. The STRAIGHT algorithm is adopted for analysis and synthesis. Objective and subjective evaluations show that the quality of speech converted by the proposed method is significantly improved compared with that of the traditional GMM method.
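As a rough illustration of the time-domain median filtering step mentioned in the abstract, the sketch below smooths a sequence of per-frame feature vectors along the time axis. The abstract does not state the window length, the boundary handling, or which features are filtered, so the function name `median_smooth`, the default `width=3`, and the clamped-window boundary rule are all assumptions made for this example, not the authors' implementation.

```python
from statistics import median


def median_smooth(frames, width=3):
    """Sliding median along the time axis of a frame sequence.

    frames: list of equal-length feature vectors, one per frame.
    width:  odd window length in frames (assumed value, not from the paper).
    """
    if width < 1 or width % 2 == 0:
        raise ValueError("width must be a positive odd integer")
    half = width // 2
    n = len(frames)
    smoothed = []
    for t in range(n):
        # Shrink the window at the utterance boundaries (an assumption).
        lo, hi = max(0, t - half), min(n, t + half + 1)
        window = frames[lo:hi]
        # Take the median of each feature dimension independently.
        smoothed.append([median(col) for col in zip(*window)])
    return smoothed


# Toy trajectory: frame 2 has an isolated jump in the first dimension.
track = [[1.0, 5.0], [1.1, 5.1], [9.0, 5.2], [1.2, 5.3], [1.3, 5.4]]
smoothed_track = median_smooth(track)
print(smoothed_track[2])
```

With a window of three frames, an isolated one-frame jump is replaced by a neighboring value, while a slowly varying trajectory such as the second dimension passes through largely unchanged in the interior frames.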
Pages: 485-489
Number of pages: 5