A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION

Cited by: 0
Authors
Chen, Z. [1 ]
Zhang, L. H. [1 ]
Affiliations
[1] Nanjing Univ Post & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
Keywords
voice conversion; ANN; GMM; pitch conversion; TRANSFORMATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, we describe a novel method for voice conversion (VC). An Artificial Neural Network (ANN) model is employed to perform joint spectrum and pitch conversion between speakers. Conventional methods convert spectral parameters and pitch independently, and these separate transformations lead to unsatisfactory speech quality. The main reason may be that F0 sequences are usually converted by a simple linear function. To overcome this problem, we apply joint parameters for both training and conversion. A comparative study of voice conversion with ANN and Gaussian Mixture Model (GMM) is conducted. Experimental results indicate that the performance of VC can be dramatically improved by the proposed method in terms of both subjective evaluation and objective measurement.
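The contrast drawn in the abstract can be illustrated with a minimal sketch. The record does not give the paper's actual formulas, so everything here is an assumption: `linear_f0_conversion` uses the mean-variance transform in the log-F0 domain as one common form of a "simple linear function", and `joint_features` shows one plausible way to append pitch to each spectral frame so that a single model sees both. All function names are hypothetical.

```python
import math
from statistics import mean, pstdev


def log_f0_stats(f0_values):
    """Mean and standard deviation of log-F0 over voiced frames (F0 > 0)."""
    logs = [math.log(f) for f in f0_values if f > 0]
    return mean(logs), pstdev(logs)


def linear_f0_conversion(src_f0, src_stats, tgt_stats):
    """Assumed baseline: linear mean-variance mapping of log-F0.

    Unvoiced frames (F0 <= 0) are passed through as 0.0.
    """
    mu_s, sigma_s = src_stats
    mu_t, sigma_t = tgt_stats
    converted = []
    for f0 in src_f0:
        if f0 <= 0:
            converted.append(0.0)
        else:
            log_f0 = mu_t + (sigma_t / sigma_s) * (math.log(f0) - mu_s)
            converted.append(math.exp(log_f0))
    return converted


def joint_features(spectral_frames, f0_values):
    """Assumed joint representation: spectral vector plus log-F0 per frame.

    Unvoiced frames get 0.0 in the pitch slot (an illustrative choice).
    """
    return [
        list(frame) + [math.log(f0) if f0 > 0 else 0.0]
        for frame, f0 in zip(spectral_frames, f0_values)
    ]


if __name__ == "__main__":
    source_f0 = [100.0, 120.0, 0.0, 140.0]   # Hz, 0.0 marks an unvoiced frame
    target_f0 = [200.0, 240.0, 280.0]
    converted = linear_f0_conversion(
        source_f0, log_f0_stats(source_f0), log_f0_stats(target_f0)
    )
    print([round(f, 2) for f in converted])
    print(joint_features([[0.1, 0.2], [0.3, 0.4]], [100.0, 0.0]))
```

In the baseline, pitch is mapped without reference to the spectrum; in the joint representation, a model trained on paired source and target vectors would have to predict both together, which is the idea the abstract describes.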
Pages: 4