A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION

被引:0
|
作者
Chen, Z. [1 ]
Zhang, L. H. [1 ]
机构
[1] Nanjing Univ Post & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
关键词
voice conversion; ANN; GMM; pitch conversion; TRANSFORMATION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we describe a novel conversion method for voice conversion (VC). Artificial Neural Network (ANN) model is employed for performing joint spectrum and pitch conversion between speakers. The conventional method converts spectral parameters and pitch independently. Those separate transformations lead to an unsatisfactory speech quality. The main reason maybe that F-0 sequences are usually converted by a simply linear function. To overcome this problem, we apply joint parameters for train and conversion. A comparative study of voice conversion with ANN and Gaussian Mixture Model (GMM) is conducted. Experimental results indicate that the performance of VC can be dramatically improved by the proposed method in view of both subjective evaluation and objective measurement.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] High-Individuality Voice Conversion Based on Concatenative Speech Synthesis
    Fujii, Kei
    Okawa, Jun
    Suigetsu, Kaori
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 483 - 488
  • [32] Voice System for Ordering Songs Based on MP ANN
    Gao, Jianhua
    Gong, Ningsheng
    [J]. 2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL I, 2010, : 654 - 657
  • [33] Voice System for Ordering Songs Based on MP ANN
    Gao, Jianhua
    Gong, Ningsheng
    [J]. PROCEEDINGS OF THE 2011 INTERNATIONAL CONFERENCE ON INFORMATICS, CYBERNETICS, AND COMPUTER ENGINEERING (ICCE2011), VOL 2: INFORMATION SYSTEMS AND COMPUTER ENGINEERING, 2011, 111 : 25 - 30
  • [34] Voice Quality Assessment Method Based on Contribution Degree Analysis
    Xuan, Zhangjian
    Cai, Xiaoxia
    [J]. PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON MODELLING, SIMULATION AND APPLIED MATHEMATICS (MSAM 2018), 2018, 160 : 259 - 263
  • [35] An improved spectral and prosodic transformation method in straight-based voice conversion
    Qin, L
    Chen, GP
    Ling, ZH
    Dai, LR
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 21 - 24
  • [36] Estimation method of glottal vocal efficiency based on conversion function of voice source
    ZOU Yuan WAN Mingxi ZHAO Shouguo WANG Supin(1 Department of Biomedical Engineering
    [J]. Chinese Journal of Acoustics, 2002, (04) : 332 - 342
  • [37] Improving Segmental GMM Based Voice Conversion Method with Target Frame Selection
    Gu, Hung-Yan
    Tsai, Sung-Fung
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 483 - 487
  • [38] A novel method for prosody prediction in voice conversion
    Helander, Elina E.
    Nurminen, Jani
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 509 - +
  • [39] A New ANN-SNN Conversion Method with High Accuracy, Low Latency and Good Robustness
    Wang, Bingsen
    Cao, Jian
    Chen, Jue
    Feng, Shuo
    Wang, Yuan
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 3067 - 3075
  • [40] A Comparison of Voice Conversion Methods for Transforming Voice Quality in Emotional Speech Synthesis
    Tuerk, Oytun
    Schroeder, Marc
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2282 - 2285