A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION

被引:0
|
作者
Chen, Z. [1 ]
Zhang, L. H. [1 ]
机构
[1] Nanjing Univ Post & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
关键词
voice conversion; ANN; GMM; pitch conversion; TRANSFORMATION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we describe a novel conversion method for voice conversion (VC). Artificial Neural Network (ANN) model is employed for performing joint spectrum and pitch conversion between speakers. The conventional method converts spectral parameters and pitch independently. Those separate transformations lead to an unsatisfactory speech quality. The main reason maybe that F-0 sequences are usually converted by a simply linear function. To overcome this problem, we apply joint parameters for train and conversion. A comparative study of voice conversion with ANN and Gaussian Mixture Model (GMM) is conducted. Experimental results indicate that the performance of VC can be dramatically improved by the proposed method in view of both subjective evaluation and objective measurement.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] VTLN-based voice conversion
    Sündermann, D
    Ney, H
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 556 - 559
  • [42] A Comparison of Voice Conversion Methods for Transforming Voice Quality in Emotional Speech Synthesis
    Tuerk, Oytun
    Schroeder, Marc
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2282 - 2285
  • [43] An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation
    He, Xiangheng
    Chen, Junjie
    Rizos, Georgios
    Schuller, Bjorn W.
    [J]. INTERSPEECH 2021, 2021, : 821 - 825
  • [44] Controllable voice conversion based on quantization of voice factor scores
    Isako, Takumi
    Onishi, Kotaro
    Kishida, Takuya
    Nakashika, Toru
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1444 - 1448
  • [45] High quality voice morphing
    Ye, H
    Young, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 9 - 12
  • [47] Adaptive Voice-Quality Control Based on One-to-Many Eigenvoice Conversion
    Ohta, Kumi
    Toda, Tomoki
    Ohtani, Yamato
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2158 - +
  • [48] A Method Used for Quality Assessment of Construction Project Based on FCE-ANN
    Shi, Huawang
    Yang, Zhengang
    [J]. 2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 71 - 74
  • [49] A novel method for power quality comprehensive evaluation based on ANN and subordinate degree
    Yuan, Shuai
    Tong, Weiming
    Tong, Chengde
    Li, Zhongwei
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 62 - 65
  • [50] Research on Voice Quality Evaluation Method Based on Artificial Neural Network
    Di, Zixiang
    Xiao, Tian
    Li, Yi
    Cheng, Xinzhou
    Li, Bei
    Xu, Lexi
    Zhu, Xiaomeng
    Zhi, Lu
    Xia, Rui
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1510 - 1515