Mandarin Singing Synthesis Based on Generative Adversarial Network

Authors
Zhou, Yun [2]
Yang, Hongwu [1, 3]
Chen, Ziyan [2]
Yan, Yajing [2]
Affiliations
[1] Northwest Normal Univ, Coll Educ Technol, Lanzhou, Peoples R China
[2] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Peoples R China
[3] Natl & Prov Joint Engn Lab Learning Anal Technol, Lanzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
singing synthesis; GAN; singing voice corpus; over-smoothing
DOI
10.1109/icicsp50920.2020.9232118
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes a statistical parametric singing synthesis method that uses a GAN (Generative Adversarial Network) to train the acoustic model. The acoustic model is trained to minimize a weighted sum of the conventional minimum generation loss and an adversarial loss. The adversarial term reduces the distance between the parameters of natural and generated samples, which effectively addresses the over-smoothing problem. For the experiments, we built a singing voice corpus of 60 songs, recorded and labeled, and divided it into about 1000 sentences, of which 950 were used to train the model. In a MOS (mean opinion score) test with 10 listeners comparing songs generated by the proposed method with songs generated by an HMM-based method, the proposed method scored 3.12, higher than the HMM-based method's 2.81.
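The weighted-sum objective described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact loss definitions, so everything here is an assumption made for illustration: a mean squared error stands in for the generation loss, the adversarial term uses the common generator-side form (negative mean log of the discriminator's output on generated samples), and the names `generation_loss`, `adversarial_loss`, `generator_loss`, and `adv_weight` are hypothetical rather than taken from the paper.

```python
import math


def generation_loss(natural, generated):
    """Mean squared error between natural and generated acoustic parameters.

    Assumption: MSE is used here as a stand-in for the paper's
    "minimum generation loss", whose exact form the abstract does not state.
    """
    if len(natural) != len(generated):
        raise ValueError("natural and generated must have the same length")
    return sum((n - g) ** 2 for n, g in zip(natural, generated)) / len(natural)


def adversarial_loss(disc_probs):
    """Generator-side adversarial loss: -mean(log D(generated)).

    disc_probs holds the discriminator's probability that each generated
    sample is natural. Assumption: this standard GAN form is illustrative only.
    """
    eps = 1e-12  # guard against log(0)
    return -sum(math.log(max(p, eps)) for p in disc_probs) / len(disc_probs)


def generator_loss(natural, generated, disc_probs, adv_weight=1.0):
    """Weighted sum of the generation loss and the adversarial loss.

    adv_weight is a hypothetical hyperparameter controlling how strongly
    the adversarial term is weighted against the generation term.
    """
    return generation_loss(natural, generated) + adv_weight * adversarial_loss(disc_probs)


if __name__ == "__main__":
    natural = [1.0, 2.0, 3.0]
    generated = [1.1, 1.8, 3.2]
    disc_probs = [0.4, 0.6, 0.5]
    print(generator_loss(natural, generated, disc_probs, adv_weight=0.5))
```

In this sketch, a larger `adv_weight` pushes the generator toward outputs the discriminator accepts as natural, while a smaller one keeps it closer to a plain regression fit of the natural parameters.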
Pages: 139-142
Number of pages: 4