Mandarin Singing Synthesis Based on Generative Adversarial Network

被引:0
|
作者
Zhou, Yun [2 ]
Yang, Hongwu [1 ,3 ]
Chen, Ziyan [2 ]
Yan, Yajing [2 ]
机构
[1] Northwest Normal Univ, Coll Educ Technol, Lanzhou, Peoples R China
[2] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Peoples R China
[3] Natl & Prov Joint Engn Lab Learning Anal Technol, Lanzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
singing synthesis; GAN; singing voice corpus; over-smoothing;
D O I
10.1109/icicsp50920.2020.9232118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposed a method for statistical parametric singing synthesis incorporating GAN (Generative Adversarial Network) that trained acoustic model. In GAN, the acoustic model was trained to minimize the weighted sum of the conventional minimum generation loss and adversarial loss, which was minimizing the distance between the natural and generated samples parameter, thus effectively solved the problem of over-smoothing. In the experimental part, we established a singing voice corpus with 60 songs and divided them that have been recorded and labeled into about 1000 sentences, of which 950 sentences were for training model. Comparing the generated songs of the method proposed in this paper and HMM, through 10 people MOS scores, the score of the former was 3.12 that was better than the latter of 2.81.
引用
收藏
页码:139 / 142
页数:4
相关论文
共 50 条
  • [31] Mandarin singing voice synthesis using an HNM based scheme
    Gu, Hung-Yan
    Liau, Huang-Liang
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 347 - 351
  • [32] CGWGAN: crystal generative framework based on Wyckoff generative adversarial network
    Su, Tianhao
    Cao, Bin
    Hu, Shunbo
    Li, Musen
    Zhang, Tong-Yi
    JOURNAL OF MATERIALS INFORMATICS, 2024, 4 (04):
  • [33] Data Synthesis based on Generative Adversarial Networks
    Park, Noseong
    Mohammadi, Mahmoud
    Gorde, Kshitij
    Jajodia, Sushil
    Park, Hongkyu
    Kim, Youngmin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (10): : 1071 - 1083
  • [34] Dual Wasserstein generative adversarial network condition: A generative adversarial network-based acoustic impedance inversion method
    Wang, Zixu
    Wang, Shoudong
    Zhou, Chen
    Cheng, Wanli
    Geophysics, 2022, 87 (06):
  • [35] Dual Wasserstein generative adversarial network condition: A generative adversarial network-based acoustic impedance inversion method
    Wang, Zixu
    Wang, Shoudong
    Zhou, Chen
    Cheng, Wanli
    GEOPHYSICS, 2022, 87 (06) : R401 - R411
  • [36] Text-to-image synthesis based on modified deep convolutional generative adversarial network
    Li Y.
    Zhu M.
    Ren J.
    Su X.
    Zhou X.
    Yu H.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2023, 49 (08): : 1875 - 1883
  • [37] X-Ray Image with Prohibited Items Synthesis Based on Generative Adversarial Network
    Zhao, Tengfei
    Zhang, Haigang
    Zhang, Yutao
    Yang, Jinfeng
    BIOMETRIC RECOGNITION (CCBR 2019), 2019, 11818 : 379 - 387
  • [38] A Generative-Adversarial Network-Based Method for Image Synthesis of Diverse Pedestrian
    Li, Bo
    Liu, Zhenyuan
    Xing, Xingyu
    Jia, Tong
    Lu, Yuxiao
    Chen, Junyi
    CICTP 2023: INNOVATION-EMPOWERED TECHNOLOGY FOR SUSTAINABLE, INTELLIGENT, DECARBONIZED, AND CONNECTED TRANSPORTATION, 2023, : 300 - 309
  • [39] Mask-guided generative adversarial network for MRI-based CT synthesis
    Luo, Yu
    Zhang, Shaowei
    Ling, Jie
    Lin, Zhiyi
    Wang, Zongming
    Yao, Shun
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [40] Hybrid generative adversarial network based on frequency and spatial domain for histopathological image synthesis
    Liu, Qifeng
    Zhou, Tao
    Cheng, Chi
    Ma, Jin
    Tania, Marzia Hoque
    BMC BIOINFORMATICS, 2025, 26 (01):