Mandarin Singing Synthesis Based on Generative Adversarial Network

被引:0
|
作者
Zhou, Yun [2 ]
Yang, Hongwu [1 ,3 ]
Chen, Ziyan [2 ]
Yan, Yajing [2 ]
机构
[1] Northwest Normal Univ, Coll Educ Technol, Lanzhou, Peoples R China
[2] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou, Peoples R China
[3] Natl & Prov Joint Engn Lab Learning Anal Technol, Lanzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
singing synthesis; GAN; singing voice corpus; over-smoothing;
D O I
10.1109/icicsp50920.2020.9232118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposed a method for statistical parametric singing synthesis incorporating GAN (Generative Adversarial Network) that trained acoustic model. In GAN, the acoustic model was trained to minimize the weighted sum of the conventional minimum generation loss and adversarial loss, which was minimizing the distance between the natural and generated samples parameter, thus effectively solved the problem of over-smoothing. In the experimental part, we established a singing voice corpus with 60 songs and divided them that have been recorded and labeled into about 1000 sentences, of which 950 sentences were for training model. Comparing the generated songs of the method proposed in this paper and HMM, through 10 people MOS scores, the score of the former was 3.12 that was better than the latter of 2.81.
引用
收藏
页码:139 / 142
页数:4
相关论文
共 50 条
  • [21] GENERATIVE ADVERSARIAL NETWORK-BASED POSTFILTER FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS
    Kaneko, Takuhiro
    Kameoka, Hirokazu
    Hojo, Nobukatsu
    Ijima, Yusuke
    Hiramatsu, Kaoru
    Kashino, Kunio
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4910 - 4914
  • [22] Generative Adversarial Network Implementation for Batik Motif Synthesis
    Abdurrahman, Miqdad
    Shabrina, Nabila Husna
    Halim, Dareen K.
    PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON NEW MEDIA STUDIES (CONMEDIA 2019), 2019, : 63 - 67
  • [23] Glioblastoma MR Images Synthesis with Generative Adversarial Network
    Wen, Ning
    Dai, Zhenzhen
    Carver, Eric
    Liang, Evan
    Snyder, James
    Griffith, Brent
    Movsas, Benjamin
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2020, 108 (02): : E28 - E28
  • [24] Image Synthesis with a Convolutional Capsule Generative Adversarial Network
    Bass, Cher
    Dai, Tianhong
    Billot, Benjamin
    Arulkumaran, Kai
    Creswell, Antonia
    Clopath, Claudia
    De Paola, Vincenzo
    Bharath, Anil Anthony
    INTERNATIONAL CONFERENCE ON MEDICAL IMAGING WITH DEEP LEARNING, VOL 102, 2019, 102 : 39 - 62
  • [25] TEXT TO IMAGE SYNTHESIS WITH BIDIRECTIONAL GENERATIVE ADVERSARIAL NETWORK
    Wang, Zixu
    Quan, Zhe
    Wang, Zhi-Jie
    Hu, Xinjian
    Chen, Yangyang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [26] Multimodal Fusion Generative Adversarial Network for Image Synthesis
    Zhao, Liang
    Hu, Qinghao
    Li, Xiaoyuan
    Zhao, Jingyuan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1865 - 1869
  • [27] Generative Adversarial Network Synthesis of Hyperspectral Vegetation Data
    Hennessy, Andrew
    Clarke, Kenneth
    Lewis, Megan
    REMOTE SENSING, 2021, 13 (12)
  • [28] A Generative Adversarial Network Approach to Reflectarray Pattern Synthesis
    Li, Hong-Wen
    Chen, You-Cheng
    Liu, Alan
    Lin, Shih-Cheng
    Hsieh, Meng-Yuan
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [29] Mandarin speech reconstruction from surface electromyography based on generative adversarial networks
    Li, Fengji
    Shen, Fei
    Ma, Ding
    Zhou, Jie
    Wang, Li
    Fan, Fan
    Liu, Tao
    Chen, Xiaohong
    Toda, Tomoki
    Niu, Haijun
    MEDICINE IN NOVEL TECHNOLOGY AND DEVICES, 2025, 26
  • [30] SINGAN: Singing Voice Conversion with Generative Adversarial Networks
    Sisman, Berrak
    Vijayan, Karthika
    Dong, Minghui
    Li, Haizhou
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 112 - 118