Diverse style oriented many-to-many emotional voice conversion

被引:0
|
作者
Zhou, Jian [1 ]
Luo, Xiangyu [1 ]
Wang, Huabin [1 ]
Zheng, Wenming [2 ]
Tao, Liang [1 ]
机构
[1] Key Laboratory of Intelligent Computing and Signal Processing, Anhui University, Hefei,230601, China
[2] Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing,210096, China
来源
Shengxue Xuebao/Acta Acustica | 2024年 / 49卷 / 06期
关键词
Network coding - Speech enhancement;
D O I
10.12395/0371-0025.2023192
中图分类号
学科分类号
摘要
To address the issues of insufficient emotional separation and lack of diversity in emotional expression in existing generative adversarial network (GAN)-based emotional voice conversion methods, this paper proposes a many-to-many speech emotional voice conversion method aimed at style diversification. The method is based on a GAN model with a dual-generator structure, where a consistency loss is applied to the latent representations of different generators to ensure the consistency of speech content and speaker characteristics, thereby improving the similarity between the converted speech emotion and the target emotion. Additionally, this method utilizes an emotion mapping network and emotion feature encoder to provide diversified emotional representations of the same emotion category for the generators. Experimental results show that the proposed emotion conversion method yields speech emotions that are closer to the target emotion, with a richer variety of emotional styles. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1297 / 1303
相关论文
共 50 条
  • [31] STARGAN-VC: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING STAR GENERATIVE ADVERSARIAL NETWORKS
    Kameoka, Hirokazu
    Kaneko, Takuhiro
    Tanaka, Kou
    Hojo, Nobukatsu
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 266 - 273
  • [32] Many-to-Many Pair Trading
    Wang, Yingying
    Li, Xiaodong
    Wu, Pangjing
    Xie, Haoran
    WEB AND BIG DATA, PT I, APWEB-WAIM 2022, 2023, 13421 : 399 - 407
  • [33] IMPERFECT MANY-TO-MANY TELEPORTATION
    Ghiu, Iulia
    Isdraila, Tudor
    Suciu, Serban
    ROMANIAN JOURNAL OF PHYSICS, 2012, 57 (3-4): : 564 - 570
  • [34] Many-To-Many Innovation Contexts
    D'Auria, Anna
    Tregua, Marco
    Spena, Tiziana Russo
    Bifulco, Francesco
    IFKAD 2015: 10TH INTERNATIONAL FORUM ON KNOWLEDGE ASSET DYNAMICS: CULTURE, INNOVATION AND ENTREPRENEURSHIP: CONNECTING THE KNOWLEDGE DOTS, 2015, : 2082 - 2093
  • [35] Singing Voice Conversion Method Based on Many-to-Many Eigenvoice Conversion and Training Data Generation Using a Singing-to-Singing Synthesis System
    Doi, Hironori
    Toda, Tomoki
    Nakano, Tomoyasu
    Goto, Masataka
    Nakamura, Satoshi
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [36] High-Quality Many-to-Many Voice Conversion Using Transitive Star Generative Adversarial Networks with Adaptive Instance Normalization
    Li, Yanping
    He, Zhengtao
    Zhang, Yan
    Yang, Zhen
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (10)
  • [37] Implementation in the many-to-many matching market
    Sotomayor, M
    GAMES AND ECONOMIC BEHAVIOR, 2004, 46 (01) : 199 - 212
  • [38] Many-to-many aggregation for sensor networks
    Silberstein, Adam
    Yang, Jun
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 961 - +
  • [39] SEARCH MEMORY FOR MANY-TO-MANY COMPARISONS
    DIGBY, DW
    IEEE TRANSACTIONS ON COMPUTERS, 1973, C-22 (08) : 768 - 772
  • [40] Many-to-many matching and price discrimination
    Gomes, Renato
    Pavan, Alessandro
    THEORETICAL ECONOMICS, 2016, 11 (03): : 1005 - 1052