Diverse style oriented many-to-many emotional voice conversion

被引:0
|
作者
Zhou, Jian [1 ]
Luo, Xiangyu [1 ]
Wang, Huabin [1 ]
Zheng, Wenming [2 ]
Tao, Liang [1 ]
机构
[1] Key Laboratory of Intelligent Computing and Signal Processing, Anhui University, Hefei,230601, China
[2] Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing,210096, China
来源
Shengxue Xuebao/Acta Acustica | 2024年 / 49卷 / 06期
关键词
Network coding - Speech enhancement;
D O I
10.12395/0371-0025.2023192
中图分类号
学科分类号
摘要
To address the issues of insufficient emotional separation and lack of diversity in emotional expression in existing generative adversarial network (GAN)-based emotional voice conversion methods, this paper proposes a many-to-many speech emotional voice conversion method aimed at style diversification. The method is based on a GAN model with a dual-generator structure, where a consistency loss is applied to the latent representations of different generators to ensure the consistency of speech content and speaker characteristics, thereby improving the similarity between the converted speech emotion and the target emotion. Additionally, this method utilizes an emotion mapping network and emotion feature encoder to provide diversified emotional representations of the same emotion category for the generators. Experimental results show that the proposed emotion conversion method yields speech emotions that are closer to the target emotion, with a richer variety of emotional styles. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1297 / 1303
相关论文
共 50 条
  • [41] A many-to-many 'rural hospital theorem'
    Klijn, Flip
    Yazici, Ayse
    JOURNAL OF MATHEMATICAL ECONOMICS, 2014, 54 : 63 - 73
  • [42] Conveyors for Streaming Many-To-Many Communication
    Maley, F. Miller
    DeVinney, Jason G.
    2019 IEEE/ACM 9TH WORKSHOP ON IRREGULAR APPLICATIONS - ARCHITECTURES AND ALGORITHMS (IA3), 2019, : 1 - 8
  • [43] Many-to-many information flow policies
    Baldan, Paolo
    Lafuente, Alberto Lluch
    SCIENCE OF COMPUTER PROGRAMMING, 2018, 168 : 118 - 141
  • [44] Congestion Control Algorithm based on Loss Trend Oriented to Many-to-many Reliable Multicast
    Liu Li
    Zhou Zhong
    Liu Dongmei
    Wu Wei
    SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 93 - +
  • [45] Many-to-Many Communication in Radio Networks
    Chlebus, Bogdan S.
    Kowalski, Dariusz R.
    Radzik, Tomasz
    ALGORITHMICA, 2009, 54 (01) : 118 - 139
  • [46] Visualizing many-to-many association rules
    Yang, L
    IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003, : 92 - 95
  • [47] Many-to-Many Information Flow Policies
    Baldan, Paolo
    Beggiato, Alessandro
    Lafuente, Alberto Lluch
    COORDINATION MODELS AND LANGUAGES, COORDINATION 2017, 2017, 10319 : 159 - 177
  • [48] Stable many-to-many matchings with contracts
    Klaus, Bettina
    Walzl, Markus
    JOURNAL OF MATHEMATICAL ECONOMICS, 2009, 45 (7-8) : 422 - 434
  • [49] Many-to-Many Communication in Radio Networks
    Bogdan S. Chlebus
    Dariusz R. Kowalski
    Tomasz Radzik
    Algorithmica, 2009, 54 : 118 - 139
  • [50] Many-to-Many Superpixel Matching for Robust Tracking
    Wang, Junqiu
    Yagi, Yasushi
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (07) : 1237 - 1248