Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training

被引:0
|
作者
Hwang, Hsin-Te [1 ,3 ]
Tsao, Yu [2 ]
Wang, Hsin-Min [3 ]
Wang, Yih-Ru [1 ]
Chen, Sin-Horng [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[3] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
关键词
Voice conversion; discriminative training; GMM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a discriminative training (DT) method to alleviate the muffled sound effect caused by over smoothing in the Gaussian mixture model (GMM)-based voice conversion (VC). For the conventional GMM-based VC, we often observed a large degree of ambiguities among acoustic classes (generative classes), determined by the source feature vectors for generating the converted feature vectors, causing the "muffled sound" effect on the converted voice. The proposed DT method is applied to refine the parameters in the maximum likelihood (ML)-trained joint density GMM (JDGMM) in the training stage to reduce the ambiguities among acoustic classes (generative classes) to alleviate the muffled sound effect. Experimental results demonstrate that the DT method significantly enhances the discriminative power between acoustic classes (generative classes) in the objective evaluation and effectively alleviates the muffled sound effect in the subjective evaluation.
引用
收藏
页码:3061 / 3065
页数:5
相关论文
共 50 条
  • [1] Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion
    Hwang, Hsin-Te
    Tsao, Yu
    Wang, Hsin-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [2] Another Perspective of Over-Smoothing: Alleviating Semantic Over-Smoothing in Deep GNNs
    Li, Jin
    Zhang, Qirong
    Liu, Wenxi
    Chan, Antoni B.
    Fu, Yang-Geng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [3] MODULATION SPECTRUM-CONSTRAINED TRAJECTORY TRAINING ALGORITHM FOR GMM-BASED VOICE CONVERSION
    Takamichi, Shinnosuke
    Toda, Tomoki
    Black, Alan W.
    Nakamura, Satoshi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4859 - 4863
  • [4] Alleviating Over-smoothing for Unsupervised Sentence Representation
    Chen, Nuo
    Shou, Linjun
    Pei, Jian
    Gong, Ming
    Cao, Bowen
    Chang, Jianhui
    Jiang, Daxin
    Li, Jia
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 3552 - 3566
  • [5] Alleviating Over-Smoothing via Aggregation over Compact Manifolds
    Zhou, Dongzhuoran
    Yang, Hui
    Xiong, Bo
    Ma, Yue
    Kharlamov, Evgeny
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PAKDD 2024, 2024, 14646 : 390 - 404
  • [6] On the Over-Smoothing Problem of CNN Based Disparity Estimation
    Chen, Chuangrong
    Chen, Xiaozhi
    Cheng, Hui
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8996 - 9004
  • [7] Alleviating over-smoothing via graph sparsification based on vertex feature similarity
    Gongce Wu
    Shukuan Lin
    Yilin Zhuang
    Jianzhong Qiao
    [J]. Applied Intelligence, 2023, 53 : 20223 - 20238
  • [8] GMM-Based Speaker Gender and Age Classification After Voice Conversion
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. 2016 FIRST INTERNATIONAL WORKSHOP ON SENSING, PROCESSING AND LEARNING FOR INTELLIGENT MACHINES (SPLINE), 2016,
  • [9] Alleviating over-smoothing via graph sparsification based on vertex feature similarity
    Wu, Gongce
    Lin, Shukuan
    Zhuang, Yilin
    Qiao, Jianzhong
    [J]. APPLIED INTELLIGENCE, 2023, 53 (17) : 20223 - 20238
  • [10] Enhancing a Glossectomy Patient's Speech via GMM-based Voice Conversion
    Tanaka, Kei
    Hara, Sunao
    Abe, Masanobu
    Minagi, Shogo
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,