Intra-class variation reduction of speaker representation in disentanglement framework

被引:11
|
作者
Kwo, Yoohwan [1 ]
Chun, Soo-Whan [1 ]
Kan, Hong-Goo [1 ]
机构
[1] Yonsei Univ, Dept Elect & Elect Engn, Seoul, South Korea
来源
基金
芬兰科学院;
关键词
speaker verification; disentanglement; mutual information;
D O I
10.21437/Interspeech.2020-2075
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this paper, we propose an effective training strategy to extract robust speaker representations from a speech signal. One of the key challenges in speaker recognition tasks is to learn latent representations or embeddings containing solely speaker characteristic information in order to be robust in terms of intra-speaker variations. By modifying the network architecture to generate both speaker-related and speaker-unrelated representations, we exploit a learning criterion which minimizes the mutual information between these disentangled embeddings. We also introduce an identity change loss criterion which utilizes a reconstruction error to different utterances spoken by the same speaker. Since the proposed criteria reduce the variation of speaker characteristics caused by changes in background environment or spoken content, the resulting embeddings of each speaker become more consistent. The effectiveness of the proposed method is demonstrated through two tasks; disentanglement performance, and improvement of speaker recognition accuracy compared to the baseline model on a benchmark dataset, VoxCeleb1. Ablation studies also show the impact of each criterion on overall performance.
引用
收藏
页码:3231 / 3235
页数:5
相关论文
共 50 条
  • [1] Predefined Prototypes for Intra-Class Separation and Disentanglement
    Almudevar, Antonio
    Mariotte, Theo
    Ortega, Alfonso
    Tahon, Marie
    Vicente, Luis
    Miguel, Antonio
    Lleida, Eduardo
    INTERSPEECH 2024, 2024, : 3809 - 3813
  • [2] Intra-Class Variation Reduction Using Training Expression Images for Sparse Representation Based Facial Expression Recognition
    Lee, Seung Ho
    Plataniotis, Konstantinos N.
    Ro, Yong Man
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2014, 5 (03) : 340 - 351
  • [3] Adaptive face representation via class-specific and intra-class variation dictionaries for recognition
    Wenjing Li
    Jiuzhen Liang
    Multimedia Tools and Applications, 2018, 77 : 14783 - 14802
  • [4] Adaptive face representation via class-specific and intra-class variation dictionaries for recognition
    Li, Wenjing
    Liang, Jiuzhen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (12) : 14783 - 14802
  • [5] Intra-class Variation Representation for On-line Signature Verification Using Wavelet and Fractal Analysis
    Liew, Lee Hung
    Lee, Beng Yong
    Wang, Yin Chai
    2017 FIRST INTERNATIONAL CONFERENCE ON COMPUTER AND DRONE APPLICATIONS (ICONDA), 2017, : 87 - 91
  • [6] Sparse representation based classification with intra-class variation dictionary on symmetric positive definite manifolds
    Kasai, Hiroyuki
    Yoshikawa, Kohei
    2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 255 - 258
  • [7] Modeling intra-class variation for nonideal iris recognition
    Li, X
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2006, 3832 : 419 - 427
  • [8] Local Preserving Graphs Using Intra-Class Competitive Representation for Dimensionality Reduction of Hyperspectral Image
    Zhen Ye
    Shihao Shi
    Tao Sun
    Lin Bai
    Journal of Beijing Institute of Technology, 2021, 30 (02) : 139 - 158
  • [9] Local Preserving Graphs Using Intra-Class Competitive Representation for Dimensionality Reduction of Hyperspectral Image
    Ye Z.
    Shi S.
    Sun T.
    Bai L.
    Journal of Beijing Institute of Technology (English Edition), 2021, 30 (02): : 139 - 158
  • [10] INTRA-CLASS CONFLICT
    Ross, Edward A.
    SOCIOLOGY AND SOCIAL RESEARCH, 1930, 14 (06): : 524 - 530