Dynamic Margin Softmax Loss for Speaker Verification

被引:18
|
作者
Zhou, Dao [1 ]
Wang, Longbiao [1 ]
Lee, Kong Aik [2 ]
Wu, Yibo [1 ]
Liu, Meng [1 ]
Dang, Jianwu [1 ,3 ]
Wei, Jianguo [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin Key Lab Cognit Comp & Applicat, Tianjin, Peoples R China
[2] NEC Corp Ltd, Biometr Res Labs, Tokyo, Japan
[3] Japan Adv Inst Sci & Technol, Nomi, Ishikawa, Japan
来源
基金
中国国家自然科学基金;
关键词
speaker verification; large-margin loss; intra-class; compactness;
D O I
10.21437/Interspeech.2020-1106
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We propose a dynamic-margin softmax loss for the training of deep speaker embedding neural network. Our proposal is inspired by the additive-margin softmax (AM-Softmax) loss reported earlier. In AM-Softmax loss, a constant margin is used for all training samples. However, the angle between the feature vector and the ground-truth class center is rarely the same for all samples. Furthermore, the angle also changes during training. Thus, it is more reasonable to set a dynamic margin for each training sample. In this paper, we propose to dynamically set the margin of each training sample commensurate with the cosine angle of that sample, hence, the name dynamic-additive-margin softmax (DAM-Softmax) loss. More specifically, the smaller the cosine angle is, the larger the margin between the training sample and the corresponding class in the feature space should be to promote intra-class compactness. Experimental results show that the proposed DAM-Softmax loss achieves state-of-the-art performance on the VoxCeleb dataset by 1.94% in equal error rate (EER). In addition, our method also outperforms AM-Softmax loss when evaluated on the Speakers in the Wild (SITW) corpus.
引用
收藏
页码:3800 / 3804
页数:5
相关论文
共 50 条
  • [1] Large Margin Softmax Loss for Speaker Verification
    Liu, Yi
    He, Liang
    Liu, Jia
    [J]. INTERSPEECH 2019, 2019, : 2873 - 2877
  • [2] REAL ADDITIVE MARGIN SOFTMAX FOR SPEAKER VERIFICATION
    Li, Lantian
    Nai, Ruiqian
    Wang, Dong
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7527 - 7531
  • [3] ENSEMBLE ADDITIVE MARGIN SOFTMAX FOR SPEAKER VERIFICATION
    Yu, Ya-Qi
    Fan, Lei
    Li, Wu-Jun
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6046 - 6050
  • [4] Contrapositive Margin Softmax Loss for Face Verification
    Xu, Dongxue
    Zhao, Qijun
    [J]. PROCEEDINGS OF ICRCA 2018: 2018 THE 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION / ICRMV 2018: 2018 THE 3RD INTERNATIONAL CONFERENCE ON ROBOTICS AND MACHINE VISION, 2018, : 190 - 194
  • [5] IMPROVED LARGE-MARGIN SOFTMAX LOSS FOR SPEAKER DIARISATION
    Fathullah, Y.
    Zhang, C.
    Woodland, P. C.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7104 - 7108
  • [6] Angular Softmax Loss for End-to-end Speaker Verification
    Li, Yutian
    Gao, Feng
    Ou, Zhijian
    Sun, Jiasong
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 190 - 194
  • [7] Adaptive Margin Circle Loss for Speaker Verification
    Xiao, Runqiu
    Miao, Xiaoxiao
    Wang, Wenchao
    Zhang, Pengyuan
    Cai, Bin
    Luo, Liuping
    [J]. INTERSPEECH 2021, 2021, : 4618 - 4622
  • [8] Additive Margin Softmax for Face Verification
    Wang, Feng
    Cheng, Jian
    Liu, Weiyang
    Liu, Haijun
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (07) : 926 - 930
  • [9] Scalable Softmax Loss for Face Verification
    Zhang, Kun
    Zhang, Dongping
    Jing, Changxing
    Li, Jianchao
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 491 - 496
  • [10] Improving finger vein discriminant representation using dynamic margin softmax loss
    Huachuan Li
    Yi Lyu
    Guiduo Duan
    Ci Chen
    [J]. Neural Computing and Applications, 2022, 34 : 3589 - 3601