Additive Phoneme-aware Margin Softmax Loss for Language Recognition

被引:3
|
作者
Li, Zheng [1 ]
Liu, Yan [1 ]
Li, Lin [1 ]
Hong, Qingyang [2 ]
机构
[1] Xiamen Univ, Sch Elect Sci & Engn, Xiamen, Peoples R China
[2] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
language recognition; oriental language recognition; margin loss; phonetic information; SPEAKER;
D O I
10.21437/Interspeech.2021-1167
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper proposes an additive phoneme-aware margin softmax (APM-Softmax) loss to train the multi-task learning network with phonetic information for language recognition. In additive margin softmax (AM-Softmax) loss, the margin is set as a constant during the entire training for all training samples, and that is a suboptimal method since the recognition difficulty varies in training samples. In additive angular margin softmax (AAM-Softmax) loss, the additional angular margin is set as a costant as well. In this paper, we propose an APM-Softmax loss for language recognition with phoneitc multi-task learning, in which the additive phoneme-aware margin is automatically tuned for different training samples. More specifically, the margin of language recognition is adjusted according to the results of phoneme recognition. Experiments are reported on Oriental Language Recognition (OLR) datasets, and the proposed method improves AM-Softmax loss and AAM-Softmax loss in different language recognition testing conditions.
引用
收藏
页码:3276 / 3280
页数:5
相关论文
共 50 条
  • [1] Double Additive Margin Softmax Loss for Face Recognition
    Zhou, Shengwei
    Chen, Caikou
    Han, Guojiang
    Hou, Xielian
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [2] Additive Margin Softmax with Center Loss for Face Recognition
    Jiang, Mingchao
    Yang, Zhenguo
    Liu, Wenyin
    Liu, Xiaochun
    [J]. PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING (ICVIP 2018), 2018, : 1 - 6
  • [3] PAN: PHONEME-AWARE NETWORK FOR MONAURAL SPEECH ENHANCEMENT
    Du, Zhihao
    Lei, Ming
    Han, Jiqing
    Zhang, Shiliang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6634 - 6638
  • [4] Angular Margin-Mining Softmax Loss for Face Recognition
    Lee, Jwajin
    Wang, Yooseung
    Cho, Sunyoung
    [J]. IEEE ACCESS, 2022, 10 : 43071 - 43080
  • [5] Additive Margin Softmax for Face Verification
    Wang, Feng
    Cheng, Jian
    Liu, Weiyang
    Liu, Haijun
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (07) : 926 - 930
  • [6] Phoneme-aware Encoding for Prefix-tree-based Contextual ASR
    Futami, Hayato
    Tsunoo, Emiru
    Kashiwagi, Yosuke
    Ogawa, Hiroaki
    Arora, Siddhant
    Watanabe, Shinji
    [J]. arXiv, 2023,
  • [7] REAL ADDITIVE MARGIN SOFTMAX FOR SPEAKER VERIFICATION
    Li, Lantian
    Nai, Ruiqian
    Wang, Dong
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7527 - 7531
  • [8] ENSEMBLE ADDITIVE MARGIN SOFTMAX FOR SPEAKER VERIFICATION
    Yu, Ya-Qi
    Fan, Lei
    Li, Wu-Jun
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6046 - 6050
  • [9] Class-Variant Margin Normalized Softmax Loss for Deep Face Recognition
    Zhang, Wanping
    Chen, Yongru
    Yang, Wenming
    Wang, Guijin
    Xue, Jing-Hao
    Liao, Qingmin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4742 - 4747
  • [10] Additive cosine margin for unsupervised softmax embedding
    Wang, Dan
    Yang, Jianwei
    Wang, Cailing
    [J]. Journal of Electronic Imaging, 2024, 33 (04)