ECAPA++: Fine-grained Deep Embedding Learning for TDNN Based Speaker Verification

Cited by: 1
Authors
Liu, Bei [1]
Qian, Yanmin [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, X-LANCE Lab, MoE Key Lab Artificial Intelligence, AI Inst, Shanghai, Peoples R China
Source
Keywords
speaker verification; time-delay neural network; ECAPA; ResNet; system fusion; RECOGNITION;
DOI
10.21437/Interspeech.2023-777
Chinese Library Classification (CLC)
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we aim to bridge the performance gap between TDNN and 2D CNN based speaker verification systems. Specifically, we propose three architectural enhancements to ECAPA-TDNN: 1) following a depth-first design to significantly increase network depth while maintaining complexity; 2) introducing recursive convolution to better capture fine-grained speaker information; and 3) proposing a pyramid-based multi-path feature enhancement module to yield more discriminative speaker representations. Experiments on VoxCeleb show that our final model, named ECAPA++, achieves 25%, 23% and 24% relative improvements on Vox1-O, Vox1-E and Vox1-H respectively, with 2.4x fewer parameters and 2.3x fewer FLOPs than the previous best TDNN-based system. It is also comparable to state-of-the-art ResNet-based systems while being more computationally efficient. In addition, further performance gains can be achieved by fusing ECAPA++ with ResNet-based systems.
Pages: 3132-3136
Page count: 5