Conformer-Based Speaker Recognition Model for Real-Time Multi-Scenarios

被引:0
|
作者
Xuan, Xi [1 ]
Han, Runping [2 ]
Gao, Jingxin [1 ]
机构
[1] School of Arts and Sciences, Beijing Institute of Fashion Technology, Beijing,100029, China
[2] School of Fashion, Beijing Institute of Fashion Technology, Beijing,100029, China
关键词
Real time systems - Speech recognition;
D O I
暂无
中图分类号
学科分类号
摘要
To handle the problems of poor performances of speaker verification systems, appearing in multiple scenarios with cross-domain utterances, long-duration utterances and noisy utterances, a real-time robust speaker recognition model, PMS-Conformer, is designed based on Conformer in this paper. The architecture of the PMS-Conformer is inspired by the state-of-the-art model named MFA-Conformer. PMS-Conformer has made the improvements on the acoustic feature extractor, network components and loss calculation module of MFA-Conformer respectively, having the novel and effective acoustic feature extractor and the robust speaker embedding extractor with high generalization capability. PMS-Conformer is trained on VoxCeleb1&2 dataset, and it is compared with the baseline MFA-Conformer and ECAPA-TDNN, and extensive comparison experiments are conducted on the speaker verification tasks. The experimental results show that on VoxMovies with cross-domain utterances, SITW with long-duration utterances and VoxCeleb-O processed by adding noise to its utterances, the ASV system built with PMS-Conformer is more competitive than those built with MFA-Conformer and ECAPA-TDNN respectively. Moreover, the trainable Params and RTF of the speaker embedding extractor of PMS-Conformer are significantly lower than those of ECAPA-TDNN. All evaluation experiment results demonstrate that PMS-Conformer exhibits good performances in real-time multi-scenarios. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:147 / 156
相关论文
共 50 条
  • [1] Efficient Conformer-Based CTC Model for Intelligent Cockpit Speech Recognition
    Guo, Hanzhi
    Chen, Yunshu
    Xie, Xukang
    Xu, Gaopeng
    Guo, Wei
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 522 - 526
  • [2] A Robust Conformer-Based Speech Recognition Model for Mandarin Air Traffic Control
    Jiang, Peiyuan
    Pan, Weijun
    Zhang, Jian
    Wang, Teng
    Huang, Junxiang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 911 - 940
  • [3] Efficient conformer-based speech recognition with linear attention
    Li, Shengqiang
    Xu, Menglong
    Zhang, Xiao-Lei
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 448 - 453
  • [4] SPEAKER-CONDITIONING SINGLE-CHANNEL TARGET SPEAKER EXTRACTION USING CONFORMER-BASED ARCHITECTURES
    Sinha, Ragini
    Tammen, Marvin
    Rollwage, Christian
    Doclo, Simon
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [5] Conformer-Based Human Activity Recognition Using Inertial Measurement Units
    Seenath, Sowmiya
    Dharmaraj, Menaka
    SENSORS, 2023, 23 (17)
  • [6] A Real-Time License Plate Detection and Recognition Model in Unconstrained Scenarios
    Tao, Lingbing
    Hong, Shunhe
    Lin, Yongxing
    Chen, Yangbing
    He, Pingan
    Tie, Zhixin
    SENSORS, 2024, 24 (09)
  • [7] Sampleformer: An efficient conformer-based Neural Network for Automatic Speech Recognition
    Fan, Zeping
    Zhang, Xuejun
    Huang, Min
    Bu, Zhaohui
    INTELLIGENT DATA ANALYSIS, 2024, 28 (06) : 1647 - 1659
  • [8] Enhanced Conformer-Based Speech Recognition via Model Fusion and Adaptive Decoding with Dynamic Rescoring
    Geng, Junhao
    Jia, Dongyao
    He, Zihao
    Wu, Nengkai
    Li, Ziqi
    APPLIED SCIENCES-BASEL, 2024, 14 (24):
  • [9] Improving the Training Recipe for a Robust Conformer-based Hybrid Model
    Zeineldeen, Mohammad
    Xu, Jingjing
    Luescher, Christoph
    Schlueter, Ralf
    Ney, Hermann
    INTERSPEECH 2022, 2022, : 1036 - 1040
  • [10] Multi-Area Unit Commitment model Based on Multi-Scenarios Risk Analysis
    Wang, Jinhao
    Ren, Jianyun
    Zhang, Qi
    Jia, Yanbing
    Yang, Chaoying
    Zhang, Min
    Duan, Yongze
    Huang, Ming
    2018 2ND IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2018,