Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection

被引:0
|
作者
Amir Mohammad Rostami
Mohammad Mehdi Homayounpour
Ahmad Nickabadi
机构
[1] Amirkabir University of Technology,Department of Computer Engineering
来源
Circuits, Systems, and Signal Processing | 2023年 / 42卷
关键词
Automatic speaker verification; Spoof detection; ASVspoof; Efficient attention branch network; Combined loss function; EfficientNet-A0;
D O I
暂无
中图分类号
学科分类号
摘要
Many endeavors have sought to develop countermeasure techniques as enhancements on Automatic Speaker Verification (ASV) systems, in order to make them more robust against spoof attacks. As evidenced by the latest ASVspoof 2019 countermeasure challenge, models currently deployed for the task of ASV are, at their best, devoid of suitable degrees of generalization to unseen attacks. A joint improvement of components of ASV spoof detection systems including the classifier, feature extraction phase, and model loss function may lead to a better detection of attacks by these systems. Accordingly, the present study proposes the Efficient Attention Branch Network (EABN) architecture with a combined loss function to address the model generalization to unseen attacks. The EABN is based on attention and perception branches. The attention branch provides an attention mask that improves the classification performance and at the same time is interpretable from a human point of view. The perception branch, is used for our main purpose which is spoof detection. The new EfficientNet-A0 architecture was optimized and employed for the perception branch, with nearly ten times fewer parameters and approximately seven times fewer floating-point operations than the SE-Res2Net50 as the best existing network. The proposed method on ASVspoof 2019 dataset achieved EER = 0.86% and t-DCF = 0.0239 in the Physical Access (PA) scenario using the logPowSpec as the input feature extraction method. Furthermore, using the LFCC feature, and the SE-Res2Net50 for the perception branch, the proposed model achieved EER = 1.89% and t-DCF = 0.507 in the Logical Access (LA) scenario, which to the best of our knowledge, is the best single system ASV spoofing countermeasure method.
引用
收藏
页码:4252 / 4270
页数:18
相关论文
共 50 条
  • [1] Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection
    Rostami, Amir Mohammad
    Homayounpour, Mohammad Mehdi
    Nickabadi, Ahmad
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (07) : 4252 - 4270
  • [2] Optimized deep network based spoof detection in automatic speaker verification system
    Neelima, Medikonda
    Prabha, I. Santi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 13073 - 13091
  • [3] Optimized deep network based spoof detection in automatic speaker verification system
    Medikonda Neelima
    I. Santi Prabha
    Multimedia Tools and Applications, 2024, 83 : 13073 - 13091
  • [4] Automatic speaker verification systems and spoof detection techniques: review and analysis
    Mittal, Aakshi
    Dua, Mohit
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 105 - 134
  • [5] Automatic speaker verification systems and spoof detection techniques: review and analysis
    Aakshi Mittal
    Mohit Dua
    International Journal of Speech Technology, 2022, 25 : 105 - 134
  • [6] Deep generative variational autoencoding for replay spoof detection in automatic speaker verification
    Chettri, Bhusan
    Kinnunen, Tomi
    Benetos, Emmanouil
    COMPUTER SPEECH AND LANGUAGE, 2020, 63
  • [7] LSTM and CNN based ensemble approach for spoof detection task in automatic speaker verification systems
    Mohit Dua
    Chhavi Jain
    Sushil Kumar
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 1985 - 2000
  • [8] LSTM and CNN based ensemble approach for spoof detection task in automatic speaker verification systems
    Dua, Mohit
    Jain, Chhavi
    Kumar, Sushil
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (04) : 1985 - 2000
  • [9] FACTOR ANALYSIS METHODS FOR JOINT SPEAKER VERIFICATION AND SPOOF DETECTION
    Dhanush, B. K.
    Suparna, S.
    Aarthy, R.
    Likhita, C.
    Shashank, D.
    Harish, H.
    Ganapathy, Sriram
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5385 - 5389
  • [10] Speaker Verification Based on Channel Attention and Adaptive Joint Loss
    Fan, Houbin
    Li, Jun
    Ge, Fengpei
    Liang, Chunyan
    ELECTRONICS, 2025, 14 (03):