Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection

Cited by: 0
Authors
Li, Jiakang [1]
Sun, Meng [1]
Zhang, Xiongwei [1]
Affiliations
[1] Army Engineering University, Nanjing, People's Republic of China
Keywords
anti-spoofing; speaker recognition; replay detection; multi-task learning; joint detection; countermeasures
DOI
Not available
Chinese Library Classification number
TP31 [Computer software]
Subject classification codes
081202; 0835
Abstract
As spoofing technologies advance, automatic speaker verification (ASV) systems face serious security challenges, and many anti-spoofing countermeasures have been explored in response. There are two intuitive ways to protect an ASV system from spoofing. The first is a cascaded structure in which spoofing detection is performed first and ASV is then conducted only on the attempts that pass spoofing detection. The second is to perform spoofing detection and ASV jointly. The joint system has been shown to discriminate more reliably than cascaded systems with traditional methods, with advantages not only in accuracy but also in convenience and computational efficiency. In this paper, we propose a multi-task learning approach based on deep neural networks to build a joint system for ASV and anti-spoofing. The performance of different acoustic features and deep neural network structures is investigated on the ASVspoof 2017 version 2.0 dataset. The experimental results show that the joint equal error rate (EER) of our approach is 0.55% lower than that of a baseline joint system with Gaussian back-end fusion.
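The two protection schemes and the joint EER metric mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, thresholds, and scores below are hypothetical, and the EER is approximated by a simple threshold sweep. It assumes that in the joint setting only genuine speech from the claimed speaker counts as a target, while both impostor and spoofed (replayed) attempts count as non-targets.

```python
from typing import Sequence, Tuple


def cascaded_accept(spoof_score: float, asv_score: float,
                    spoof_thr: float, asv_thr: float) -> bool:
    """Cascaded scheme: the attempt must pass spoofing detection first,
    and only then is the ASV score checked. Two thresholds are needed."""
    return spoof_score >= spoof_thr and asv_score >= asv_thr


def joint_accept(joint_score: float, thr: float) -> bool:
    """Joint scheme: a single score (hypothetically produced by one
    multi-task model) is compared against a single threshold."""
    return joint_score >= thr


def error_rates(targets: Sequence[float], nontargets: Sequence[float],
                thr: float) -> Tuple[float, float]:
    """Return (false rejection rate, false acceptance rate) at a threshold.
    A trial is accepted when its score is >= thr."""
    frr = sum(s < thr for s in targets) / len(targets)
    far = sum(s >= thr for s in nontargets) / len(nontargets)
    return frr, far


def equal_error_rate(targets: Sequence[float],
                     nontargets: Sequence[float]) -> float:
    """Approximate the EER: sweep every observed score as a threshold and
    return the mean of FAR and FRR where their gap is smallest."""
    best_gap, best_eer = None, None
    for thr in sorted(set(targets) | set(nontargets)):
        frr, far = error_rates(targets, nontargets, thr)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, best_eer = gap, (far + frr) / 2
    return best_eer


# Made-up joint scores, for illustration only.
genuine_target = [0.9, 0.8, 0.7, 0.4]        # genuine speech, claimed speaker
impostor_or_spoof = [0.1, 0.2, 0.3, 0.6]     # impostors and replayed audio
print(equal_error_rate(genuine_target, impostor_or_spoof))  # 0.25
```

With a single joint score there is only one operating threshold to tune, whereas the cascaded scheme requires choosing two thresholds that interact with each other.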
Pages: 1517-1522
Number of pages: 6