Experimental Case Study of Self-Supervised Learning for Voice Spoofing Detection

被引:1
|
作者
Lee, Yerin [1 ]
Kim, Narin [1 ]
Jeong, Jaehong [2 ,3 ]
Kwak, Il-Youp [1 ]
机构
[1] Chung Ang Univ, Dept Appl Stat, Seoul 06974, South Korea
[2] Hanyang Univ, Dept Math, Seoul 04763, South Korea
[3] Hanyang Univ, Res Inst Nat Sci, Seoul 04763, South Korea
来源
IEEE ACCESS | 2023年 / 11卷
基金
新加坡国家研究基金会;
关键词
Self-supervised learning; Task analysis; Supervised learning; Speech processing; Deep learning; Training; Microphones; Spoofing detection; self-supervised learning; contrastive learning;
D O I
10.1109/ACCESS.2023.3254880
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study aims to improve the performance of voice spoofing attack detection through self-supervised pre-training. Supervised learning needs appropriate input variables and corresponding labels for constructing the machine learning models that are to be applied. It is necessary to secure a large number of labeled datasets to improve the performance of supervised learning processes. However, labeling requires substantial inputs of time and effort. One of the methods for managing this requirement is self-supervised learning, which uses pseudo-labeling without the necessity for substantial human input. This study experimented with contrastive learning, a well-performing self-supervised learning approach, to construct a voice spoofing detection model. We applied MoCo's dynamic dictionary, SimCLR's symmetric loss, and COLA's bilinear similarity in our contrastive learning framework. Our model was trained using VoxCeleb data and voice data extracted from YouTube videos. Our self-supervised model improved the performance of the baseline model from 6.93% to 5.26% for a logical access (LA) scenario and improved the performance of the baseline model from 0.60% to 0.40% for a physical access (PA) scenario. In the case of PA, the best performance was achieved when random crop augmentation was applied, and in the case of LA, the best performance was obtained when random crop and random shifting augmentations were considered.
引用
收藏
页码:24216 / 24226
页数:11
相关论文
共 50 条
  • [31] A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion
    Huang, Wen-Chin
    Yang, Shu-Wen
    Hayashi, Tomoki
    Toda, Tomoki
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1308 - 1318
  • [32] Contrastive self-supervised learning for diabetic retinopathy early detection
    Jihong Ouyang
    Dong Mao
    Zeqi Guo
    Siguang Liu
    Dong Xu
    Wenting Wang
    Medical & Biological Engineering & Computing, 2023, 61 : 2441 - 2452
  • [33] Generative and Contrastive Self-Supervised Learning for Graph Anomaly Detection
    Zheng, Yu
    Jin, Ming
    Liu, Yixin
    Chi, Lianhua
    Phan, Khoa T.
    Chen, Yi-Ping Phoebe
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12220 - 12233
  • [34] Classification-Based Self-Supervised Learning for Anomaly Detection
    Li, Honghu
    Zhu, Yuesheng
    He, Ying
    THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [35] Repeatable adaptive keypoint detection via self-supervised learning
    Pei Yan
    Yihua Tan
    Yuan Tai
    Science China Information Sciences, 2022, 65
  • [36] A NOVEL CONTRASTIVE LEARNING FRAMEWORK FOR SELF-SUPERVISED ANOMALY DETECTION
    Li, Jingze
    Lian, Zhichao
    Li, Min
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3366 - 3370
  • [37] Contrastive self-supervised learning for diabetic retinopathy early detection
    Ouyang, Jihong
    Mao, Dong
    Guo, Zeqi
    Liu, Siguang
    Xu, Dong
    Wang, Wenting
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (09) : 2441 - 2452
  • [38] Self-Supervised Video Representation Learning by Video Incoherence Detection
    Cao, Haozhi
    Xu, Yuecong
    Mao, Kezhi
    Xie, Lihua
    Yin, Jianxiong
    See, Simon
    Xu, Qianwen
    Yang, Jianfei
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (06) : 3810 - 3822
  • [39] Contrastive Self-Supervised Learning for Globally Distributed Landslide Detection
    Ghorbanzadeh, Omid
    Shahabi, Hejar
    Piralilou, Sepideh Tavakkoli
    Crivellari, Alessandro
    La Rosa, Laura Elena Cue
    Atzberger, Clement
    Li, Jonathan
    Ghamisi, Pedram
    IEEE ACCESS, 2024, 12 : 118453 - 118466
  • [40] Online Self-Supervised Deep Learning for Intrusion Detection Systems
    Nakip, Mert
    Gelenbe, Erol
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 5668 - 5683