Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection

被引:6
|
作者
Kawa, Piotr [1 ]
Plata, Marcin [1 ]
Syga, Piotr [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Wroclaw, Poland
来源
关键词
DeepFake detection; spoofing detection; deep neural networks; LFCC; MFCC; dataset;
D O I
10.21437/Interspeech.2022-10078
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Audio DeepFakes allow the creation of high-quality, convincing utterances and therefore pose a threat due to its potential applications such as impersonation or fake news. Methods for detecting these manipulations should be characterized by good generalization and stability leading to robustness against attacks conducted with techniques that are not explicitly included in the training. In this work, we introduce Attack Agnostic Dataseta combination of two audio DeepFakes and one anti-spoofing datasets that, thanks to the disjoint use of attacks, can lead to better generalization of detection methods. We present a thorough analysis of current DeepFake detection methods and consider different audio features (front-ends). In addition, we propose a model based on LCNN with LFCC and mel-spectrogram front-end, which not only is characterized by a good generalization and stability results but also shows improvement over LFCC-based mode - we decrease standard deviation on all folds and EER in two folds by up to 5%.
引用
收藏
页码:4023 / 4027
页数:5
相关论文
共 50 条
  • [41] Temporal Feature Prediction in Audio-Visual Deepfake Detection
    Gao, Yuan
    Wang, Xuelong
    Zhang, Yu
    Zeng, Ping
    Ma, Yingjie
    ELECTRONICS, 2024, 13 (17)
  • [42] Speech Audio Deepfake Detection via Convolutional Neural Networks
    Valente, Lucas P.
    de Souza, Marcelo M. S.
    da Rocha, Alan M.
    IEEE CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS 2024, IEEE EAIS 2024, 2024, : 382 - 387
  • [43] CSTAN: A Deepfake Detection Network with CST Attention for Superior Generalization
    Yang, Rui
    You, Kang
    Pang, Cheng
    Luo, Xiaonan
    Lan, Rushi
    SENSORS, 2024, 24 (22)
  • [44] Leveraging facial landmarks improves generalization ability for deepfake detection
    Gao, Qi
    Zhang, Baopeng
    Wu, Jianghao
    Luo, Wenxin
    Teng, Zhu
    Fan, Jianping
    PATTERN RECOGNITION, 2025, 164
  • [45] WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection
    Zi, Bojia
    Chang, Minghao
    Chen, Jingjing
    Ma, Xingjun
    Jiang, Yu-Gang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2382 - 2390
  • [46] AUDIO DEEPFAKE DETECTION SYSTEM WITH NEURAL STITCHING FOR ADD 2022
    Yan, Rui
    Wen, Cheng
    Zhou, Shuran
    Guo, Tingwei
    Zou, Wei
    Li, Xiangang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9226 - 9230
  • [47] Quality-Agnostic Deepfake Detection with Intra-model Collaborative Learning
    Le, Binh M.
    Woo, Simon S.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22321 - 22332
  • [48] CUSTOM ATTRIBUTION LOSS FOR IMPROVING GENERALIZATION AND INTERPRETABILITY OF DEEPFAKE DETECTION
    Korshunov, Pavel
    Jain, Anubhav
    Marcel, Sebastien
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8972 - 8976
  • [49] Towards Model-Agnostic Dataset Condensation by Heterogeneous Models
    Moon, Jun-Yeong
    Kim, Jung Uk
    Park, Gyeong-Moon
    COMPUTER VISION - ECCV 2024, PT XXIX, 2025, 15087 : 234 - 250
  • [50] Assessing deepfake detection methods: a comparative evaluation on novel large-scale Asian deepfake dataset
    Kingra, Staffy
    Aggarwal, Naveen
    Kaur, Nirmal
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2025,