Improving Generalization of Deepfake Detection With Data Farming and Few-Shot Learning

被引:19
|
作者
Korshunov, Pavel [1 ]
Marcel, Sebastien [1 ]
机构
[1] Idiap Res Inst, Biometr Secur & Privacy Grp, CH-1920 Martigny, Switzerland
关键词
Deepfakes detection; generalization; evaluation; deepfake dataset;
D O I
10.1109/TBIOM.2022.3143404
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in automated video and audio editing tools, generative adversarial networks (GANs), and social media allow creation and fast dissemination of high quality tampered videos, which are generally called deepfakes. Typically, in these videos, a face is swapped with someone else's using GANs. Accessible open source software and apps for the face swapping led to a wide and rapid dissemination of the generated deepfakes, posing a significant technical challenge for their detection and filtering. In response to the threat, which deepfake videos can pose to our trust in video evidence, several large datasets of deepfake videos and several methods to detect them were proposed recently. However, the proposed methods suffer from a problem of overfitting on the training data and the lack of the generalization across different databases and the generative models. Therefore, in this paper, we investigate the techniques for improving the generalization of deepfake detection methods that can be employed in practical settings. We have selected two popular state of the art deepfake detectors: based on Xception and EfficientNet models, and we use five databases: from Google and Jigsaw, FaceForensics++, DeeperForensics, Celeb-DF, and our own publicly available large dataset DF-Mobio. To improve generalization, we apply different augmentation strategies used during training, including a proposed aggressive 'data farming' technique based on random patches. We also tested two fewshot tuning methods, when either a first convolutional layer or a last layer of a pre-trained model is tuned on 100 seconds from a training set of the test database. The experimental results clearly expose the generalization problem of deepfake detection methods, since the accuracy drops significantly when a model is trained on one dataset and evaluated on another. However, the silver lining is that an aggressive augmentation during training and a fewshot tuning on the test database can improve the accuracy of the detection methods in a cross-database scenario. As a side observation, we show the importance of database selection for training and evaluation, as FaceForensics++ is found to be better to use for training, while DeeperForensics is found to be significantly more challenging as a test database.
引用
收藏
页码:386 / 397
页数:12
相关论文
共 50 条
  • [31] Fast Hierarchical Learning for Few-Shot Object Detection
    She, Yihang
    Bhat, Goutam
    Danelljan, Martin
    Yu, Fisher
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 1993 - 2000
  • [32] Few-shot learning for signal detection in wideband spectrograms
    Li, Weihao
    Deng, Wen
    Wang, Keren
    You, Ling
    Huang, Zhitao
    DIGITAL SIGNAL PROCESSING, 2025, 162
  • [33] Few-Shot Anomaly Detection in Text with Deviation Learning
    Das, Anindya Sundar
    Ajay, Aravind
    Saha, Sriparna
    Bhuyan, Monowar
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 425 - 438
  • [34] A Gated Few-shot Learning Model For Anomaly Detection
    Huang, Shaohan
    Liu, Yi
    Fung, Carol
    An, Wanhe
    He, Rong
    Zhao, Yining
    Yang, Hailong
    Luan, Zhongzhi
    2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 505 - 509
  • [35] Extensively Matching for Few-shot Learning Event Detection
    Viet Dac Lai
    Dernoncourt, Franck
    Thien Huu Nguyen
    NARRATIVE UNDERSTANDING, STORYLINES, AND EVENTS, 2020, : 38 - 45
  • [36] Few-shot learning through contextual data augmentation
    Arthaud, Farid
    Bawden, Rachel
    Birch, Alexandra
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1049 - 1062
  • [37] Identification of Novel Classes for Improving Few-Shot Object Detection
    Shangguan, Zeyu
    Rostami, Mohammad
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3348 - 3358
  • [38] Geoclidean: Few-Shot Generalization in Euclidean Geometry
    Hsu, Joy
    Wu, Jiajun
    Goodman, Noah D.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [39] IMPROVING FEW-SHOT OBJECT DETECTION WITH OBJECT PART PROPOSALS
    Chevalley, Arthur
    Tomoiaga, Ciprian
    Detyniecki, Marcin
    Russwurm, Marc
    Tuia, Devis
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6502 - 6505
  • [40] Discriminative learning of imaginary data for few-shot classification
    Zhang, Xu
    Zhang, Youjia
    Zhang, Zuyu
    Liu, Jinzhuo
    NEUROCOMPUTING, 2022, 467 : 406 - 417