Improving Generalization of Deepfake Detection With Data Farming and Few-Shot Learning

Cited by: 19
Authors
Korshunov, Pavel [1]
Marcel, Sebastien [1]
Affiliations
[1] Idiap Res Inst, Biometr Secur & Privacy Grp, CH-1920 Martigny, Switzerland
Keywords
Deepfakes detection; generalization; evaluation; deepfake dataset
DOI
10.1109/TBIOM.2022.3143404
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recent advances in automated video and audio editing tools, generative adversarial networks (GANs), and social media allow the creation and fast dissemination of high-quality tampered videos, which are generally called deepfakes. Typically, in these videos, a face is swapped with someone else's using GANs. Accessible open-source software and apps for face swapping have led to a wide and rapid dissemination of the generated deepfakes, posing a significant technical challenge for their detection and filtering. In response to the threat that deepfake videos pose to our trust in video evidence, several large datasets of deepfake videos and several methods to detect them were proposed recently. However, the proposed methods suffer from overfitting on the training data and a lack of generalization across different databases and generative models. Therefore, in this paper, we investigate techniques for improving the generalization of deepfake detection methods that can be employed in practical settings. We have selected two popular state-of-the-art deepfake detectors, based on the Xception and EfficientNet models, and we use five databases: the database from Google and Jigsaw, FaceForensics++, DeeperForensics, Celeb-DF, and our own publicly available large dataset, DF-Mobio. To improve generalization, we apply different augmentation strategies during training, including a proposed aggressive 'data farming' technique based on random patches. We also tested two few-shot tuning methods, in which either the first convolutional layer or the last layer of a pre-trained model is tuned on 100 seconds of data from the training set of the test database. The experimental results clearly expose the generalization problem of deepfake detection methods, since the accuracy drops significantly when a model is trained on one dataset and evaluated on another. However, the silver lining is that aggressive augmentation during training and few-shot tuning on the test database can improve the accuracy of the detection methods in a cross-database scenario. As a side observation, we show the importance of database selection for training and evaluation, as FaceForensics++ is found to be better to use for training, while DeeperForensics is found to be significantly more challenging as a test database.
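The abstract does not say how the 'data farming' patches are chosen or combined, so the following is only a minimal sketch of a generic random-patch augmentation, not the authors' procedure. The function name `paste_random_patch`, the patch-pasting strategy, the square patch shape, and the toy 8x8 images are all assumptions made for illustration.

```python
import random


def paste_random_patch(target, donor, patch_size, rng):
    """Return a copy of `target` with one square patch copied from `donor`.

    Both images are equally sized 2-D lists of pixel values. The patch
    location is drawn uniformly at random. This is a hypothetical variant
    of random-patch augmentation, not the method described in the paper.
    """
    height, width = len(target), len(target[0])
    if len(donor) != height or len(donor[0]) != width:
        raise ValueError("target and donor must have the same shape")
    if not 0 < patch_size <= min(height, width):
        raise ValueError("patch_size must fit inside the image")

    # Pick the top-left corner so the whole patch stays inside the image.
    top = rng.randint(0, height - patch_size)
    left = rng.randint(0, width - patch_size)

    # Copy rows so the input image is left unmodified.
    out = [row[:] for row in target]
    for r in range(top, top + patch_size):
        out[r][left:left + patch_size] = donor[r][left:left + patch_size]
    return out, (top, left)


# Toy usage: an all-zero "image" receives a 3x3 patch from an all-one "image".
rng = random.Random(0)
target = [[0] * 8 for _ in range(8)]
donor = [[1] * 8 for _ in range(8)]
augmented, (top, left) = paste_random_patch(target, donor, 3, rng)
changed = sum(sum(row) for row in augmented)  # number of pixels replaced
```

In a training pipeline, a step like this would typically be applied on the fly to each sample with a fresh random location, so that the detector rarely sees the same image twice.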
Pages: 386-397
Page count: 12
Related papers
50 in total
  • [1] MCW: A Generalizable Deepfake Detection Method for Few-Shot Learning
    Guan, Lei
    Liu, Fan
    Zhang, Ru
    Liu, Jianyi
    Tang, Yifan
    SENSORS, 2023, 23 (21)
  • [2] Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
    Albalak, Alon
    Raffel, Colin
    Wang, William Yang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Meta-Learning With Relation Embedding for Few-Shot Deepfake Detection
    Liu, Xiaoyong
    Song, Pengcheng
    Lu, Pei
    Wang, Yanjun
    IEEE ACCESS, 2024, 12 : 180135 - 180145
  • [4] Improving Embedding Generalization in Few-Shot Learning With Instance Neighbor Constraints
    Zhou, Zhenyu
    Luo, Lei
    Liao, Qing
    Liu, Xinwang
    Zhu, En
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5197 - 5208
  • [5] DFCP: Few-Shot DeepFake Detection via Contrastive Pretraining
    Zou, Bo
    Yang, Chao
    Guan, Jiazhi
    Quan, Chengbin
    Zhao, Youjian
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2303 - 2308
  • [6] IMPROVING GENERALIZATION FOR FEW-SHOT REMOTE SENSING CLASSIFICATION WITH META-LEARNING
    Sharma, Surbhi
    Roscher, Ribana
    Riedel, Morris
    Memon, Shahbaz
    Cavallaro, Gabriele
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 5061 - 5064
  • [7] Learning a Universal Template for Few-shot Dataset Generalization
    Triantafillou, Eleni
    Larochelle, Hugo
    Zemel, Richard
    Dumoulin, Vincent
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7435 - 7446
  • [8] Few-Shot Learning with Novelty Detection
    Bjerge, Kim
    Bodesheim, Paul
    Karstoft, Henrik
    DEEP LEARNING THEORY AND APPLICATIONS, PT I, DELTA 2024, 2024, 2171 : 340 - 363
  • [9] Irecut plus MM: Data Generalization and Metric Improvement for Few-shot Learning
    Lin, Xixiang
    Li, Zhenghao
    Liu, Liangchen
    Wu, Jun
    Zhang, Lijun
    Zhou, Xiang-Dong
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2915 - 2920
  • [10] Improving Augmentation Efficiency for Few-Shot Learning
    Cho, Wonhee
    Kim, Eunwoo
    IEEE ACCESS, 2022, 10 : 17697 - 17706