Improving Generalization of Deepfake Detection With Data Farming and Few-Shot Learning

被引：19

作者：

Korshunov, Pavel ^{[1
]}

Marcel, Sebastien ^{[1
]}

机构：

[1] Idiap Res Inst, Biometr Secur & Privacy Grp, CH-1920 Martigny, Switzerland

来源：

IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE | 2022年 / 4卷 / 03期

关键词：

Deepfakes detection; generalization; evaluation; deepfake dataset;

D O I：

10.1109/TBIOM.2022.3143404

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent advances in automated video and audio editing tools, generative adversarial networks (GANs), and social media allow creation and fast dissemination of high quality tampered videos, which are generally called deepfakes. Typically, in these videos, a face is swapped with someone else's using GANs. Accessible open source software and apps for the face swapping led to a wide and rapid dissemination of the generated deepfakes, posing a significant technical challenge for their detection and filtering. In response to the threat, which deepfake videos can pose to our trust in video evidence, several large datasets of deepfake videos and several methods to detect them were proposed recently. However, the proposed methods suffer from a problem of overfitting on the training data and the lack of the generalization across different databases and the generative models. Therefore, in this paper, we investigate the techniques for improving the generalization of deepfake detection methods that can be employed in practical settings. We have selected two popular state of the art deepfake detectors: based on Xception and EfficientNet models, and we use five databases: from Google and Jigsaw, FaceForensics++, DeeperForensics, Celeb-DF, and our own publicly available large dataset DF-Mobio. To improve generalization, we apply different augmentation strategies used during training, including a proposed aggressive 'data farming' technique based on random patches. We also tested two fewshot tuning methods, when either a first convolutional layer or a last layer of a pre-trained model is tuned on 100 seconds from a training set of the test database. The experimental results clearly expose the generalization problem of deepfake detection methods, since the accuracy drops significantly when a model is trained on one dataset and evaluated on another. However, the silver lining is that an aggressive augmentation during training and a fewshot tuning on the test database can improve the accuracy of the detection methods in a cross-database scenario. As a side observation, we show the importance of database selection for training and evaluation, as FaceForensics++ is found to be better to use for training, while DeeperForensics is found to be significantly more challenging as a test database.

引用

页码：386 / 397

页数：12

共 50 条

[31] Fast Hierarchical Learning for Few-Shot Object Detection
She, Yihang
Bhat, Goutam
Danelljan, Martin
Yu, Fisher
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 1993 - 2000
[32] Few-shot learning for signal detection in wideband spectrograms
Li, Weihao
Deng, Wen
Wang, Keren
You, Ling
Huang, Zhitao
DIGITAL SIGNAL PROCESSING, 2025, 162
[33] Few-Shot Anomaly Detection in Text with Deviation Learning
Das, Anindya Sundar
Ajay, Aravind
Saha, Sriparna
Bhuyan, Monowar
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 425 - 438
[34] A Gated Few-shot Learning Model For Anomaly Detection
Huang, Shaohan
Liu, Yi
Fung, Carol
An, Wanhe
He, Rong
Zhao, Yining
Yang, Hailong
Luan, Zhongzhi
2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 505 - 509
[35] Extensively Matching for Few-shot Learning Event Detection
Viet Dac Lai
Dernoncourt, Franck
Thien Huu Nguyen
NARRATIVE UNDERSTANDING, STORYLINES, AND EVENTS, 2020, : 38 - 45
[36] Few-shot learning through contextual data augmentation
Arthaud, Farid
Bawden, Rachel
Birch, Alexandra
16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1049 - 1062
[37] Identification of Novel Classes for Improving Few-Shot Object Detection
Shangguan, Zeyu
Rostami, Mohammad
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3348 - 3358
[38] Geoclidean: Few-Shot Generalization in Euclidean Geometry
Hsu, Joy
Wu, Jiajun
Goodman, Noah D.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[39] IMPROVING FEW-SHOT OBJECT DETECTION WITH OBJECT PART PROPOSALS
Chevalley, Arthur
Tomoiaga, Ciprian
Detyniecki, Marcin
Russwurm, Marc
Tuia, Devis
IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6502 - 6505
[40] Discriminative learning of imaginary data for few-shot classification
Zhang, Xu
Zhang, Youjia
Zhang, Zuyu
Liu, Jinzhuo
NEUROCOMPUTING, 2022, 467 : 406 - 417

← 1 2 3 4 5 →