Real face foundation representation learning for generalized deepfake detection

被引:0
|
作者
Shi, Liang [1 ,2 ]
Zhang, Jie [1 ,2 ,3 ]
Ji, Zhilong [4 ]
Bai, Jinfeng [4 ]
Shan, Shiguang [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab AI Safety CAS, Beijing 100090, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Inst Intelligent Comp Technol, Suzhou 215124, Peoples R China
[4] Tomorrow Adv Life Educ Grp, Beijing 100085, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Deepfake detection; Masked image modeling; Representation learning; Vision Transformers;
D O I
10.1016/j.patcog.2024.111299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The emergence of deepfake technologies has become a matter of social concern as they pose threats to individual privacy and public security. It is now of great significance to develop reliable deepfake detectors. However, with numerous face manipulation algorithms present, existing detectors fail to generalize to all types of manipulated faces. To address this, we propose an alternative method that primarily models the distribution of real faces. Our approach, named Real Face Foundation Representation Learning (RFFR), aims to learn a general representation from large-scale real face datasets and identifies potential artifacts that deviate from the distribution of RFFR. Specifically, we train a model on real face datasets by masked image modeling (MIM). When applying the model on fake samples, we observe clear discrepancies between input faces and the reconstructed ones, which reveals artifacts absent in the RFFR distribution. However, these discrepancies are significantly less evident in real faces, which makes it easier to build a deepfake detector sensitive to a wide range of potential artifacts. Extensive experiments demonstrate that our method achieves better generalization performance, as it significantly outperforms the state-of-the-art methods in cross-manipulation evaluations, and has the potential to further improve by introducing extra real faces for training RFFR.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Towards Generalized Detection of Face-Swap Deepfake Images
    Ghasemzadeh, Faraz
    Moghaddam, Tina
    Dai, Jingming
    Yun, Joobeom
    Kim, Dan Dongseong
    PROCEEDINGS OF THE 3RD ACM WORKSHOP ON SECURITY IMPLICATIONS OF DEEPFAKES AND CHEAPFAKES, ACM WDC 2024, 2024, : 8 - 13
  • [2] ESRL: efficient similarity representation learning for deepfake detection
    Wang, Feng
    Zhang, Dengyong
    Guo, Zhiqing
    Wang, Dewang
    Yang, Gaobo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 76991 - 77007
  • [3] Deepfake Video Detection via Predictive Representation Learning
    Ge, Shiming
    Lin, Fanzhao
    Li, Chenyu
    Zhang, Daichi
    Wang, Weiping
    Zeng, Dan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)
  • [4] Learning A Self-Supervised Domain-Invariant Feature Representation for Generalized Audio Deepfake Detection
    Xie, Yuankun
    Cheng, Haonan
    Wang, Yutian
    Ye, Long
    INTERSPEECH 2023, 2023, : 2808 - 2812
  • [5] Towards generalized face forgery detection with domain-robust representation learning
    Li, Caiyu
    Wo, Yan
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [6] The Face Deepfake Detection Challenge
    Guarnera, Luca
    Giudice, Oliver
    Guarnera, Francesco
    Ortis, Alessandro
    Puglisi, Giovanni
    Paratore, Antonino
    Bui, Linh M. Q.
    Fontani, Marco
    Coccomini, Davide Alessandro
    Caldelli, Roberto
    Falchi, Fabrizio
    Gennaro, Claudio
    Messina, Nicola
    Amato, Giuseppe
    Perelli, Gianpaolo
    Concas, Sara
    Cuccu, Carlo
    Orru, Giulia
    Marcialis, Gian Luca
    Battiato, Sebastiano
    JOURNAL OF IMAGING, 2022, 8 (10)
  • [7] Learning Face Forgery Detection in Unseen Domain with Generalization Deepfake Detector
    Tran, Van-Nhan
    Lee, Suk-Hwan
    Le, Hoanh-Su
    Kim, Bo-Sung
    Kwon, Ki-Ryong
    2023 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, ICCE, 2023,
  • [8] Audio-visual deepfake detection using articulatory representation learning
    Wang, Yujia
    Huang, Hua
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
  • [9] Face detection using representation learning
    Zhan, Shu
    Tao, Qin-Qin
    Li, Xiao-Hong
    NEUROCOMPUTING, 2016, 187 : 19 - 26
  • [10] Learning Spatiotemporal Inconsistency via Thumbnail Layout for Face Deepfake Detection
    Xu, Yuting
    Liang, Jian
    Sheng, Lijun
    Zhang, Xiao-Yu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (12) : 5663 - 5680