Generalizable Deepfake Detection With Phase-Based Motion Analysis

被引:1
|
作者
Prashnani, Ekta [1 ,2 ]
Goebel, Michael [1 ,3 ]
Manjunath, B. S. [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
[2] NVIDIA Res, Santa Clara, CA 95051 USA
[3] Google LLC, Mountain View, CA 94043 USA
关键词
Feature extraction; Deepfakes; Faces; Robustness; Detectors; Dynamics; Generators; Deepfake detection; deep learning; video analysis; video forensics;
D O I
10.1109/TIP.2024.3441821
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose PhaseForensics, a DeepFake (DF) video detection method that uses a phase-based motion representation of facial temporal dynamics. Existing methods that rely on temporal information across video frames for DF detection have many advantages over the methods that only utilize the per-frame features. However, these temporal DF detection methods still show limited cross-dataset generalization and robustness to common distortions due to factors such as error-prone motion estimation, inaccurate landmark tracking, or the susceptibility of the pixel intensity-based features to adversarial distortions and the cross-dataset domain shifts. Our key insight to overcome these issues is to leverage the temporal phase variations in the band-pass frequency components of a face region across video frames. This not only enables a robust estimate of the temporal dynamics in the facial regions, but is also less prone to cross-dataset variations. Furthermore, we show that the band-pass filters used to compute the local per-frame phase form an effective defense against the perturbations commonly seen in gradient-based adversarial attacks. Overall, with PhaseForensics, we show improved distortion and adversarial robustness, and state-of-the-art cross-dataset generalization, with 92.4% video-level AUC on the challenging CelebDFv2 benchmark (a recent state of-the-art method, FTCN, compares at 86.9%).
引用
收藏
页码:100 / 112
页数:13
相关论文
共 50 条
  • [1] MeST-Former: Motion-enhanced Spatiotemporal Transformer for generalizable Deepfake detection
    Liu, Baoping
    Liu, Bo
    Ding, Ming
    Zhu, Tianqing
    NEUROCOMPUTING, 2024, 610
  • [2] Learning Pairwise Interaction for Generalizable DeepFake Detection
    Xu, Ying
    Raja, Kiran
    Verdoliva, Luisa
    Pedersen, Marius
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 672 - 682
  • [3] Nonlinear ultrasonic analysis inspired by phase-based motion magnification
    Liu, Peipei
    Ma, Zhanxiong
    Jang, Jinho
    Sohn, Hoon
    HEALTH MONITORING OF STRUCTURAL AND BIOLOGICAL SYSTEMS XVII, 2023, 12488
  • [4] Phase-Based Video Motion Processing
    Wadhwa, Neal
    Rubinstein, Michael
    Durand, Fredo
    Freeman, William T.
    ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04):
  • [5] UCF: Uncovering Common Features for Generalizable Deepfake Detection
    Yan, Zhiyuan
    Zhang, Yong
    Fan, Yanbo
    Wu, Baoyuan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22355 - 22366
  • [6] Phase-based 3D Optical Flow Sensors for Motion Detection
    Wang, Albert
    Molnar, Alyosha
    2011 IEEE SENSORS, 2011, : 683 - 686
  • [7] Experimental Modal Analysis Using Phase Quantities from Phase-Based Motion Processing and Motion Magnification
    D.P. Rohe
    P.L. Reu
    Experimental Techniques, 2021, 45 : 297 - 312
  • [8] On the usability of phase-based video motion magnification for defect detection in vibrating panels
    Cosco, F.
    Cuenca, J.
    Desmet, W.
    Janssens, K.
    Mundo, D.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NOISE AND VIBRATION ENGINEERING (ISMA2020) / INTERNATIONAL CONFERENCE ON UNCERTAINTY IN STRUCTURAL DYNAMICS (USD2020), 2020, : 2321 - 2331
  • [9] Experimental Modal Analysis Using Phase Quantities from Phase-Based Motion Processing and Motion Magnification
    Rohe, D. P.
    Reu, P. L.
    EXPERIMENTAL TECHNIQUES, 2021, 45 (03) : 297 - 312
  • [10] Phase-based image motion estimation and registration
    Hemmendorff, Magnus
    Andersson, Mats T.
    Knutsson, Hans
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 6 : 3345 - 3348