Spatiotemporal Inconsistency Learning for DeepFake Video Detection

被引:59
|
作者
Gu, Zhihao [1 ]
Chen, Yang [2 ]
Yao, Taiping [2 ]
Ding, Shouhong [2 ]
Li, Jilin [2 ]
Huang, Feiyue [2 ]
Ma, Lizhuang [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Tencent Youtu Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
deepfake video detection; spatiotemporal inconsistency modeling; video analysis;
D O I
10.1145/3474085.3475508
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid development of facial manipulation techniques has aroused public concerns in recent years. Following the success of deep learning, existing methods always formulate DeepFake video detection as a binary classification problem and develop frame-based and video-based solutions. However, little attention has been paid to capturing the spatial-temporal inconsistency in forged videos. To address this issue, we term this task as a Spatial-Temporal Inconsistency Learning (STIL) process and instantiate it into a novel STIL block,which consists of a Spatial Inconsistency Module (SIM), a Temporal Inconsistency Module (TIM), and an Information Supplement Module (ISM). Specifically, we present a novel temporal modeling paradigm in TIM by exploiting the temporal difference over adjacent frames along with both horizontal and vertical directions. And the ISM simultaneously utilizes the spatial information from SIM and temporal information from TIM to establish a more comprehensive spatial-temporal representation. Moreover, our STIL block is flexible and could be plugged into existing 2D CNNs. Extensive experiments and visualizations are presented to demonstrate the effectiveness of our method against the state-of-the-art competitors.
引用
收藏
页码:3473 / 3481
页数:9
相关论文
共 50 条
  • [1] Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection
    Gu, Zhihao
    Yao, Taiping
    Chen, Yang
    Ding, Shouhong
    Ma, Lizhuang
    [J]. COMPUTER VISION, ECCV 2022, PT XII, 2022, 13672 : 596 - 613
  • [2] Delving into the Local: Dynamic Inconsistency Learning for DeepFake Video Detection
    Gu, Zhihao
    Chen, Yang
    Yao, Taiping
    Ding, Shouhong
    Li, Jilin
    Ma, Lizhuang
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 744 - 752
  • [3] DeepFake Videos Detection via Spatiotemporal Inconsistency Learning and Interactive Fusion
    Ding, Xiangling
    Zhu, Wenjie
    Zhang, Dengyong
    [J]. 2022 19TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING (SECON), 2022, : 425 - 433
  • [4] Learning Spatiotemporal Inconsistency via Thumbnail Layout for Face Deepfake Detection
    Xu, Yuting
    Liang, Jian
    Sheng, Lijun
    Zhang, Xiao-Yu
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
  • [5] Dynamic Inconsistency-aware DeepFake Video Detection
    Hu, Ziheng
    Xie, Hongtao
    Wang, Yuxin
    Li, Jiahong
    Wang, Zhongyuan
    Zhang, Yongdong
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 736 - 742
  • [6] Deepfake Video Detection with Spatiotemporal Dropout Transformer
    Zhang, Daichi
    Lin, Fanzhao
    Hua, Yingying
    Wang, Pengju
    Zeng, Dan
    Ge, Shiming
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5833 - 5841
  • [7] Augmented Multi-Scale Spatiotemporal Inconsistency Magnifier for Generalized DeepFake Detection
    Yu, Yang
    Zhao, Xiaohui
    Ni, Rongrong
    Yang, Siyuan
    Zhao, Yao
    Kot, Alex C.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8487 - 8498
  • [8] MSVT: Multiple Spatiotemporal Views Transformer for DeepFake Video Detection
    Yu, Yang
    Ni, Rongrong
    Zhao, Yao
    Yang, Siyuan
    Xia, Fen
    Jiang, Ning
    Zhao, Guoqing
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4462 - 4471
  • [9] Local Region Frequency Guided Dynamic Inconsistency Network for Deepfake Video Detection
    Yue, Pengfei
    Chen, Beijing
    Fu, Zhangjie
    [J]. BIG DATA MINING AND ANALYTICS, 2024, 7 (03): : 889 - 904
  • [10] Video Transformer for Deepfake Detection with Incremental Learning
    Khan, Sohail Ahmed
    Dai, Hang
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1821 - 1828