Exposing Deepfake Videos with Spatial, Frequency and Multi-scale Temporal Artifacts

被引:2
|
作者
Hu, Yongjian [1 ]
Zhao, Hongjie [1 ]
Yu, Zeqiong [1 ]
Liu, Beibei [1 ]
Yu, Xiangyu [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
关键词
Deepfake video detection; Multi-domain features; Multi-scale temporal features; Cross-dataset performance;
D O I
10.1007/978-3-030-95398-0_4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The deepfake technique replaces the face in a source video with a fake face which is generated using deep learning tools such as generative adversarial networks (GANs). Even the facial expression can be well synchronized, making it difficult to identify the fake videos. Using features from multiple domains has been proved effective in the literature. It is also known that the temporal information is particularly critical in detecting deepfake videos, since the face-swapping of a video is implemented frame by frame. In this paper, we argue that the temporal differences between authentic and fake videos are complex and can not be adequately depicted from a single time scale. To obtain a complete picture of the temporal deepfake traces, we design a detection model with a short-term feature extraction module and a long-term feature extraction module. The short-term module captures the gradient information of adjacent frames. which is incorporated with the frequency and spatial information to make a multi-domain feature set. The long-term module then reveals the artifacts from a longer period of context. The proposed algorithm is tested on several popular databases, namely FaceForensics++, DeepfakeDetection (DFD), TIMIT-DF and FFW. Experimental results have validated the effectiveness of our algorithm through improved detection performance compared with related works.
引用
收藏
页码:47 / 57
页数:11
相关论文
共 50 条
  • [31] ST-SBV: Spatial-Temporal Self-Blended Videos for Deepfake Detection
    Guan, Weinan
    Wang, Wei
    Peng, Bo
    Dong, Jing
    Tan, Tieniu
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 274 - 288
  • [32] Study on temporal and spatial differentiation of biocapacity in Shenyang from a multi-scale perspective
    Gao, Yanpeng
    Chen, Wenjun
    Guo, Chunyao
    PLOS ONE, 2022, 17 (02):
  • [33] MTESformer: Multi-Scale Temporal and Enhance Spatial Transformer for Traffic Flow Prediction
    Dong, Xinhua
    Zhao, Wanbo
    Han, Hongmu
    Zhu, Zhanyi
    Zhang, Hui
    IEEE ACCESS, 2024, 12 : 47231 - 47245
  • [34] MULTI-SCALE SPATIAL-TEMPORAL NETWORK FOR PERSON RE-IDENTIFICATION
    Wang, Zhikang
    He, Lihuo
    Gao, Xinbo
    Huang, Yuanfei
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2052 - 2056
  • [35] Multi-scale cross-correlation analysis of temporal and spatial seismic data
    Min Lin
    Jiaxin Qin
    Gang Wang
    The European Physical Journal B, 2020, 93
  • [36] Multi-scale scenarios of spatial-temporal dynamics in the European livestock sector
    Neumann, Kathleen
    Verburg, Peter H.
    Elbersen, Berien
    Stehfest, Elke
    Woltjer, Geert B.
    AGRICULTURE ECOSYSTEMS & ENVIRONMENT, 2011, 140 (1-2) : 88 - 101
  • [37] MTTPRE: A Multi-Scale Spatial-Temporal Model for Travel Time Prediction
    Wan, Feng
    Li, Linsen
    Wang, Ke
    Chen, Lu
    Gao, Yunjun
    Jiang, Weihao
    Pu, Shiliang
    30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 384 - 393
  • [38] MULTI-SCALE TEMPORAL FREQUENCY CONVOLUTIONAL NETWORK WITH AXIAL ATTENTION FOR SPEECH ENHANCEMENT
    Zhang, Guochang
    Yu, Libiao
    Wang, Chunliang
    Wei, Jianqiang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9122 - 9126
  • [39] Multi-Scale Inter-Communication Spatio-Temporal Network for Video Compression Artifacts Reduction
    Zhang, Tingrong
    Teng, Qizhi
    He, Xiaohai
    Ren, Chao
    Chen, Zhengxin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (03) : 1229 - 1233
  • [40] Query-aware multi-scale proposal network for weakly supervised temporal sentence grounding in videos
    Zhou, Mingyao
    Chen, Wenjing
    Sun, Hao
    Xie, Wei
    Dong, Ming
    Lu, Xiaoqiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 304