Exposing Deepfake Videos with Spatial, Frequency and Multi-scale Temporal Artifacts

被引:2
|
作者
Hu, Yongjian [1 ]
Zhao, Hongjie [1 ]
Yu, Zeqiong [1 ]
Liu, Beibei [1 ]
Yu, Xiangyu [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
关键词
Deepfake video detection; Multi-domain features; Multi-scale temporal features; Cross-dataset performance;
D O I
10.1007/978-3-030-95398-0_4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The deepfake technique replaces the face in a source video with a fake face which is generated using deep learning tools such as generative adversarial networks (GANs). Even the facial expression can be well synchronized, making it difficult to identify the fake videos. Using features from multiple domains has been proved effective in the literature. It is also known that the temporal information is particularly critical in detecting deepfake videos, since the face-swapping of a video is implemented frame by frame. In this paper, we argue that the temporal differences between authentic and fake videos are complex and can not be adequately depicted from a single time scale. To obtain a complete picture of the temporal deepfake traces, we design a detection model with a short-term feature extraction module and a long-term feature extraction module. The short-term module captures the gradient information of adjacent frames. which is incorporated with the frequency and spatial information to make a multi-domain feature set. The long-term module then reveals the artifacts from a longer period of context. The proposed algorithm is tested on several popular databases, namely FaceForensics++, DeepfakeDetection (DFD), TIMIT-DF and FFW. Experimental results have validated the effectiveness of our algorithm through improved detection performance compared with related works.
引用
收藏
页码:47 / 57
页数:11
相关论文
共 50 条
  • [1] Watching the BiG artifacts: Exposing DeepFake videos via Bi-granularity artifacts
    Chen, Han
    Li, Yuezun
    Lin, Dongdong
    Li, Bin
    Wu, Junqiang
    PATTERN RECOGNITION, 2023, 135
  • [2] Social Relation Recognition from Videos via Multi-scale Spatial-Temporal Reasoning
    Liu, Xinchen
    Liu, Wu
    Zhang, Meng
    Chen, Jingwen
    Gao, Lianli
    Yan, Chenggang
    Mei, Tao
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3561 - 3569
  • [3] A Multi-Scale Spatial-Temporal Attention Model for Person Re-Identification in Videos
    Zhang, Wei
    He, Xuanyu
    Yu, Xiaodong
    Lu, Weizhi
    Zha, Zhengjun
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3365 - 3373
  • [4] Refining Localized Attention Features with Multi-Scale Relationships for Enhanced Deepfake Detection in Spatial-Frequency Domain
    Gao, Yuan
    Zhang, Yu
    Zeng, Ping
    Ma, Yingjie
    ELECTRONICS, 2024, 13 (09)
  • [5] BZNet: Unsupervised Multi-scale Branch Zooming Network for Detecting Low-quality Deepfake Videos
    Lee, Sangyup
    An, Jaeju
    Woo, Simon S.
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3500 - 3510
  • [6] MULTI-SCALE PERMUTATION ENTROPY FOR AUDIO DEEPFAKE DETECTION
    Wang, Chenglong
    He, Jiayi
    Yi, Jiangyan
    Tao, Jianhua
    Zhang, Chu Yuan
    Zhang, Xiaohui
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1406 - 1410
  • [7] DeepFake detection with multi-scale convolution and vision transformer
    Lin, Hao
    Huang, Wenmin
    Luo, Weiqi
    Lu, Wei
    DIGITAL SIGNAL PROCESSING, 2023, 134
  • [8] Perceptual Annoyance Models for Videos With Combinations of Spatial and Temporal Artifacts
    Silva, Alexandre F.
    Farias, Mylene C. Q.
    Redi, Judith A.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (12) : 2446 - 2456
  • [9] Contrastive learning-based general Deepfake detection with multi-scale RGB frequency clues
    Dong, Fengkai
    Zou, Xiaoqiang
    Wang, Jiahui
    Liu, Xiyao
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (04) : 90 - 99
  • [10] Noise-aware progressive multi-scale deepfake detection
    Ding X.
    Pang S.
    Guo W.
    Multimedia Tools and Applications, 2024, 83 (36) : 83677 - 83693