Exposing Deepfake Videos with Spatial, Frequency and Multi-scale Temporal Artifacts

被引:2
|
作者
Hu, Yongjian [1 ]
Zhao, Hongjie [1 ]
Yu, Zeqiong [1 ]
Liu, Beibei [1 ]
Yu, Xiangyu [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
关键词
Deepfake video detection; Multi-domain features; Multi-scale temporal features; Cross-dataset performance;
D O I
10.1007/978-3-030-95398-0_4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The deepfake technique replaces the face in a source video with a fake face which is generated using deep learning tools such as generative adversarial networks (GANs). Even the facial expression can be well synchronized, making it difficult to identify the fake videos. Using features from multiple domains has been proved effective in the literature. It is also known that the temporal information is particularly critical in detecting deepfake videos, since the face-swapping of a video is implemented frame by frame. In this paper, we argue that the temporal differences between authentic and fake videos are complex and can not be adequately depicted from a single time scale. To obtain a complete picture of the temporal deepfake traces, we design a detection model with a short-term feature extraction module and a long-term feature extraction module. The short-term module captures the gradient information of adjacent frames. which is incorporated with the frequency and spatial information to make a multi-domain feature set. The long-term module then reveals the artifacts from a longer period of context. The proposed algorithm is tested on several popular databases, namely FaceForensics++, DeepfakeDetection (DFD), TIMIT-DF and FFW. Experimental results have validated the effectiveness of our algorithm through improved detection performance compared with related works.
引用
收藏
页码:47 / 57
页数:11
相关论文
共 50 条
  • [41] Compressive imaging based on multi-scale modulation and reconstruction in spatial frequency domain*
    Liu, Fan
    Liu, Xue-Feng
    Lan, Ruo-Ming
    Yao, Xu-Ri
    Dou, Shen-Cheng
    Wang, Xiao-Qing
    Zhai, Guang-Jie
    CHINESE PHYSICS B, 2021, 30 (01)
  • [42] Compressive imaging based on multi-scale modulation and reconstruction in spatial frequency domain
    刘璠
    刘雪峰
    蓝若明
    姚旭日
    窦申成
    王小庆
    翟光杰
    ChinesePhysicsB, 2021, 30 (01) : 323 - 330
  • [43] Spatial Index Technology for Multi-scale and Large Scale Spatial Data
    Liu, Yuanyuan
    Liu, Gang
    He, Zhenwen
    2010 18TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS, 2010,
  • [44] LEARNING MULTI-SCALE FEATURES FOR JPEG IMAGE ARTIFACTS REMOVAL
    Ji, Jiahuan
    Zhong, Baojiang
    Song, Weigang
    Ma, Kai-Kuang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1565 - 1569
  • [45] Multi-Scale Spatial-Temporal Integration Convolutional Tube for Human Action Recognition
    Wu, Haoze
    Liu, Jiawei
    Zhu, Xierong
    Wang, Meng
    Zha, Zheng-Jun
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 753 - 759
  • [46] Continuous Sign Language Recognition With Multi-Scale Spatial-Temporal Feature Enhancement
    Wang, Zhen
    Li, Dongyuan
    Jiang, Renhe
    Okumura, Manabu
    IEEE Access, 13 : 5491 - 5506
  • [47] MTSF: Multi-Scale Temporal-Spatial Fusion Network for Driver Attention Prediction
    Jin, Lisheng
    Ji, Bingdong
    Guo, Baicang
    Wang, Huanhuan
    Han, Zhuotong
    Liu, Xingchen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (02) : 1494 - 1509
  • [48] GaitASMS: gait recognition by adaptive structured spatial representation and multi-scale temporal aggregation
    Sun, Yan
    Long, Hu
    Feng, Xueling
    Nixon, Mark
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7057 - 7069
  • [49] Continuous Sign Language Recognition With Multi-Scale Spatial-Temporal Feature Enhancement
    Wang, Zhen
    Li, Dongyuan
    Jiang, Renhe
    Okumura, Manabu
    IEEE ACCESS, 2025, 13 : 5491 - 5506
  • [50] Spatial-temporal fraction map fusion with multi-scale remotely sensed images
    Zhang, Yihang
    Foody, Giles M.
    Ling, Feng
    Li, Xiaodong
    Ge, Yong
    Du, Yun
    Atkinson, Peter M.
    REMOTE SENSING OF ENVIRONMENT, 2018, 213 : 162 - 181