Attention Guided Spatio-Temporal Artifacts Extraction for Deepfake Detection

Cited by: 1
Authors
Wang, Zhibing [1, 2]
Li, Xin [1, 2]
Ni, Rongrong [1, 2]
Zhao, Yao [1, 2]
Affiliations
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
Keywords
Spatio-temporal artifacts; Attention; Deepfake detection;
DOI
10.1007/978-3-030-88013-2_31
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, deep-learning based models have been widely used for deepfake video detection due to their effectiveness in artifact extraction. Most existing deep-learning detection methods that use an attention mechanism attach more importance to information in the spatial domain. However, the discrepancy between frames is also important, and different temporal regions should receive different levels of attention. To address this problem, this paper proposes an Attention Guided LSTM Network (AGLNet), which takes into consideration the mutual correlations in both the temporal and spatial domains to effectively capture the artifacts in deepfake videos. In particular, sequential feature maps extracted from the convolutional and fully-connected layers of the convolutional neural network are respectively fed into the attention guided LSTM module to learn soft spatio-temporal assignment weights, which help aggregate not only detailed spatial information but also temporal information from consecutive video frames. Experiments on the FaceForensics++ and Celeb-DF datasets demonstrate the superiority of the proposed AGLNet model in extracting spatio-temporal artifacts.
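The idea of soft assignment weights over consecutive frames can be illustrated with a minimal sketch. This is not the AGLNet module itself: the abstract does not specify the LSTM or the spatial attention in enough detail to reproduce them, so the example only shows generic soft temporal attention pooling. The function names, the `query` scoring vector (which a real model would learn), and the toy feature values are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_attention_pool(frame_features, query):
    """Aggregate per-frame feature vectors using soft attention weights.

    frame_features: list of T feature vectors (one per frame), each of length D.
    query: hypothetical length-D scoring vector (learned in a real model).
    Returns (weights, pooled); the weights sum to 1 across the T frames.
    """
    # Score each frame by its dot product with the query vector.
    scores = [sum(f * q for f, q in zip(frame, query)) for frame in frame_features]
    weights = softmax(scores)
    dim = len(frame_features[0])
    # Weighted sum of the frame features, one output value per dimension.
    pooled = [
        sum(w * frame[d] for w, frame in zip(weights, frame_features))
        for d in range(dim)
    ]
    return weights, pooled


# Toy input: three frames with 2-dimensional features (made-up values).
frames = [[0.1, 0.2], [0.9, 0.8], [0.2, 0.1]]
query = [1.0, 1.0]
weights, pooled = temporal_attention_pool(frames, query)
```

In this toy input the second frame scores highest against `query`, so it receives the largest weight and dominates the pooled vector. In a trained detector the scores would come from learned parameters rather than a fixed vector.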
Pages: 374 - 386
Number of pages: 13
Related papers
50 in total
  • [21] A Video Salient Object Detection Model Guided by Spatio-Temporal Prior
    Jiang, Wen-Wen
    Yang, Kai-Fu
    Li, Yong-Jie
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2555 - 2562
  • [22] Annoyance of spatio-temporal artifacts in segmentation quality assessment
    Gelasca, EDG
    Ebrahimi, T
    Farias, MCQ
    Carli, MC
    Mitra, SK
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 345 - 348
  • [23] Spatio-temporal fuzzy filtering for coding artifacts reduction
    Vo, Dung Trung
    Yea, Sehoon
    Vetro, Anthony
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2008, PTS 1 AND 2, 2008, 6822
  • [24] Object extraction by spatio-temporal assembling
    Qin, Xiaoke
    Tang, Liang
    Zhou, Be
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 2405 - +
  • [25] Semantic-guided spatio-temporal attention for few-shot action recognition
    Jianyu Wang
    Baolin Liu
    Applied Intelligence, 2024, 54 : 2458 - 2471
  • [26] Semantic-guided spatio-temporal attention for few-shot action recognition
    Wang, Jianyu
    Liu, Baolin
    APPLIED INTELLIGENCE, 2024, 54 (03) : 2458 - 2471
  • [27] Spatio-temporal knowledge distilled video vision transformer (STKD-VViT) for multimodal deepfake detection
    Usmani, Shaheen
    Kumar, Sunil
    Sadhya, Debanjan
    NEUROCOMPUTING, 2025, 620
  • [28] Spatio-Temporal Memory Attention for Image Captioning
    Ji, Junzhong
    Xu, Cheng
    Zhang, Xiaodan
    Wang, Boyue
    Song, Xinhang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7615 - 7628
  • [29] Measuring the velocity of spatio-temporal attention waves
    Jagacinski, Richard J.
    Ma, Aijia
    Morrison, Tyler N.
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2024, 122
  • [30] Cascading spatio-temporal attention network for real-time action detection
    Yang, Jianhua
    Wang, Ke
    Li, Ruifeng
    Perner, Petra
    MACHINE VISION AND APPLICATIONS, 2023, 34 (06)