FST-Net: Exploiting Frequency Spatial Temporal Information for Low-Quality Fake Video Detection

被引:1
|
作者
Zhang, Min [1 ,2 ]
Liu, Xiaohan [3 ]
Liu, Chenyu [3 ]
Zhang, Xueqi [1 ,2 ]
Xie, Haiyong [2 ,4 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Minist Culture & Tourism, Key Lab Cyberculture Content Cognit & Detect, Hefei 230027, Anhui, Peoples R China
[3] Natl Engn Lab Publ Safety Risk Percept & Control, Beijing 100040, Peoples R China
[4] Capital Med Univ, Adv Innovat Ctr Human Brain Protect, Beijing 100054, Peoples R China
关键词
forgery detection; fake video detection; spectrum decomposition; spatial-temporal features across frames; attention mechanism;
D O I
10.1109/ICTAI52525.2021.00087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, state-of-the-art face manipulation algorithms have made significant progresses to forge images and videos that are able to deceive human eyes or even detection algorithms, which brings new challenges to forgery detection. In particular, the performance of detection algorithms for forgery videos is not as perfect as that for forgery images; furthermore, the difficulty of detection increases dramatically for low-quality videos. To address this challenge, we propose a novel dual stream architecture, referred to as FST-Net, for jointly mining forged features in the frequency, spatial and temporal domains. Specifically, we extract the spectral information of different frequency bands to expose intra-frame artifacts, and use the separable 3D CNN (S3D) to extract the spatio-temporal features among video frame groups. Moreover, to make the model focus on the tampered area, we add an attention layer to both backbone networks. Comprehensive experiments show that our model outperforms existing methods in video detection on challenging FaceForensics++ datasets, especially on low-quality video datasets.
引用
收藏
页码:536 / 543
页数:8
相关论文
共 39 条
  • [1] Gesture detection in low-quality video
    Roh, Myung-Cheol
    Lee, Seong-Whan
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 791 - +
  • [2] Exploiting Spatial-temporal Correlations for Video Anomaly Detection
    Zhao, Mengyang
    Liu, Yang
    Liu, Jing
    Zeng, Xinhua
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1727 - 1733
  • [3] Using Low-Quality Video Sequences for Fish Detection and Tracking
    Lantsova, Ekaterina
    Voitiuk, Tatiana
    Zudilova, Tatiana
    Kaarna, Arto
    PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 426 - 433
  • [4] Temporal Information Detection for Video Quality Evaluation
    Yuan Fei
    Cheng En
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS AND INFORMATION SYSTEMS, 2009, : 38 - 41
  • [5] Impact of Spatial and Temporal Information on Video Quality and Compressibility
    Robitza, Werner
    Rao, Rakesh Rao Ramachandra
    Goring, Steve
    Raake, Alexer
    2021 13TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2021, : 65 - 68
  • [6] Wire Detection in Low-Altitude, Urban, and Low-Quality Video Frames
    Candamo, Joshua
    Goldgof, Dmitry
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 1510 - 1513
  • [7] Improved Side Information Generation for Distributed Video Coding by Exploiting Spatial and Temporal Correlations
    Shuiming Ye
    Mourad Ouaret
    Frederic Dufaux
    Touradj Ebrahimi
    EURASIP Journal on Image and Video Processing, 2009
  • [8] Improved Side Information Generation for Distributed Video Coding by Exploiting Spatial and Temporal Correlations
    Ye, Shuiming
    Ouaret, Mourad
    Dufaux, Frederic
    Ebrahimi, Touradj
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2009,
  • [9] Online Detection of Low-Quality Synchrophasor Data Considering Frequency Similarity
    Ju, Wenyun
    Silva-Saravia, Horacio
    Nayak, Neeraj
    Yao, Wenxuan
    Zhang, Yichen
    Shi, Qingxin
    Ye, Fan
    IEEE TRANSACTIONS ON POWER DELIVERY, 2021, 36 (06) : 3988 - 3991
  • [10] Exploring Spatial Frequency Information for Enhanced Video Prediction Quality
    Lai, Junyu
    Gan, Lianqiang
    Zhu, Junhong
    Liu, Huashuo
    Gao, Lianli
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8955 - 8968