FST-Net: Exploiting Frequency Spatial Temporal Information for Low-Quality Fake Video Detection

被引:1
|
作者
Zhang, Min [1 ,2 ]
Liu, Xiaohan [3 ]
Liu, Chenyu [3 ]
Zhang, Xueqi [1 ,2 ]
Xie, Haiyong [2 ,4 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Peoples R China
[2] Minist Culture & Tourism, Key Lab Cyberculture Content Cognit & Detect, Hefei 230027, Anhui, Peoples R China
[3] Natl Engn Lab Publ Safety Risk Percept & Control, Beijing 100040, Peoples R China
[4] Capital Med Univ, Adv Innovat Ctr Human Brain Protect, Beijing 100054, Peoples R China
关键词
forgery detection; fake video detection; spectrum decomposition; spatial-temporal features across frames; attention mechanism;
D O I
10.1109/ICTAI52525.2021.00087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, state-of-the-art face manipulation algorithms have made significant progresses to forge images and videos that are able to deceive human eyes or even detection algorithms, which brings new challenges to forgery detection. In particular, the performance of detection algorithms for forgery videos is not as perfect as that for forgery images; furthermore, the difficulty of detection increases dramatically for low-quality videos. To address this challenge, we propose a novel dual stream architecture, referred to as FST-Net, for jointly mining forged features in the frequency, spatial and temporal domains. Specifically, we extract the spectral information of different frequency bands to expose intra-frame artifacts, and use the separable 3D CNN (S3D) to extract the spatio-temporal features among video frame groups. Moreover, to make the model focus on the tampered area, we add an attention layer to both backbone networks. Comprehensive experiments show that our model outperforms existing methods in video detection on challenging FaceForensics++ datasets, especially on low-quality video datasets.
引用
收藏
页码:536 / 543
页数:8
相关论文
共 39 条
  • [31] Effects of frequency of supplementation on dry matter intake and net portal and hepatic flux of nutrients in mature ewes that consume low-quality forage
    Krehbiel, CR
    Ferrell, CL
    Freetly, HC
    JOURNAL OF ANIMAL SCIENCE, 1998, 76 (09) : 2464 - 2473
  • [32] Effects of ruminal protein degradability and frequency of supplementation on net flux of N in visceral tissues of lambs fed a low-quality forage.
    Atkinson, R. L.
    Toone, C. D.
    Harmon, D. L.
    Ludden, P. A.
    JOURNAL OF ANIMAL SCIENCE, 2006, 84 : 48 - 48
  • [33] Video Saliency Detection via Spatial-Temporal Fusion and Low-Rank Coherency Diffusion
    Chen, Chenglizhao
    Li, Shuai
    Wang, Yongguang
    Qin, Hong
    Hao, Aimin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) : 3156 - 3170
  • [34] Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks
    Wang, Xiang
    Yang, Jie
    Kasabov, Nikola K.
    SENSORS, 2023, 23 (09)
  • [35] Low-spatial-frequency information facilitates threat detection in a response-specific manner
    Zhu, Shengnan
    Zhang, Yang
    Dong, Junli
    Chen, Lihong
    Luo, Wenbo
    JOURNAL OF VISION, 2021, 21 (04):
  • [36] A VERY LOW COMPLEXITY REDUCED REFERENCE VIDEO QUALITY METRIC BASED ON SPATIO-TEMPORAL INFORMATION SELECTION
    Wang, Mengmeng
    Zhang, Fan
    Agrafiotis, Dimitris
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 571 - 575
  • [37] Detection and evaluation of SCADA voltage data quality in distribution network based on multi temporal and spatial information of multi data sources
    China Electric Power Research Institute, Haidian District, Beijing
    100192, China
    不详
    102206, China
    Dianwang Jishu, 11 (3169-3175):
  • [38] A Directional Method for Blowout Well Detection Based on Spatial Information of Alternating Very-Low-Frequency Magnetic Field
    Yang, Ke
    Gao, Mingyu
    Wang, Yizhe
    Qin, Pei
    Li, Bin
    Ma, Yan
    Yang, Zhicheng
    IEEE SENSORS JOURNAL, 2025, 25 (04) : 7128 - 7134
  • [39] Joint Spatial-Temporal Quality Improvement Scheme for H.264 Low Bit Rate Video Coding via Adaptive Frameskip
    Cui, Ziguan
    Gan, Zongliang
    Zhu, Xiuchang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (01): : 426 - 445