MDCF-Net: Multi-Scale Dual-Branch Network for Compressed Face Forgery Detection

被引:0
|
作者
Zhou, Jiting [1 ]
Zhao, Xinrui [1 ]
Xu, Qian [1 ]
Zhang, Pu [1 ]
Zhou, Zhihao [1 ]
机构
[1] Shanghai Univ, Shanghai Film Acad, Shanghai 200072, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Frequency-domain analysis; Face recognition; Forgery; Transformers; Deepfakes; Image coding; Face forgery; deepfake detection; transformers; frequency domain; two-branch; feature fusion; FEATURE FUSION;
D O I
10.1109/ACCESS.2024.3390217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Face forgery detection aims to identify manipulated or altered facial images or videos created using artificial intelligence. Existing detection methods exhibit favorable performance on high-quality videos, but the videos in daily applications are commonly compressed into low-quality formats via social media. The detection difficulty is increased by the poor quality, indistinct detail features, and noises such as artifacts in these images or videos. To address this challenge, we propose a multi-scale dual-branch network for compressed face forgery, called MDCF-Net, effectively capturing cross-domain forgery features at various scales in compressed facial images. The MDCF-Net comprises two branches: an RGB domain branch utilizing Transformers to extract multi-scale fine-texture features from the original RGB images; a frequency domain branch designed to capture artifacts in low-quality videos by extracting global spectral features as a supplementary measure. Then, we introduce a feature fusion module (FFM) based on multi-head attention to merge diverse feature representations in a spatial-frequency complementary manner. Extensive comparative experiments on public datasets such as FaceForensics++, Celeb-DF, and WildDeepfake demonstrate the significant advantage of MDCF-Net in detecting highly compressed and low-quality forged images or videos, especially in achieving state-of-the-art performance on the FaceForensics++ low-quality dataset. Our approach presents a new perspective and technology for low-quality face forgery detection.
引用
收藏
页码:58740 / 58749
页数:10
相关论文
共 50 条
  • [41] Dual-Branch Feature Fusion Network for Salient Object Detection
    Song, Zhehan
    Xu, Zhihai
    Wang, Jing
    Feng, Huajun
    Li, Qi
    PHOTONICS, 2022, 9 (01)
  • [42] DMA-Net: A dual branch encoder and multi-scale cross attention fusion network for skin lesion segmentation
    College of Electronic and Information Engineering, Hebei University, Hebei, China
    不详
    IET Image Proc., 14 (4531-4541):
  • [43] A real-time surface defects detection model via dual-branch feature extraction and dynamic multi-scale fusion attention
    Pei, Jingni
    Li, Shujuan
    Li, Yan
    DIGITAL SIGNAL PROCESSING, 2024, 152
  • [44] DMF-Net: A Dual-Encoding Multi-Scale Fusion Network for Pavement Crack Detection
    Bai, Suli
    Yang, Lei
    Liu, Yanhong
    Yu, Hongnian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 5981 - 5996
  • [45] MSSD-Net: Multi-Scale SAR Ship Detection Network
    Wang, Xi
    Xu, Wei
    Huang, Pingping
    Tan, Weixian
    REMOTE SENSING, 2024, 16 (12)
  • [46] Dual Attention Network Approaches to Face Forgery Video Detection
    Luo, Yi-Xiang
    Chen, Jiann-Liang
    IEEE ACCESS, 2022, 10 : 110754 - 110760
  • [47] MRMNet: Multi-scale residual multi-branch neural network for object detection
    Dong, Yongsheng
    Liu, Yafeng
    Li, Xuelong
    NEUROCOMPUTING, 2024, 596
  • [48] Deep Graph Convolutional Network with Dual-Branch and Multi-interaction
    Lou J.
    Ye H.
    Yang B.
    Li M.
    Cao F.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (08): : 754 - 763
  • [49] An improved multi-scale face detection using convolutional neural network
    Mliki, Hazar
    Dammak, Sahar
    Fendri, Emna
    Dammak, Sahar (sahardammak@fsegs.u-sfax.tn), 1600, Springer Science and Business Media Deutschland GmbH (14): : 1345 - 1353
  • [50] An improved multi-scale face detection using convolutional neural network
    Mliki, Hazar
    Dammak, Sahar
    Fendri, Emna
    SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (07) : 1345 - 1353