Cascaded Network Based on EfficientNet and Transformer for Deepfake Video Detection

被引:4
|
作者
Deng, Liwei [1 ]
Wang, Jiandong [1 ]
Liu, Zhen [1 ]
机构
[1] Harbin Univ Sci & Technol, Sch Automation, Heilongjiang Prov Key Lab Complex Intelligent Syst, Harbin 150080, Peoples R China
基金
美国国家科学基金会;
关键词
Deepfake detection; EfficientNetV2S; Transformer; Visualization;
D O I
10.1007/s11063-023-11249-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the continuous development of deepfake technology, forged videos are continuously released on various network media, which facilitates people's lives but also brings great negative impacts. And these forged videos have a high degree of authenticity, which brings great challenges to detection. However, most deepfake detection models now only focus on model design, lacking versatility. To address these issues, we proposed a cascaded Network based on EfficientNet and Transformer to achieve deepfake detection tasks. The improved convolutional neural network EfficientNetV2S is used as a feature extractor, the features are input to the Transformer, and its attention mechanism is used for classification. And used Spatial-Reduction Attention (SRA) to improve the traditional attention mechanism in Transformer. And we carefully extracted and screened real and fake faces in the preprocessing stage and trained our model on DFDC and FaceForensics++ benchmarks, achieving state-of-the-art results such as an accuracy of 92.16% and an accuracy of 96.75%, respectively. Finally, we also achieved excellent visualization results on deepfake videos, proving the practicability of our method.
引用
收藏
页码:7057 / 7076
页数:20
相关论文
共 50 条
  • [31] Adversarially Robust Deepfake Video Detection
    Devasthale, Aditya
    Sural, Shamik
    [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 396 - 403
  • [32] Deep Convolutional Pooling Transformer for Deepfake Detection
    Wang, Tianyi
    Cheng, Harry
    Chow, Kam Pui
    Nie, Liqiang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [33] Visual attention-based deepfake video forgery detection
    Ganguly, Shreyan
    Mohiuddin, Sk
    Malakar, Samir
    Cuevas, Erik
    Sarkar, Ram
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2022, 25 (04) : 981 - 992
  • [34] Swin-Fake: A Consistency Learning Transformer-Based Deepfake Video Detector
    Gong, Liang Yu
    Li, Xue Jun
    Chong, Peter Han Joo
    [J]. ELECTRONICS, 2024, 13 (15)
  • [35] Deepfake video detection: challenges and opportunities
    Kaur, Achhardeep
    Hoshyar, Azadeh Noori
    Saikrishna, Vidya
    Firmin, Selena
    Xia, Feng
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (06)
  • [36] Local Region Frequency Guided Dynamic Inconsistency Network for Deepfake Video Detection
    Yue, Pengfei
    Chen, Beijing
    Fu, Zhangjie
    [J]. BIG DATA MINING AND ANALYTICS, 2024, 7 (03): : 889 - 904
  • [37] Transformer-based Cross Reference Network for video salient object detection
    Huang, Kan
    Tian, Chunwei
    Su, Jingyong
    Lin, Jerry Chun-Wei
    [J]. PATTERN RECOGNITION LETTERS, 2022, 160 : 122 - 127
  • [38] Improving Video Vision Transformer for Deepfake Video Detection Using Facial Landmark, Depthwise Separable Convolution and Self Attention
    Ramadhani, Kurniawan Nur
    Munir, Rinaldi
    Utama, Nugraha Priya
    [J]. IEEE ACCESS, 2024, 12 : 8932 - 8939
  • [39] Mask based Continuous Frame Faceswap Video Generation for Deepfake Detection
    Liu, Dazhuang
    Yang, Zhen
    Zhang, Ru
    Liu, Jianyi
    [J]. ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2023, 19 : 14 - 14
  • [40] A Cascaded Spatial Transformer Network for Oriented Equipment Detection in Thermal Images
    Lin, Ying
    Wang, Menglin
    Gu, Chao
    Qin, Jiafeng
    Bai, Demeng
    Li, Jun
    [J]. 2018 2ND IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2018,