DeepFake detection algorithm based on improved vision transformer

被引:0
|
作者
Young-Jin Heo
Woon-Ha Yeo
Byung-Gyu Kim
机构
[1] Sookmyung Women’s University,
来源
Applied Intelligence | 2023年 / 53卷
关键词
Deep learning; Deepfake detection; Distillation; Generative adversarial network; Vision transformer;
D O I
暂无
中图分类号
学科分类号
摘要
A DeepFake is a manipulated video made with generative deep learning technologies, such as generative adversarial networks or auto encoders that anyone can utilize. With the increase in DeepFakes, classifiers consisting of convolutional neural networks (CNN) that can distinguish them have been actively created. However, CNNs have a problem with overfitting and cannot consider the relation between local regions as global feature of image, resulting in misclassification. In this paper, we propose an efficient vision transformer model for DeepFake detection to extract both local and global features. We combine vector-concatenated CNN feature and patch-based positioning to interact with all positions to specify the artifact region. For the distillation token, the logit is trained using binary cross entropy through the sigmoid function. By adding this distillation, the proposed model is generalized to improve performance. From experiments, the proposed model outperforms the SOTA model by 0.006 AUC and 0.013 f1 score on the DFDC test dataset. For 2,500 fake videos, the proposed model correctly predicts 2,313 as fake, whereas the SOTA model predicts 2,276 in the best performance. With the ensemble method, the proposed model outperformed the SOTA model by 0.01 AUC. For Celeb-DF (v2) dataset, the proposed model achieves a high performance of 0.993 AUC and 0.978 f1 score, respectively.
引用
收藏
页码:7512 / 7527
页数:15
相关论文
共 50 条
  • [31] Transformer-based cascade networks with spatial and channel reconstruction convolution for deepfake detection
    Li, Xue
    Zhou, Huibo
    Zhao, Ming
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (03) : 4142 - 4164
  • [32] Combining EfficientNet and Vision Transformers for Video Deepfake Detection
    Coccomini, Davide Alessandro
    Messina, Nicola
    Gennaro, Claudio
    Falchi, Fabrizio
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 219 - 229
  • [33] Vision Transformer-Based Tailing Detection in Videos
    Lee, Jaewoo
    Lee, Sungjun
    Cho, Wonki
    Siddiqui, Zahid Ali
    Park, Unsang
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [34] MSVT: Multiple Spatiotemporal Views Transformer for DeepFake Video Detection
    Yu, Yang
    Ni, Rongrong
    Zhao, Yao
    Yang, Siyuan
    Xia, Fen
    Jiang, Ning
    Zhao, Guoqing
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4462 - 4471
  • [35] Intrusion Detection Model Based on Improved Transformer
    Liu, Yi
    Wu, Lanjian
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [36] A vision based road detection algorithm
    Hu, MH
    Yang, WJ
    Ren, MW
    Yang, JY
    [J]. 2004 IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, VOLS 1 AND 2, 2004, : 846 - 850
  • [37] An improved calibration algorithm for ITS based on vision
    Liu, Fuqiang
    Wang, Jing
    Guo, Lian
    Wang, Xinhong
    [J]. 2007 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 589 - 592
  • [38] Improving Video Vision Transformer for Deepfake Video Detection Using Facial Landmark, Depthwise Separable Convolution and Self Attention
    Ramadhani, Kurniawan Nur
    Munir, Rinaldi
    Utama, Nugraha Priya
    [J]. IEEE ACCESS, 2024, 12 : 8932 - 8939
  • [39] Deepfake Video Detection Based on Improved CapsNet and Temporal-Spatial Features
    Lu, Tianliang
    Bao, Yuxuan
    Li, Lanting
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 715 - 740
  • [40] A Vulnerability Detection Algorithm Based on Transformer Model
    Hou, Fujin
    Zhou, Kun
    Li, Longbin
    Tian, Yuan
    Li, Jie
    Li, Jian
    [J]. ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT III, 2022, 13340 : 43 - 55