DeepFake detection algorithm based on improved vision transformer

被引:0
|
作者
Young-Jin Heo
Woon-Ha Yeo
Byung-Gyu Kim
机构
[1] Sookmyung Women’s University,
来源
Applied Intelligence | 2023年 / 53卷
关键词
Deep learning; Deepfake detection; Distillation; Generative adversarial network; Vision transformer;
D O I
暂无
中图分类号
学科分类号
摘要
A DeepFake is a manipulated video made with generative deep learning technologies, such as generative adversarial networks or auto encoders that anyone can utilize. With the increase in DeepFakes, classifiers consisting of convolutional neural networks (CNN) that can distinguish them have been actively created. However, CNNs have a problem with overfitting and cannot consider the relation between local regions as global feature of image, resulting in misclassification. In this paper, we propose an efficient vision transformer model for DeepFake detection to extract both local and global features. We combine vector-concatenated CNN feature and patch-based positioning to interact with all positions to specify the artifact region. For the distillation token, the logit is trained using binary cross entropy through the sigmoid function. By adding this distillation, the proposed model is generalized to improve performance. From experiments, the proposed model outperforms the SOTA model by 0.006 AUC and 0.013 f1 score on the DFDC test dataset. For 2,500 fake videos, the proposed model correctly predicts 2,313 as fake, whereas the SOTA model predicts 2,276 in the best performance. With the ensemble method, the proposed model outperformed the SOTA model by 0.01 AUC. For Celeb-DF (v2) dataset, the proposed model achieves a high performance of 0.993 AUC and 0.978 f1 score, respectively.
引用
收藏
页码:7512 / 7527
页数:15
相关论文
共 50 条
  • [1] DeepFake detection algorithm based on improved vision transformer
    Heo, Young-Jin
    Yeo, Woon-Ha
    Kim, Byung-Gyu
    [J]. APPLIED INTELLIGENCE, 2023, 53 (07) : 7512 - 7527
  • [2] Improved Deepfake Video Detection Using Convolutional Vision Transformer
    Deressa, Deressa Wodajo
    Lambert, Peter
    Van Wallendael, Glenn
    Atnafu, Solomon
    Mareen, Hannes
    [J]. 2024 IEEE GAMING, ENTERTAINMENT, AND MEDIA CONFERENCE, GEM 2024, 2024, : 492 - 497
  • [3] Efficient deepfake detection using shallow vision transformer
    Usmani, Shaheen
    Kumar, Sunil
    Sadhya, Debanjan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 12339 - 12362
  • [4] Efficient deepfake detection using shallow vision transformer
    Shaheen Usmani
    Sunil Kumar
    Debanjan Sadhya
    [J]. Multimedia Tools and Applications, 2024, 83 : 12339 - 12362
  • [5] Deepfake Image Detection using Vision Transformer Models
    Ghita, Bogdan
    Kuzminykh, Ievgeniia
    Usama, Abubakar
    Bakhshi, Taimur
    Marchang, Jims
    [J]. 2024 IEEE INTERNATIONAL BLACK SEA CONFERENCE ON COMMUNICATIONS AND NETWORKING, BLACKSEACOM 2024, 2024, : 332 - 335
  • [6] DeepFake detection with multi-scale convolution and vision transformer
    Lin, Hao
    Huang, Wenmin
    Luo, Weiqi
    Lu, Wei
    [J]. DIGITAL SIGNAL PROCESSING, 2023, 134
  • [7] Unmasking Deception: Empowering Deepfake Detection with Vision Transformer Network
    Arshed, Muhammad Asad
    Alwadain, Ayed
    Ali, Rao Faizan
    Mumtaz, Shahzad
    Ibrahim, Muhammad
    Muneer, Amgad
    [J]. MATHEMATICS, 2023, 11 (17)
  • [8] Vehicle Classification Algorithm Based on Improved Vision Transformer
    Dong, Xinlong
    Shi, Peicheng
    Tang, Yueyue
    Yang, Li
    Yang, Aixi
    Liang, Taonian
    [J]. WORLD ELECTRIC VEHICLE JOURNAL, 2024, 15 (08):
  • [9] Intrusion detection: A model based on the improved vision transformer
    Yang, Yu-Guang
    Fu, Hong-Mei
    Gao, Shang
    Zhou, Yi-Hua
    Shi, Wei-Min
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (09)
  • [10] DFDT: An End-to-End DeepFake Detection Framework Using Vision Transformer
    Khormali, Aminollah
    Yuan, Jiann-Shiun
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (06):