A Multi-Stage Deep Learning Approach Incorporating Text-Image and Image-Image Comparisons for Cheapfake Detection

被引:1
|
作者
Seo, Jangwon [1 ]
Hwang, Hyo-Seok [1 ]
Lee, Jiyoung [2 ]
Lee, Minhyeok [3 ]
Kim, Wonsuk [2 ]
Seok, Junhee [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
[2] Safe AI, Seoul, South Korea
[3] Chung Ang Univ, Sch Elect & Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Cheapfakes; Misinformation; Out-of-context; BERT; Stable Diffusion; Ground Image Captioning; Semantic Textual Similarity;
D O I
10.1145/3652583.3657601
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advancement of multimedia and artificial intelligence (AI) technologies has dismantled the barriers of information sharing, yet it has also ushered in a double-edged sword: a surge in the spread of fake information. In this context, there is a growing need for research on the detection of 'cheapfakes,' which are low-cost fake media, known for their ease of creation. This paper proposes a multi-stage deep learning process designed to effectively detect the diverse and rapidly evolving nature of cheapfakes. A singlestep deep learning model faces limitations in distinguishing various types of cheapfakes, necessitating the application of a complex deep learning model approach to detect subtle Out-of-Context (OOC) phenomena. This study employs models based on Bidirectional Encoder Representations from Transformers (BERT) and stable diffusion technologies to approach cheapfake detection. Through the ACM ICMR 2024 challenge, the performance of this model was evaluated on a real dataset, achieving an accuracy of 71.9% in Task 1, an improvement of 7% over previous methods, and an accuracy of 55.7% in Task 2. These results are expected to make a significant contribution to the development of strategies for creating and countering cheapfakes. Additionally, this research aims to contribute to the detection of OOC media misuse through this challenge.
引用
收藏
页码:1312 / 1316
页数:5
相关论文
共 50 条
  • [21] Detection of Copy-Move Forgery in Digital Image Using Multi-scale, Multi-stage Deep Learning Model
    Jaiswal, Ankit Kumar
    Srivastava, Rajeev
    NEURAL PROCESSING LETTERS, 2022, 54 (01) : 75 - 100
  • [22] Detection of Copy-Move Forgery in Digital Image Using Multi-scale, Multi-stage Deep Learning Model
    Ankit Kumar Jaiswal
    Rajeev Srivastava
    Neural Processing Letters, 2022, 54 : 75 - 100
  • [23] Deep learning image denoising based on multi-stage supervised with Res2-Unet
    Liu Y.
    Chen G.
    Yu C.
    Wang S.
    Sun B.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (06): : 920 - 935
  • [24] Face mask detection using deep convolutional neural network and multi-stage image processing
    Umer, Muhammad
    Sadiq, Saima
    Alhebshi, Reemah M.
    Alsubai, Shtwai
    Al Hejaili, Abdullah
    Eshmawi, Ala' Abdulmajid
    Nappi, Michele
    Ashraf, Imran
    IMAGE AND VISION COMPUTING, 2023, 133
  • [25] A deep learning approach for image and text classification using neutrosophy
    Wajid M.A.
    Zafar A.
    Wajid M.S.
    International Journal of Information Technology, 2024, 16 (2) : 853 - 859
  • [26] Extractive Text-Image Summarization Using Multi-Modal RNN
    Chen, Jingqiang
    Hai Zhuge
    2018 14TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2018, : 245 - 248
  • [27] Progressive Image Restoration with Multi-stage Optimization
    Yang, Jiaming
    Zhang, Weihua
    Pu, Yifei
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 445 - 457
  • [28] A Multi-Stage Fingerprint Image Segmentation Method
    Mao, Keming
    Wang, Guoren
    Chang yong
    Jin, Yan
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 1141 - 1145
  • [29] Multi-stage image denoising with the wavelet transform
    Tian, Chunwei
    Zheng, Menghua
    Zuo, Wangmeng
    Zhang, Bob
    Zhang, Yanning
    Zhang, David
    PATTERN RECOGNITION, 2023, 134
  • [30] COMPOUND IMAGE COMPRESSION BY MULTI-STAGE PREDICTION
    Zhu, Weijia
    Ding, Wenpeng
    Xiong, Ruiqin
    Shi, Yuhui
    Yin, Baocai
    2012 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2012,