A Multi-Stage Deep Learning Approach Incorporating Text-Image and Image-Image Comparisons for Cheapfake Detection

被引:1
|
作者
Seo, Jangwon [1 ]
Hwang, Hyo-Seok [1 ]
Lee, Jiyoung [2 ]
Lee, Minhyeok [3 ]
Kim, Wonsuk [2 ]
Seok, Junhee [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
[2] Safe AI, Seoul, South Korea
[3] Chung Ang Univ, Sch Elect & Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Cheapfakes; Misinformation; Out-of-context; BERT; Stable Diffusion; Ground Image Captioning; Semantic Textual Similarity;
D O I
10.1145/3652583.3657601
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advancement of multimedia and artificial intelligence (AI) technologies has dismantled the barriers of information sharing, yet it has also ushered in a double-edged sword: a surge in the spread of fake information. In this context, there is a growing need for research on the detection of 'cheapfakes,' which are low-cost fake media, known for their ease of creation. This paper proposes a multi-stage deep learning process designed to effectively detect the diverse and rapidly evolving nature of cheapfakes. A singlestep deep learning model faces limitations in distinguishing various types of cheapfakes, necessitating the application of a complex deep learning model approach to detect subtle Out-of-Context (OOC) phenomena. This study employs models based on Bidirectional Encoder Representations from Transformers (BERT) and stable diffusion technologies to approach cheapfake detection. Through the ACM ICMR 2024 challenge, the performance of this model was evaluated on a real dataset, achieving an accuracy of 71.9% in Task 1, an improvement of 7% over previous methods, and an accuracy of 55.7% in Task 2. These results are expected to make a significant contribution to the development of strategies for creating and countering cheapfakes. Additionally, this research aims to contribute to the detection of OOC media misuse through this challenge.
引用
收藏
页码:1312 / 1316
页数:5
相关论文
共 50 条
  • [41] CariesNet: a deep learning approach for segmentation of multi-stage caries lesion from oral panoramic X-ray image
    Haihua Zhu
    Zheng Cao
    Luya Lian
    Guanchen Ye
    Honghao Gao
    Jian Wu
    Neural Computing and Applications, 2023, 35 : 16051 - 16059
  • [42] CariesNet: a deep learning approach for segmentation of multi-stage caries lesion from oral panoramic X-ray image
    Zhu, Haihua
    Cao, Zheng
    Lian, Luya
    Ye, Guanchen
    Gao, Honghao
    Wu, Jian
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22): : 16051 - 16059
  • [43] Learning Text-image Joint Embedding for Efficient Cross-modal Retrieval with Deep Feature Engineering
    Xie, Zhongwei
    Liu, Ling
    Wu, Yanzhao
    Zhong, Luo
    Li, Lin
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (04)
  • [44] Deep Learning to Classify Difference Image for Image Change Detection
    Zhao, Jiaojiao
    Gong, Maoguo
    Liu, Jia
    Jiao, Licheng
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 397 - 403
  • [45] Contrastive completing learning for practical text-image person ReID: Robuster and cheaper
    Du, Guodong
    Gong, Tiantian
    Zhang, Liyan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [46] Multi-Stage Image Deblurring Using Long/Short Exposure Time Image Pair
    Lee, Dong-bok
    Song, Byung Cheol
    2013 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2013, : 78 - 79
  • [47] Multi-Stage Transfer Learning System with Lightweight Architectures in Medical Image Classification
    Godasu, Rajesh
    El-Gayar, Omar
    Sutrave, Kruttika
    AMCIS 2020 PROCEEDINGS, 2020,
  • [48] Multi-stage Features Sparse Learning Based Scene Recognition of Hyperspectral Image
    Ren Hui-feng
    Yan Feng
    Dong Qing-chao
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1365 - 1368
  • [49] Multi-Stage Feature Fusion Object Detection Method for Remote Sensing Image
    Chen L.
    Zhang F.
    Guo W.
    Huang Y.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (12): : 3520 - 3528
  • [50] Remotely sensed image processing with multi-stage inferences
    Yamamoto, H
    Homma, K
    Isobe, T
    Naka, M
    Matsumura, S
    Tameishi, H
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING V, 1999, 3871 : 337 - 342