WFormer: A Transformer-Based Soft Fusion Model for Robust Image Watermarking

Cited by: 0
Authors
Luo T. [1 ]
Wu J. [2 ]
He Z. [1 ]
Xu H. [2 ]
Jiang G. [2 ]
Chang C. [3 ]
Affiliations
[1] College of Science and Technology, Ningbo University, Ningbo
[2] Faculty of Information Science and Engineering, Ningbo University, Ningbo
[3] Department of Information Engineering and Computer Science, Feng Chia University, Taichung
Keywords
Convolution; cross-attention; Decoding; Feature extraction; Noise; Robustness; soft fusion; Transformers; Watermarking
DOI
10.1109/TETCI.2024.3386916
Abstract
Most deep neural network (DNN) based image watermarking models employ an encoder-noise-decoder structure, in which the watermark is simply duplicated for expansion and then directly fused with image features to produce the encoded image. However, simple duplication makes the watermark over-redundant, and the cover image and the watermark, which lie in different domains, do not communicate during image feature extraction and direct fusion, which degrades watermarking performance. To address these drawbacks, this paper proposes a Transformer-based soft fusion model for robust image watermarking, named WFormer. Specifically, to expand the watermark effectively, a watermark preprocess module (WPM) uses Transformers to extract valid, expanded watermark features by computing the watermark's self-attention. Then, in place of direct fusion, a soft fusion module (SFM) integrates Transformers into the fusion of image and watermark by mining their long-range correlations: self-attention extracts the latent features of each, while cross-attention is learned to bridge the gap between them so that the watermark is embedded effectively. In addition, a feature enhancement module (FEM) builds communication between the cover image and the watermark by capturing their cross-feature dependencies, tuning image features in accordance with watermark features for better fusion. Experimental results show that the proposed WFormer outperforms existing state-of-the-art watermarking models in terms of invisibility, robustness, and embedding capacity. Furthermore, ablation results demonstrate the effectiveness of the WPM, the FEM, and the SFM. IEEE
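The self-attention and cross-attention steps named in the abstract can be illustrated with a minimal NumPy sketch. This is not the WFormer implementation: the token counts, feature width, random inputs, the absence of learned projection matrices, and the residual-sum fusion are all assumptions made only to show how image tokens can query watermark tokens.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    """Scaled dot-product attention; returns (output, attention weights)."""
    d = q.shape[-1]
    weights = softmax(q @ k.T / np.sqrt(d), axis=-1)
    return weights @ v, weights

rng = np.random.default_rng(0)
n_img, n_wm, d = 16, 8, 32              # hypothetical token counts and width
img = rng.standard_normal((n_img, d))   # stand-in for image feature tokens
wm = rng.standard_normal((n_wm, d))     # stand-in for watermark feature tokens

# Self-attention: each modality attends to its own tokens.
img_self, _ = attention(img, img, img)
wm_self, _ = attention(wm, wm, wm)

# Cross-attention: image tokens (queries) attend to watermark tokens
# (keys and values), so each image position receives a weighted mix
# of watermark features instead of a duplicated copy.
cross, cross_weights = attention(img_self, wm_self, wm_self)

# Assumed fusion rule for illustration: residual sum of both branches.
fused = img_self + cross
```

Each row of `cross_weights` is a distribution over watermark tokens, which is what makes the fusion "soft": every image position draws on all watermark features with learned (here, untrained) weights rather than receiving a fixed tiled copy.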
Pages: 1 / 18
Number of pages: 17