Cformer: An underwater image enhancement hybrid network combining convolution and transformer

被引:4
|
作者
Deng, Ruhui [1 ]
Zhao, Lei [1 ]
Li, Heng [1 ]
Liu, Hui [1 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automation, 727 Jingming South Rd, Kunming, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
image enhancement; image processing; MODEL;
D O I
10.1049/ipr2.12901
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Underwater images are the most direct and effective ways to obtain underwater information. However, underwater images typically suffer from contrast reduction and colour distortion due to the absorption and scattering of water by light, which seriously limits the further development of underwater visual tasks. Recently, the convolutional neural network has been extensively applied in underwater image enhancement for its powerful local information extraction capabilities, but due to the locality of convolution operation, it cannot capture the global context well. Although the recently emerging Transformer can capture global context, it cannot model local correlations. Cformer is proposed, which is an Unet-like hybrid network structure. First, a Depth Self-Calibrated block is proposed to extract the local features of the image effectively. Second, a novel Cross-Shaped Enhanced Window Transformer block is proposed. It captures long-range pixel interactions while dramatically reducing the computational complexity of feature maps. Finally, the depth self-calibrated block and the cross-shaped enhanced window Transformer block are ingeniously fused to build a global-local Transformer module. Extensive ablation studies are performed on public underwater datasets to demonstrate the effectiveness of individual components in the network. The qualitative and quantitative comparisons indicate that Cformer achieves superior performance compared to other competitive models.
引用
收藏
页码:3841 / 3855
页数:15
相关论文
共 50 条
  • [21] Window-based transformer generative adversarial network for autonomous underwater image enhancement
    Ummar, Mehnaz
    Dharejo, Fayaz Ali
    Alawode, Basit
    Mahbub, Taslim
    Piran, Md. Jalil
    Javed, Sajid
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [22] CEWformer: A Transformer-Based Collaborative Network for Simultaneous Underwater Image Enhancement and Watermarking
    Wu, Jun
    Luo, Ting
    He, Zhouyan
    Song, Yang
    Xu, Haiyong
    Li, Li
    IEEE JOURNAL OF OCEANIC ENGINEERING, 2024, 49 (01) : 30 - 47
  • [23] Dual branch Transformer-CNN parametric filtering network for underwater image enhancement
    Chang, Baocai
    Li, Jinjiang
    Ren, Lu
    Chen, Zheng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [24] A fusion framework with multi-scale convolution and triple-branch cascaded transformer for underwater image enhancement
    Xiang, Dan
    Zhou, Zebin
    Yang, Wenlei
    Wang, Huihua
    Gao, Pan
    Xiao, Mingming
    Zhang, Jinwen
    Zhu, Xing
    OPTICS AND LASERS IN ENGINEERING, 2025, 184
  • [25] FCNN: fusion-based underwater image enhancement using multilayer convolution neural network
    Verma, Gunjan
    Kumar, Manoj
    Raikwar, Suresh
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [26] GFRENet: An Efficient Network for Underwater Image Enhancement with Gated Linear Units and Fast Fourier Convolution
    Zhang, Bingxian
    Fang, Jiahao
    Li, Yujie
    Wang, Yue
    Zhou, Qinglong
    Wang, Xing
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (07)
  • [27] Underwater image enhancement using lightweight vision transformer
    Daud, Muneeba
    Afzal, Hammad
    Mahmood, Khawir
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 75603 - 75625
  • [28] U-Shape Transformer for Underwater Image Enhancement
    Peng, Lintao
    Zhu, Chunli
    Bian, Liheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3066 - 3079
  • [29] Autonomous underwater robot for underwater image enhancement via multi-scale deformable convolution network with attention mechanism
    Lin, Yi
    Zhou, Jingchun
    Ren, Wenqi
    Zhang, Weishi
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 191
  • [30] A hybrid network integrating convolution and transformer for thymoma segmentation
    Li, Jingyuan
    Sun, Wenfang
    Feng, Xiulong
    von Deneen, Karen M.
    Wang, Wen
    Cui, Guangbin
    Zhang, Yi
    INTELLIGENT MEDICINE, 2023, 3 (03): : 164 - 172