U-TransCNN: A U-shape transformer-CNN fusion model for underwater image enhancement☆

被引:0
|
作者
Yao, Haiyang [1 ]
Guo, Ruige [1 ]
Zhao, Zhongda [4 ]
Zang, Yuzhang [2 ]
Zhao, Xiaobo [3 ]
Lei, Tao [1 ]
Wang, Haiyan [1 ,4 ]
机构
[1] Shaanxi Univ Sci & Technol, Sch Elect Informat & Artificial Intelligence, Xian 710016, Peoples R China
[2] Western Washington Univ, Engn & Design Dept, Bellingham, WA USA
[3] Aarhus Univ, Dept Elect & Comp Engn, DK-8200 Aarhus, Denmark
[4] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
关键词
Underwater image enhancement; Feature fusion; Transformer; CNN;
D O I
10.1016/j.displa.2025.103047
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Underwater imaging faces significant challenges due to nonuniform optical absorption and scattering, resulting in visual quality issues like color distortion, contrast reduction, and image blurring. These factors hinder the accurate capture and clear depiction of underwater imagery. To address these complexities, we propose UTransCNN, a U-shape Transformer- Convolutional Neural Networks (CNN) model, designed to enhance underwater images by integrating the strengths of CNNs and Transformers. The core of U-TransCNN is the GlobalDetail Feature Synchronization Fusion Module. This innovative component enhances global color and contrast while meticulously preserving the intricate texture details, ensuring that both macroscopic and microscopic aspects of the image are enhanced in unison. Then we design the Multiscale Detail Fusion Block to aggregate a richer spectrum of feature information using a variety of convolution kernels. Furthermore, our optimization strategy is augmented with a joint loss function, adynamic approach allowing the model to assign varying weights to the loss associated with different pixel points, depending on their loss magnitude. Six experiments (including reference and non-reference) on three public underwater datasets confirm that U-TransCNN comprehensively surpasses other contemporary state-of-the-art deep learning algorithms, demonstrating marked improvement in visualization quality and quantization parameters of underwater images. Our code is available at https://github.com/GuoRuige/UTransCNN.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Image enhancement with art design: a visual feature approach with a CNN-transformer fusion model
    Xu, Ming
    Cui, Jinwei
    Ma, Xiaoyu
    Zou, Zhiyi
    Xin, Zhisheng
    Bilal, Muhammad
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [22] Designing a U-Net Architecture for Underwater Image Enhancement
    Zaidi, Saba
    Singh, Pranjali
    Guha, Prithwijit
    2024 NATIONAL CONFERENCE ON COMMUNICATIONS, NCC, 2024,
  • [23] Image-based scatter correction for cone-beam CT using flip swin transformer U-shape network
    Zhang, Xueren
    Jiang, Yangkang
    Luo, Chen
    Li, Dengwang
    Niu, Tianye
    Yu, Gang
    MEDICAL PHYSICS, 2023, 50 (08) : 5002 - 5019
  • [24] Hierarchical MVSNet with cost volume separation and fusion based on U-shape feature extraction
    Liu, Wanjun
    Wang, Junkai
    Qu, Haicheng
    Shen, Lei
    MULTIMEDIA SYSTEMS, 2023, 29 (01) : 377 - 387
  • [25] An effective transformer based on dual attention fusion for underwater image enhancement
    Hu, Xianjie
    Liu, Jing
    Li, Heng
    Liu, Hui
    Xue, Xiaojun
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [26] An effective transformer based on dual attention fusion for underwater image enhancement
    Hu X.
    Liu J.
    Li H.
    Liu H.
    Xue X.
    PeerJ Computer Science, 2024, 10
  • [27] Hierarchical MVSNet with cost volume separation and fusion based on U-shape feature extraction
    Wanjun Liu
    Junkai Wang
    Haicheng Qu
    Lei Shen
    Multimedia Systems, 2023, 29 : 377 - 387
  • [28] UATNet: U-Shape Attention-Based Transformer Net for Meteorological Satellite Cloud Recognition
    Wang, Zhanjie
    Zhao, Jianghua
    Zhang, Ran
    Li, Zheng
    Lin, Qinghui
    Wang, Xuezhi
    REMOTE SENSING, 2022, 14 (01)
  • [29] Explaining U-shape of the referral hiring pattern in a search model with heterogeneous workers
    Stupnytska, Yuliia
    Zaharieva, Anna
    JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2015, 119 : 211 - 233
  • [30] Adaptive scale based U-shape transformer network for ischemic stroke lesion segmentation in CTP images
    Zhang, Huiling
    Zhang, Wencong
    Chen, Yingjia
    Xu, Zibi
    Ma, Xiangyuan
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705