Ultra-high-definition underwater image enhancement via dual-domain interactive transformer network

被引:0
|
作者
Li, Weiwei [1 ]
Cao, Feiyuan [2 ,3 ]
Wei, Yiwen [2 ,3 ]
Shi, Zhenghao [4 ]
Jia, Xiuyi [2 ,3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Guangxi, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[4] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
关键词
Dual-branch network; Feature interaction; Ultra-high-definition image; DESIGN;
D O I
10.1007/s13042-024-02379-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The proliferation of ultra-high-definition (UHD) imaging device is increasingly being used for underwater image acquisition. However, due to light scattering and underwater impurities, UHD underwater images often suffer from color deviations and edge blurriness. Many studies have attempted to enhance underwater images by integrating frequency domain and spatial domain information. Nonetheless, these approaches often interactively fuse dual-domain features only in the final fusion module, neglecting the complementary and guiding roles of frequency domain and spatial domain features. Additionally, the extraction of dual-domain features is independent of each other, which leads to the sharp advantages and disadvantages of the dual-domain features extracted by these methods. Consequently, these methods impose high demands on the feature fusion capabilities of the fusion module. But in order to handle UHD underwater images, the fusion modules in these methods often stack only a limited number of convolution and activation function operations. This limitation results in insufficient fusion capability, leading to defects in the restoration of edges and colors in the images. To address these issues, we develop a dual-domain interaction network for enhancing UHD underwater images. The network takes into account both frequency domain and spatial domain features to complement and guide each other's feature extraction patterns, and fully integrates the dual-domain features in the model to better recover image details and colors. Specifically, the network consists of a U-shaped structure, where each layer is composed of dual-domain interaction transformer blocks containing interactive multi-head attention and interactive simple gate feed-forward networks. The interactive multi-head attention captures local interaction features of frequency domain and spatial domain information using convolution operation, followed by multi-head attention operation to extract global information of the mixed features. The interactive simple gate feed-forward network further enhances the model's dual-domain interaction capability and cross-dimensional feature extraction ability, resulting in clearer edges and more realistic colors in the images. Experimental results demonstrate that the performance of our proposal in enhancing underwater images is significantly better than existing methods.
引用
收藏
页码:2093 / 2109
页数:17
相关论文
共 50 条
  • [1] Dual-domain feature aggregation transformer network for underwater image enhancement
    Li, Yufeng
    Zhao, Zitian
    Li, Rui
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (03)
  • [2] Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method
    Wang, Tao
    Zhang, Kaihao
    Shen, Tianrun
    Luo, Wenhan
    Stenger, Bjorn
    Lu, Tong
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2654 - 2662
  • [3] Learning Non-Uniform-Sampling for Ultra-High-Definition Image Enhancement
    Yu, Wei
    Zhu, Qi
    Zheng, Naishan
    Huang, Jie
    Zhou, Man
    Zhao, Feng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 1412 - 1421
  • [4] Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning
    Zheng, Zhuoran
    Ren, Wenqi
    Cao, Xiaochun
    Wang, Tao
    Jia, Xiuyi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4429 - 4438
  • [5] DDPTransformer: Dual-Domain With Parallel Transformer Network for Sparse View CT Image Reconstruction
    Li, Runrui
    Li, Qing
    Wang, Hexi
    Li, Saize
    Zhao, Juanjuan
    Yan, Qiang
    Wang, Long
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2022, 8 : 1101 - 1116
  • [6] Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning
    Zheng, Zhuoran
    Ren, Wenqi
    Cao, Xiaochun
    Hu, Xiaobin
    Wang, Tao
    Song, Fenglong
    Jia, Xiuyi
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16180 - 16189
  • [7] Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance
    Qu, Jingxiang
    Liu, Ryan Wen
    Gao, Yuan
    Guo, Yu
    Zhu, Fenghua
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 9550 - 9562
  • [8] Benchmarking Ultra-High-Definition Image Super-resolution
    Zhang, Kaihao
    Li, Dongxu
    Luo, Wenhan
    Ren, Wenqi
    Stenger, Bjorn
    Liu, Wei
    Li, Hongdong
    Yang, Ming-Hsuan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14749 - 14758
  • [9] FSformer: fusing frequency and spatial domain transformer network for underwater image enhancement
    Liu, Dalang
    Rao, Yunbo
    Zhu, Jialong
    Ma, Yanjin
    Li, Jie
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [10] Deep dual-domain semi-blind network for compressed image quality enhancement
    He, Jingbo
    He, Xiaohai
    Zhang, Mozhi
    Xiong, Shuhua
    Chen, Honggang
    KNOWLEDGE-BASED SYSTEMS, 2022, 238