Ultra-high-definition underwater image enhancement via dual-domain interactive transformer network

Authors
Li, Weiwei [1]
Cao, Feiyuan [2, 3]
Wei, Yiwen [2, 3]
Shi, Zhenghao [4]
Jia, Xiuyi [2, 3]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Guangxi, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[4] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
Keywords
Dual-branch network; Feature interaction; Ultra-high-definition image; DESIGN;
DOI
10.1007/s13042-024-02379-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Ultra-high-definition (UHD) imaging devices are increasingly used for underwater image acquisition. However, due to light scattering and underwater impurities, UHD underwater images often suffer from color deviations and blurred edges. Many studies have attempted to enhance underwater images by integrating frequency-domain and spatial-domain information. Nonetheless, these approaches usually let the dual-domain features interact only in the final fusion module, neglecting the complementary and guiding roles that frequency-domain and spatial-domain features can play for each other. In addition, the two sets of features are extracted independently, so each carries pronounced strengths and weaknesses of its own domain. Consequently, these methods place high demands on the fusion module. Yet, to keep UHD inputs tractable, their fusion modules typically stack only a small number of convolution and activation operations. This limits fusion capability and leads to defects in the restored edges and colors. To address these issues, we develop a dual-domain interaction network for enhancing UHD underwater images. The network lets frequency-domain and spatial-domain features complement and guide each other during feature extraction, and fuses the dual-domain features throughout the model to better recover image details and colors. Specifically, the network has a U-shaped structure in which each layer is composed of dual-domain interaction transformer blocks containing interactive multi-head attention and an interactive simple gate feed-forward network. The interactive multi-head attention first captures local interaction features of the frequency-domain and spatial-domain information with a convolution operation, then applies multi-head attention to extract global information from the mixed features. The interactive simple gate feed-forward network further strengthens the model's dual-domain interaction and cross-dimensional feature extraction, yielding clearer edges and more realistic colors. Experimental results demonstrate that the proposed method performs significantly better than existing methods in enhancing underwater images.
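The abstract does not give the block's equations, so the following is only a toy illustration of the general idea of letting one domain's features gate the other's. It is not the authors' implementation. Everything in it is an assumption made for illustration: the 1-D signal in place of an image, the hypothetical `spatial_branch` (a 3-tap local average), the hypothetical `frequency_branch` (a low-pass filter applied in the Fourier domain), and the reading of "simple gate" as an element-wise product of two feature sets.

```python
import cmath
import math


def dft(signal):
    """Naive discrete Fourier transform of a 1-D sequence."""
    n = len(signal)
    return [
        sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Inverse DFT, returning the real part of each sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def frequency_branch(signal, keep):
    """Hypothetical frequency branch: keep only bins within `keep` of DC."""
    spectrum = dft(signal)
    n = len(spectrum)
    filtered = [c if min(k, n - k) <= keep else 0 for k, c in enumerate(spectrum)]
    return idft(filtered)


def spatial_branch(signal):
    """Hypothetical spatial branch: 3-tap local average with edge replication."""
    n = len(signal)
    return [
        (signal[max(i - 1, 0)] + signal[i] + signal[min(i + 1, n - 1)]) / 3
        for i in range(n)
    ]


def simple_gate(a, b):
    """Assumed simple gate: element-wise product, so one branch modulates the other."""
    return [x * y for x, y in zip(a, b)]


def dual_domain_interaction(signal, keep=1):
    """Gate spatial features with frequency features, then add a residual."""
    spatial = spatial_branch(signal)
    freq = frequency_branch(signal, keep)
    gated = simple_gate(spatial, freq)
    return [s + g for s, g in zip(signal, gated)]
```

In a real network the two branches would be learned layers operating on 2-D feature maps, and the gate would sit inside a feed-forward block; the sketch only shows how a multiplicative gate makes each domain's output depend on the other.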
Pages: 2093-2109
Page count: 17
Related papers
50 in total
  • [21] Transformer-Based Dual-Domain Network for Few-View Dedicated Cardiac SPECT Image Reconstructions
    Xie, Huidong
    Zhou, Bo
    Chen, Xiongchao
    Guo, Xueqi
    Thorn, Stephanie
    Liu, Yi-Hwa
    Wang, Ge
    Sinusas, Albert
    Liu, Chi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT X, 2023, 14229 : 163 - 172
  • [22] Dual-Domain Fusion Convolutional Neural Network for Contrast Enhancement Forensics
    Yang, Pengpeng
    ENTROPY, 2021, 23 (10)
  • [23] A learnable full-frequency transformer dual generative adversarial network for underwater image enhancement
    Zheng, Shijian
    Wang, Rujing
    Zheng, Shitao
    Wang, Liusan
    Liu, Zhigui
    FRONTIERS IN MARINE SCIENCE, 2024, 11
  • [24] FSNet: A dual-domain network for few-shot image classification
    Yan, Xuewen
    Huang, Zhangjin
    DISPLAYS, 2024, 84
  • [25] DDSNet: a dual-domain supervised network for remote sensing image dehazing
    Chen, Xinyi
    Liu, Zhenqi
    Huo, Tianxiang
    Duan, Shukai
    Wang, Lidan
    PHYSICA SCRIPTA, 2025, 100 (02)
  • [26] An illumination-guided dual-domain network for image exposure correction
    Yang, Jie
    Zhang, Yuantong
    Chen, Zhenzhong
    Yang, Daiqin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [27] A dual-domain deep network for high pitch CT reconstruction
    Wang, Wei
    Xia, Xiang-Gen
    He, Chuanjiang
    PATTERN RECOGNITION, 2025, 161
  • [28] Single image denoising based on adaptive fusion dual-domain network
    Jiang, Zhiwei
    Xue, Zhizhong
    Wang, Jue
    Hu, Yibiao
    Zheng, Qiufu
    IET IMAGE PROCESSING, 2024, 18 (03) : 561 - 571
  • [29] Dual-domain sampling and feature-domain optimization network for image compressive sensing
    Xiang, Xinxin
    Tong, Fenghua
    Zhao, Dawei
    Li, Xin
    Yang, Shumian
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [30] An effective transformer based on dual attention fusion for underwater image enhancement
    Hu, Xianjie
    Liu, Jing
    Li, Heng
    Liu, Hui
    Xue, Xiaojun
    PEERJ COMPUTER SCIENCE, 2024, 10