DPMFformer: an underwater image enhancement network based on deep pooling and multi-scale fusion transformer

被引:0
|
作者
Xiang, Dan [1 ,2 ]
Yang, Wenlei [2 ]
Zhou, Zebin [2 ]
Zhang, Jinwen [4 ]
Li, Jianxin [5 ]
Ouyang, Jian [3 ]
Ling, Jing [1 ]
机构
[1] Guangzhou Maritime Univ, Dept Informat & Commun Engn, Guangzhou, Guangdong, Peoples R China
[2] Guangdong Polytech Normal Univ, Sch Elect & Informat, Guangzhou, Guangdong, Peoples R China
[3] Guangdong Polytech Normal Univ, Guangdong Ind Training Ctr, Guangzhou, Guangdong, Peoples R China
[4] Guangdong Prov Key Lab Green Construct & Intellige, Guangzhou, Guangdong, Peoples R China
[5] Guangzhou Maritime Univ, Sch Intelligent Transportat & Engn, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Underwater image enhancement; Transformer; Multi-scale fusion; Deep pooling; COLOR; CONTRAST;
D O I
10.1007/s12145-024-01573-3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Due to light absorption and scattering, underwater images often suffer from color distortion, low contrast, and blurred details, seriously affects the effectiveness of advanced computer vision tasks. To address these degradation issues, this paper proposes an innovative underwater image enhancement algorithm, Deep Pooling and Multi-Scale Fusion Transformer (DPMFformer). The algorithm is composed of four key modules: the Dual-Balanced Multiscale Fusion Module (DBMF), the Deep Pooling Self-Attention Transformer (DPST), the Wavelet Sampling (WS), and the Global Spatial Feature Self-Attention Transformer (GSFAT). The DBMF module employs trainable color modules to simulate the grey-scale world theory, achieving inter-channel color balance. The DPST module enhances the network's ability to extract information from feature regions through a deep-pooling layer and spatial attention mechanism. The WS module utilizes Harr wavelet sampling instead of conventional up- and down-sampling, preserving low-frequency information while improving the up-sampling outcome. The GSFAT module combines Swin Transformer (SwinT) and Position Embedding Cascading Transformer (PCET), enhancing the extraction of global information through position embedding and a sliding window self-attention mechanism, thereby improving the attention on the degraded regions of the image. Experimental results show that the proposed DPMFfomer is superior to existing underwater image enhancement methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Transformer-based Multi-scale Underwater Image Enhancement Network
    Yang, Ai-Ping
    Fang, Si-Jie
    Shao, Ming-Fu
    Zhang, Teng-Fei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (12): : 1696 - 1705
  • [2] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu Y.
    Liu M.
    Lin S.
    Tao Z.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [3] Underwater image enhancement based on color balance and multi-scale fusion
    Hu Z.
    Chen Q.
    Zhu D.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (17): : 2133 - 2146
  • [4] Underwater Image Enhancement Based on Color Balance and Multi-Scale Fusion
    Chen, Qi
    Zhang, Ze
    Li, Gelun
    IEEE PHOTONICS JOURNAL, 2022, 14 (06):
  • [5] Underwater image enhancement based on color correction and multi-scale fusion
    Tao, Yang
    Wu, Ping
    Liu, Yuting
    Fang, Wenjun
    Zhou, Liqun
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (08) : 1046 - 1056
  • [6] MSFFT-Net: A multi-scale feature fusion transformer network for underwater enhancement
    Wu, Zeju
    Chen, Kaiming
    Ji, Panxin
    Zhao, Haoran
    Sun, Xin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2025, 107
  • [7] Multi-scale Underwater Image Enhancement Network Based on Attention Mechanism
    Fang Ming
    Liu Xiaohan
    Fu Feiran
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) : 3513 - 3521
  • [8] Underwater Image Enhancement Method Based on Multi-scale Cascade Network
    Mi Zetian
    Jin Jie
    Li Yuanyuan
    Ding Xueyan
    Liang Zheng
    Fu Xianping
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (10) : 3353 - 3362
  • [9] An underwater image enhancement method based on multi-scale layer decomposition and fusion
    Yang, Jie
    Wang, Jun
    SIGNAL PROCESSING, 2025, 227
  • [10] Underwater image enhancement based on adaptive color correction and multi-scale fusion
    Jinyu Shi
    Shanshan Yu
    Huanan Li
    Xiuguo Zhang
    Changxin Liu
    Multimedia Tools and Applications, 2024, 83 : 12535 - 12559