UNFOLD: 3-D U-Net, 3-D CNN, and 3-D Transformer-Based Hyperspectral Image Denoising

被引:5
|
作者
Dixit, Aditya [1 ]
Gupta, Anup Kumar [1 ]
Gupta, Puneet [1 ]
Srivastava, Saurabh [2 ]
Garg, Ankur [3 ]
机构
[1] IIT Indore, Dept Comp Sci & Engn, Indore 453552, India
[2] IIT Dhanbad, Dept Comp Sci & Engn, Dhanbad 826004, India
[3] Space Applicat Ctr, Ahmadabad 380015, India
关键词
3-D convolutional neural networks (CNNs); 3-D transformers; 3-D U-Net; hyperspectral imaging denoising; spatial spectral fusion; CLASSIFICATION; RESTORATION; FILTER;
D O I
10.1109/TGRS.2023.3328922
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Hyperspectral images (HSIs) encompass data across numerous spectral bands, making them valuable in various practical fields such as remote sensing, agriculture, and marine monitoring. Unfortunately, inevitable noise introduction during sensing restricts their applicability, necessitating denoising for optimal utilization. The existing deep learning (DL)-based denoising methods suffer from various limitations. For instance, convolutional neural networks (CNNs) struggle with long-range dependencies, while vision transformers (ViTs) struggle to capture local details. This article introduces a novel method, UNFOLD, that addresses these inherent limitations by harmoniously integrating the strengths of 3-D U-Net, 3-D CNN, and 3-D Transformer architectures. Unlike several existing methods that predominantly capture dependencies either along the spatial or the spectral dimension, UNFOLD addresses HSI denoising as a 3-D task, synergizing spatial and spectral information through the utilization of 3-D Transformer and 3-D CNN. It employs the self-attention (SA) mechanism of Transformers to capture the global dependencies and model long-range relationships across spatial and spectral dimensions. To overcome the limitations of 3-D Transformer in capturing fine-grained local and spatial features, UNFOLD complements it by incorporating 3-D CNN. Moreover, UNFOLD utilize a modified form of 3-D U-Net architecture for HSI denoising, wherein it employs a 3-D Transformer-based encoder instead of the conventional 3-D CNN-based encoder. It further capitalizes on the property of U-Net to integrate features across various scales, thereby enhancing efficacy by preserving intricate structural details. Results from extensive experiments demonstrate that UNFOLD outperforms the state-of-the-art HSI denoising methods.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] IMAGE ENHANCEMENT OF 3-D SAR VIA U-NET FRAMEWORK
    Shen, Rong
    Wei, Shunjun
    Zhou, Zichen
    Liang, Jiadian
    Zhang, Xiaoling
    Shi, Jun
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 947 - 950
  • [2] Hyperspectral Image Denoising Using a 3-D Attention Denoising Network
    Shi, Qian
    Tang, Xiaopei
    Yang, Taoru
    Liu, Rong
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (12): : 10348 - 10363
  • [3] 3-D or not 3-D
    Adam Powell
    JOM, 2002, 54 : 22 - 24
  • [4] 3-D OR NOT 3-D
    SMITH, CW
    NEW SCIENTIST, 1984, 102 (1407) : 40 - 44
  • [5] 3-D OR NOT 3-D
    KERBEL, M
    FILM COMMENT, 1980, 16 (06) : 11 - 20
  • [6] 3-D OR NOT 3-D
    Kehr, Dave
    FILM COMMENT, 2010, 46 (01) : 60 - 67
  • [7] 3-D or not 3-D
    Powell, A
    JOM-JOURNAL OF THE MINERALS METALS & MATERIALS SOCIETY, 2002, 54 (01): : 22 - 24
  • [8] A Fast and Compact 3-D CNN for Hyperspectral Image Classification
    Ahmad, Muhammad
    Khan, Adil Mehmood
    Mazzara, Manuel
    Distefano, Salvatore
    Ali, Mohsin
    Sarfraz, Muhammad Shahzad
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [9] Efficient scheme to perform semantic segmentation on 3-D brain tumor using 3-D u-net architecture
    Zeeshan Shaukat
    Qurratul Ain Farooq
    Chuangbai Xiao
    Saqib Ali
    Faheem Akhtar
    Muhammad Azeem
    Abdul Ahad Zulfiqar
    Multimedia Tools and Applications, 2024, 83 : 25121 - 25134
  • [10] Efficient scheme to perform semantic segmentation on 3-D brain tumor using 3-D u-net architecture
    Shaukat, Zeeshan
    Farooq, Qurratul Ain
    Xiao, Chuangbai
    Ali, Saqib
    Akhtar, Faheem
    Azeem, Muhammad
    Zulfiqar, Abdul Ahad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 25121 - 25134