MFormer: Taming Masked Transformer for Unsupervised Spectral Reconstruction

被引:9
|
作者
Li, Jiaojiao [1 ,2 ]
Leng, Yihong [1 ]
Song, Rui [1 ]
Liu, Wei [3 ]
Li, Yunsong [1 ]
Du, Qian [4 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[2] Chinese Acad Sci, Key Lab Spectral Imaging Technol, Xian 710119, Peoples R China
[3] State Key Lab Geoinformat Engn, Xian 710054, Peoples R China
[4] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Transformers; Image reconstruction; Task analysis; Spatial resolution; Hyperspectral imaging; Imaging; Feature extraction; Band masking; dual multihead self-attention (MSA); spectral reconstruction (SR); spectral structural similarity; transformer; unsupervised learning; DEEP FOREST; CNN;
D O I
10.1109/TGRS.2023.3264976
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Spectral reconstruction (SR) aims to recover the hyperspectral images (HSIs) from the corresponding RGB images directly. Most SR studies based on supervised learning require massive data annotations to achieve superior reconstruction performance, which is limited by complicated imaging techniques and laborious annotation calibration in practice. Thus, unsupervised strategies attract the attention of the community, however, existing unsupervised SR works still face a fatal bottleneck from low accuracy. Besides, traditional convolutional neural network (CNN)-based models are good at capturing local features but experience difficulty in global features. To ameliorate these drawbacks, we propose an unsupervised SR architecture with strong constraints, especially constructing a novel Masked Transformer (MFormer) to excavate latent hyperspectral characteristics to restore realistic HSIs further. Concretely, a dual spectralwise multihead self-attention (DSSA) mechanism embedded in transformer is proposed to firmly associate multihead and channel dimensions and then capture the spectral representation in the implicit solution spaces. Furthermore, a plug-and-play mask-guided band augment (MBA) module is presented to extract and further enhance the bandwise correlation and continuity to boost the robustness of the model. Innovatively, a customized loss based on the intrinsic mapping from HSIs to RGB images and the inherent spectral structural similarity is designed to restrain spectral distortion. Extensive experimental results on three benchmarks verify that our MFormer achieves superior performance over other state-of-the-art (SOTA) supervised and unsupervised methods under a no-label training process equally.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Supervised-unsupervised combined transformer for spectral compressive imaging reconstruction
    Zhou, Han
    Lian, Yusheng
    Li, Jin
    Liu, Zilong
    Cao, Xuheng
    Ma, Chao
    [J]. OPTICS AND LASERS IN ENGINEERING, 2024, 175
  • [2] Unsupervised relational inference using masked reconstruction
    Gerrit Großmann
    Julian Zimmerlin
    Michael Backenköhler
    Verena Wolf
    [J]. Applied Network Science, 8
  • [3] Unsupervised relational inference using masked reconstruction
    Grossmann, Gerrit
    Zimmerlin, Julian
    Backenkoehler, Michael
    Wolf, Verena
    [J]. APPLIED NETWORK SCIENCE, 2023, 8 (01)
  • [4] Masked transformer through knowledge distillation for unsupervised text style transfer
    Scalercio, Arthur
    Paes, Aline
    [J]. NATURAL LANGUAGE ENGINEERING, 2023,
  • [5] UNSUPERVISED PRE-TRAINING OF BIDIRECTIONAL SPEECH ENCODERS VIA MASKED RECONSTRUCTION
    Wang, Weiran
    Tang, Qingming
    Livescu, Karen
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6889 - 6893
  • [6] Masked Face Transformer
    Zhao, Weisong
    Zhu, Xiangyu
    Guo, Kaiwen
    Shi, Haichao
    Zhang, Xiao-Yu
    Lei, Zhen
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 265 - 279
  • [7] Masked Spiking Transformer
    Wang, Ziqing
    Fang, Yuetong
    Cao, Jiahang
    Zhang, Qiang
    Wang, Zhongrui
    Xu, Renjing
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1761 - 1771
  • [8] Unmixing-Guided Convolutional Transformer for Spectral Reconstruction
    Duan, Shiyao
    Li, Jiaojiao
    Song, Rui
    Li, Yunsong
    Du, Qian
    [J]. REMOTE SENSING, 2023, 15 (10)
  • [9] Residual Mask in Cascaded Convolutional Transformer for Spectral Reconstruction
    Li, Jiaojiao
    Duan, Shiyao
    Leng, Yihong
    Song, Rui
    Li, Yunsong
    Du, Qian
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [10] Robust unsupervised network intrusion detection with self-supervised masked context reconstruction
    Wang, Wei
    Jian, Songlei
    Tan, Yusong
    Wu, Qingbo
    Huang, Chenlin
    [J]. COMPUTERS & SECURITY, 2023, 128