Transfer Representation Learning Meets Multimodal Fusion Classification for Remote Sensing Images

被引:11
|
作者
Ma, Mengru [1 ]
Ma, Wenping [1 ]
Jiao, Licheng [1 ]
Liu, Xu [1 ]
Liu, Fang [1 ]
Li, Lingling [1 ]
Yang, Shuyuan [1 ]
Hou, Biao [1 ]
机构
[1] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ China,Sch Artificial Intelligence, Int Res Ctr Intelligent Percept & Computat, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Spatial resolution; Sensors; Semantics; Representation learning; Deep learning; Convolutional neural networks; Attention sparse transfer (AST); deep learning; dual-scale decomposition; feature representation; multiresolution classification; remote sensing; PANSHARPENING METHODS; NETWORK; PAN;
D O I
10.1109/TGRS.2022.3215177
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
To maximize the complementary advantages of synergistic multimodal, a transfer representation learning fusion network (TRLF-Net) is proposed for multisource remote sensing images collaborative classification in this article. First, with respect to the feature encoding, we design a dual-branch attention sparse transfer module (DAST-Module), which combines the spatial and channel attention (CA) masks to migrate the advantage attributes of the panchromatic (PAN) and the MS images mutually. This not only enhances their respective image advantages but also facilitates the sparse fusion of low-level features. Second, for the separation of multiscale information, a deep dual-scale decomposition module (DDSD-Module) is designed, which allows the decompose of high-frequency and low-frequency components. Then it uses the decomposed information to make the essential difference as small as possible, and the surrounding contour difference is as large as possible of the complementary multimodal image through the design of the loss function. Finally, to address the problem of large intraclass and small interclass differences, we develop a representation fusion of the global and local features' module (RFGAL-Module). It mainly adopts global features to sort local features within classes, and then outputs them in a cascade. Thus, the characterization ability of features is improved, and the global and local features are used in a coordinated manner to accomplish the sample classification tasks. In particular, the experimental results demonstrate that TRLF-Net can obtain much improved accuracy and efficiency. The code is accessible in: https://github.com/ru-willow/SRLF-Net.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multisource Joint Representation Learning Fusion Classification for Remote Sensing Images
    Geng, Xueli
    Jiao, Licheng
    Li, Lingling
    Liu, Fang
    Liu, Xu
    Yang, Shuyuan
    Zhang, Xiangrong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [2] Domain Adaptation Using Representation Learning for the Classification of Remote Sensing Images
    Elshamli, Ahmed
    Taylor, Graham W.
    Berg, Aaron
    Areibi, Shawki
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (09) : 4198 - 4209
  • [3] Multimodal Fusion Transformer for Remote Sensing Image Classification
    Roy, Swalpa Kumar
    Deria, Ankur
    Hong, Danfeng
    Rasti, Behnood
    Plaza, Antonio
    Chanussot, Jocelyn
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [4] A Multimodal Unified Representation Learning Framework with Masked Image Modeling for Remote Sensing Images
    Du, Dakuan
    Liu, Tianzhu
    Gu, Yanfeng
    [J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [5] Fusion of Deep Learning Models for Improving Classification Accuracy of Remote Sensing Images
    Deepan, P.
    Sudha, L. R.
    [J]. JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (05): : 189 - 201
  • [6] ADDRESSING RELIABILITY OF MULTIMODAL REMOTE SENSING TO ENHANCE MULTISENSOR DATA FUSION AND TRANSFER LEARNING
    Marinoni, Andrea
    Chlaily, Saloua
    Jutten, Christian
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 3896 - 3899
  • [7] More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification
    Hong, Danfeng
    Gao, Lianru
    Yokoya, Naoto
    Yao, Jing
    Chanussot, Jocelyn
    Du, Qian
    Zhang, Bing
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05): : 4340 - 4354
  • [8] Decision fusion for the classification of urban remote sensing images
    Fauvel, Mathieu
    Chanussot, Jocelyn
    Benediktsson, Jon Atli
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (10): : 2828 - 2838
  • [9] ACTIVE LEARNING FOR CLASSIFICATION OF REMOTE SENSING IMAGES
    Bruzzone, Lorenzo
    Persello, Claudio
    [J]. 2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 1995 - 1998
  • [10] Multimodal Classification of Remote Sensing Images: A Review and Future Directions
    Gomez-Chova, Luis
    Tuia, Devis
    Moser, Gabriele
    Camps-Valls, Gustau
    [J]. PROCEEDINGS OF THE IEEE, 2015, 103 (09) : 1560 - 1584