Tuple Perturbation-Based Contrastive Learning Framework for Multimodal Remote Sensing Image Semantic Segmentation

被引:0
|
作者
Ye, Yuanxin [1 ,2 ]
Dai, Jinkun [1 ,2 ]
Zhou, Liang [1 ,2 ]
Duan, Keyi [1 ,2 ]
Tao, Ran [3 ]
Li, Wei [3 ]
Hong, Danfeng [4 ,5 ]
机构
[1] Southwest Jiaotong Univ, Fac Geosci & Engn, Chengdu 610031, Peoples R China
[2] Southwest Jiaotong Univ, State Prov Joint Engn Lab Spatial Informat Technol, Chengdu 611756, Peoples R China
[3] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[4] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[5] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Contrastive learning; Remote sensing; Optical sensors; Optical imaging; Radar polarimetry; Adaptive optics; Training; Perturbation methods; multimodal remote sensing image (RSI); negative samples; semantic segmentation; tuple perturbation;
D O I
10.1109/TGRS.2025.3542868
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Deep learning models exhibit promising potential in multimodal remote sensing image semantic segmentation (MRSISS). However, the constrained access to labeled samples for training deep learning networks significantly influences the performance of these models. To address that, self-supervised learning (SSL) methods have garnered significant interest in the remote sensing community. Accordingly, this article proposes a novel multimodal contrastive learning framework based on tuple perturbation, which includes the pretraining and fine-tuning stages. First, a tuple perturbation-based multimodal contrastive learning network (TMCNet) is designed to better explore shared and different feature representations across modalities during the pretraining stage and the tuple perturbation module is introduced to improve the network's ability to extract multimodal features by generating more complex negative samples. In the fine-tuning stage, we develop a simple and effective multimodal semantic segmentation network (MSSNet), which can reduce noise by using complementary information from various modalities to integrate multimodal features more effectively, resulting in better semantic segmentation performance. Extensive experiments have been carried out on two published multimodal image datasets including optical and synthetic aperture radar (SAR) pairs, and the results show that the proposed framework can obtain more superior performance of semantic segmentation than the current state-of-the-art methods in cases of limited labeled samples. The source code is available at https://github.com/yeyuanxin110/TMCNet-MSSNet.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Aircraft Segmentation Based On Deep Learning framework : from extreme points to remote sensing image segmentation
    Zhao, Lei
    Qiao, Peng
    Dou, Yong
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 1362 - 1366
  • [32] DSNet-Based Remote Sensing Image Semantic Segmentation Method
    Shi Fangxing
    Zhou Line
    Zhu Daming
    Fu Zhitao
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (06)
  • [33] Change Detection for Remote Sensing Images based on Semantic Prototypes and Contrastive Learning
    Zhao, Guiqin
    Wang, Weiqiang
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 865 - 869
  • [34] Remote Sensing Image Semantic Segmentation Based on Edge Information Guidance
    He, Chu
    Li, Shenglin
    Xiong, Dehui
    Fang, Peizhang
    Liao, Mingsheng
    REMOTE SENSING, 2020, 12 (09)
  • [35] Semantic segmentation of remote sensing image based on bilateral branch network
    Li, Zhongyu
    Wang, Huajun
    Liu, Yang
    VISUAL COMPUTER, 2024, 40 (05): : 3069 - 3090
  • [36] Semantic segmentation of remote sensing image based on bilateral branch network
    Zhongyu Li
    Huajun Wang
    Yang Liu
    The Visual Computer, 2024, 40 : 3069 - 3090
  • [37] Combining deep learning and ontology reasoning for remote sensing image semantic segmentation
    Li, Yansheng
    Ouyang, Song
    Zhang, Yongjun
    KNOWLEDGE-BASED SYSTEMS, 2022, 243
  • [38] A Multilevel Multimodal Fusion Transformer for Remote Sensing Semantic Segmentation
    Ma, Xianping
    Zhang, Xiaokang
    Pun, Man-On
    Liu, Ming
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [39] Remote Sensing Image Scene Classification Based on Supervised Contrastive Learning
    Guo Dongen
    Xia Ying
    Luo Xiaobo
    Feng Jiangfan
    ACTA PHOTONICA SINICA, 2021, 50 (07)
  • [40] Multi-granularity semantic alignment distillation learning for remote sensing image semantic segmentation
    Di Zhang
    Yong Zhou
    Jiaqi Zhao
    Zhongyuan Yang
    Hui Dong
    Rui Yao
    Huifang Ma
    Frontiers of Computer Science, 2022, 16