Tuple Perturbation-Based Contrastive Learning Framework for Multimodal Remote Sensing Image Semantic Segmentation

被引:0
|
作者
Ye, Yuanxin [1 ,2 ]
Dai, Jinkun [1 ,2 ]
Zhou, Liang [1 ,2 ]
Duan, Keyi [1 ,2 ]
Tao, Ran [3 ]
Li, Wei [3 ]
Hong, Danfeng [4 ,5 ]
机构
[1] Southwest Jiaotong Univ, Fac Geosci & Engn, Chengdu 610031, Peoples R China
[2] Southwest Jiaotong Univ, State Prov Joint Engn Lab Spatial Informat Technol, Chengdu 611756, Peoples R China
[3] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[4] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[5] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Contrastive learning; Remote sensing; Optical sensors; Optical imaging; Radar polarimetry; Adaptive optics; Training; Perturbation methods; multimodal remote sensing image (RSI); negative samples; semantic segmentation; tuple perturbation;
D O I
10.1109/TGRS.2025.3542868
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Deep learning models exhibit promising potential in multimodal remote sensing image semantic segmentation (MRSISS). However, the constrained access to labeled samples for training deep learning networks significantly influences the performance of these models. To address that, self-supervised learning (SSL) methods have garnered significant interest in the remote sensing community. Accordingly, this article proposes a novel multimodal contrastive learning framework based on tuple perturbation, which includes the pretraining and fine-tuning stages. First, a tuple perturbation-based multimodal contrastive learning network (TMCNet) is designed to better explore shared and different feature representations across modalities during the pretraining stage and the tuple perturbation module is introduced to improve the network's ability to extract multimodal features by generating more complex negative samples. In the fine-tuning stage, we develop a simple and effective multimodal semantic segmentation network (MSSNet), which can reduce noise by using complementary information from various modalities to integrate multimodal features more effectively, resulting in better semantic segmentation performance. Extensive experiments have been carried out on two published multimodal image datasets including optical and synthetic aperture radar (SAR) pairs, and the results show that the proposed framework can obtain more superior performance of semantic segmentation than the current state-of-the-art methods in cases of limited labeled samples. The source code is available at https://github.com/yeyuanxin110/TMCNet-MSSNet.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Spatial and Semantic Consistency Contrastive Learning for Self-Supervised Semantic Segmentation of Remote Sensing Images
    Dong, Zhe
    Liu, Tianzhu
    Gu, Yanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [22] Semantic Segmentation of Remote Sensing Image Based on Neural Network
    Wang Ende
    Qi Kai
    Li Xuepeng
    Peng Liangyu
    ACTA OPTICA SINICA, 2019, 39 (12)
  • [23] Remote sensing image semantic segmentation based on cascaded Transformer
    Wang F.
    Ji J.
    Wang Y.
    IEEE. Trans. Artif. Intell., 2024, 8 (4136-4148): : 1 - 12
  • [24] Remote sensing image semantic segmentation network based on ENet
    Wang, Yiqin
    JOURNAL OF ENGINEERING-JOE, 2022, 2022 (12): : 1219 - 1227
  • [25] Remote Sensing Image Semantic Segmentation Algorithm Based on TransMANet
    Song Xirui
    Ge Hongwei
    LASER & OPTOELECTRONICS PROGRESS, 2024, 61 (10)
  • [26] FALSE: False Negative Samples Aware Contrastive Learning for Semantic Segmentation of High-Resolution Remote Sensing Image
    Zhang, Zhaoyang
    Wang, Xuying
    Mei, Xiaoming
    Tao, Chao
    Li, Haifeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [27] GPMO: Gradient Perturbation-Based Contrastive Learning for Molecule Optimization
    Yang, Xixi
    Fu, Li
    Deng, Yafeng
    Liu, Yuansheng
    Cao, Dongsheng
    Zeng, Xiangxiang
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4940 - 4948
  • [28] Semi-Supervised Remote Sensing Image Semantic Segmentation Method Based on Deep Learning
    Li, Linhui
    Zhang, Wenjun
    Zhang, Xiaoyan
    Emam, Mahmoud
    Jing, Weipeng
    ELECTRONICS, 2023, 12 (02)
  • [29] Research on Semantic Segmentation Method of Remote Sensing Image Based on Self-supervised Learning
    Zhang, Wenbo
    Wang, Achuan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (08) : 500 - 508
  • [30] Prototype Guided Pseudo Labeling and Perturbation-based Active Learning for domain adaptive semantic segmentation
    Peng, Junkun
    Sun, Mingjie
    Lim, Eng Gee
    Wang, Qiufeng
    Xiao, Jimin
    PATTERN RECOGNITION, 2024, 148