A NOVEL SELF-SUPERVISED CROSS-MODAL IMAGE RETRIEVAL METHOD IN REMOTE SENSING

被引:3
|
作者
Sumbul, Gencer [1 ]
Mueller, Markus [1 ]
Demir, Beguem [1 ]
机构
[1] Tech Univ Berlin, Fac Elect Engn & Comp Sci, Berlin, Germany
基金
欧洲研究理事会;
关键词
Cross-modal image retrieval; deep learning; self-supervised learning; remote sensing;
D O I
10.1109/ICIP46576.2022.9897475
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the availability of multi-modal remote sensing (RS) image archives, one of the most important research topics is the development of cross-modal RS image retrieval (CM-RSIR) methods that search semantically similar images across different modalities. Existing CM-RSIR methods require the availability of a high quality and quantity of annotated training images. The collection of a sufficient number of reliable labeled images is time consuming, complex and costly in operational scenarios, and can significantly affect the final accuracy of CM-RSIR. In this paper, we introduce a novel self-supervised CM-RSIR method that aims to: i) model mutual-information between different modalities in a self-supervised manner; ii) retain the distributions of modal-specific feature spaces similar to each other; and iii) define the most similar images within each modality without requiring any annotated training image. To this end, we propose a novel objective including three loss functions that simultaneously: i) maximize mutual information of different modalities for inter-modal similarity preservation; ii) minimize the angular distance of multi-modal image tuples for the elimination of inter-modal discrepancies; and iii) increase cosine similarity of the most similar images within each modality for the characterization of intra-modal similarities. Experimental results show the effectiveness of the proposed method compared to state-of-the-art methods. The code of the proposed method is publicly available at https://git.tu- berlin.de/rsim/SS-CM-RSIR.
引用
收藏
页码:2426 / 2430
页数:5
相关论文
共 50 条
  • [41] Learning From Self-Supervised Features for Hashing-Based Remote Sensing Image Retrieval
    Tang, Jiayi
    Wang, Dali
    Tong, Xiaochong
    Qiu, Chunping
    Yang, Weiming
    Lei, Yi
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [42] Cross-Modal feature description for remote sensing image matching
    Li, Liangzhi
    Liu, Ming
    Ma, Lingfei
    Han, Ling
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 112
  • [43] Cross-Modal Contrastive Learning for Remote Sensing Image Classification
    Feng, Zhixi
    Song, Liangliang
    Yang, Shuyuan
    Zhang, Xinyu
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [44] HGR MAXIMAL CORRELATION AUGMENTED CROSS-MODAL REMOTE SENSING RETRIEVAL
    Wang, Zhuoyue
    Wang, Xueqian
    Li, Gang
    Li, Chengxi
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5053 - 5056
  • [45] Federated learning for supervised cross-modal retrieval
    Li, Ang
    Li, Yawen
    Shao, Yingxia
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (04):
  • [46] Text-Image Matching for Cross-Modal Remote Sensing Image Retrieval via Graph Neural Network
    Yu, Hongfeng
    Yao, Fanglong
    Lu, Wanxuan
    Liu, Nayu
    Li, Peiguang
    You, Hongjian
    Sun, Xian
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 812 - 824
  • [47] A General Self-Supervised Framework for Remote Sensing Image Classification
    Gao, Yuan
    Sun, Xiaojuan
    Liu, Chao
    [J]. REMOTE SENSING, 2022, 14 (19)
  • [48] Research on Semantic Segmentation Method of Remote Sensing Image Based on Self-supervised Learning
    Zhang, Wenbo
    Wang, Achuan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (08) : 500 - 508
  • [49] Self-Supervised Intra-Modal and Cross-Modal Contrastive Learning for Point Cloud Understanding
    Wu, Yue
    Liu, Jiaming
    Gong, Maoguo
    Gong, Peiran
    Fan, Xiaolong
    Qin, A. K.
    Miao, Qiguang
    Ma, Wenping
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1626 - 1638
  • [50] Heterogeneous self-supervised interest point matching for multi-modal remote sensing image registration
    Zhao, Ming
    Zhang, Guixiang
    Ding, Min
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (03) : 915 - 931