Cross-Modality Image Matching Network with Modality-Invariant Feature Representation for Airborne-Ground Thermal Infrared and Visible Datasets

被引:0
|
作者
Cui, Song [1 ]
Ma, Ailong [1 ]
Wan, Yuting [1 ]
Zhong, Yanfei [1 ]
Luo, Bin [1 ]
Xu, Miaozhong [1 ]
机构
[1] State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China
基金
中国国家自然科学基金;
关键词
Image matching - Infrared imaging - Remote sensing - Deep learning - Image enhancement - Semantics - Infrared radiation - Neural networks - Object recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Thermal infrared (TIR) remote-sensing imagery can allow objects to be imaged clearly at night through the long-wave infrared, so that the fusion of thermal infrared and visible (VIS) imagery is a way to improve the remote-sensing interpretation ability. However, due to the large radiation difference between the two kinds of images, it is very difficult to match them. One of the most important issues is the lack of comprehensive consideration of the modality-specific information and modality-shared information, which makes it difficult for the existing methods to obtain a modality-invariant feature representation. In this article, a cross-modality image matching network, which we refer to as CMM-Net, is proposed to realize thermal infrared and visible image matching by learning a modality-invariant feature representation. First, in order to extract the modality-specific features of the imagery, the framework constructs a shallow two-branch network to make full use of the modality-specific information, without sharing parameters. Second, in order to extract the high-level semantic information between the different modalities, modality-shared layers are embedded into the deep layers of the network. In addition, three novel loss functions are designed and combined to learn the modality-invariant feature representation, that is, the discriminative loss of the non-corresponding features in the same modality, the cross-modality loss of the corresponding features between different modalities, and the cross-modality triplet (CMT) loss. The multimodal matching experiments conducted with ground- and airborne-based thermal infrared images and visible images showed that the proposed method outperforms the existing image matching methods by about 2% and 6% for the ground and airborne images, respectively. © 1980-2012 IEEE.
引用
收藏
相关论文
共 26 条
  • [1] Cross-Modality Image Matching Network With Modality-Invariant Feature Representation for Airborne-Ground Thermal Infrared and Visible Datasets
    Cui, Song
    Ma, Ailong
    Wan, Yuting
    Zhong, Yanfei
    Luo, Bin
    Xu, Miaozhong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] Modality-Invariant Structural Feature Representation for Multimodal Remote Sensing Image Matching
    Fan, Jianwei
    Xiong, Qing
    Li, Jian
    Ye, Yuanxin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [3] Learning Modality-Invariant Features by Cross-Modality Adversarial Network for Visual Question Answering
    Fu, Ze
    Zheng, Changmeng
    Cai, Yi
    Li, Qing
    Wang, Tao
    [J]. WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 316 - 331
  • [4] Enhanced Invariant Feature Joint Learning via Modality-Invariant Neighbor Relations for Cross-Modality Person Re-Identification
    Du, Guodong
    Zhang, Liyan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2361 - 2373
  • [5] Correlation-Guided Discriminative Cross-Modality Features Network for Infrared and Visible Image Fusion
    Cai, Zhao
    Ma, Yong
    Huang, Jun
    Mei, Xiaoguang
    Fan, Fan
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 18
  • [6] Adversarial Decoupling and Modality-Invariant Representation Learning for Visible-Infrared Person Re-Identification
    Hu, Weipeng
    Liu, Bohong
    Zeng, Haitang
    Hou, Yanke
    Hu, Haifeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5095 - 5109
  • [7] Multi-complement feature network for infrared-visible cross-modality person re-identification
    Kong, Jun
    Liu, Xudong
    Jiang, Min
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (06)
  • [8] General cross-modality registration framework for visible and infrared UAV target image registration
    Yu Luo
    Hao Cha
    Lei Zuo
    Peng Cheng
    Qing Zhao
    [J]. Scientific Reports, 13
  • [9] General cross-modality registration framework for visible and infrared UAV target image registration
    Luo, Yu
    Cha, Hao
    Zuo, Lei
    Cheng, Peng
    Zhao, Qing
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [10] Pose-Guided Modality-Invariant Feature Alignment for Visible-Infrared Object Re-Identification
    Liu, Min
    Sun, Yeqing
    Wang, Xueping
    Bian, Yuan
    Zhang, Zhu
    Wang, Yaonan
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 10