Multimodal Image Fusion Framework for End-to-End Remote Sensing Image Registration

被引:0
|
作者
Li, Liangzhi [1 ]
Han, Ling [2 ]
Ding, Mingtao [1 ]
Cao, Hongye [1 ]
机构
[1] Changan Univ, Coll Geol Engn & Geomatics, Xian 710054, Shaanxi, Peoples R China
[2] Changan Univ, Sch Land Engn, Xian 710064, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Feature extraction; Image matching; Image registration; Task analysis; Convolutional neural networks; Image fusion; End-to-end registration; multimodal fusion; remote sensing image; spatial transformer networks; DEEP LEARNING FRAMEWORK;
D O I
10.1109/TGRS.2023.3247642
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
We formulate the registration as a function that maps the input reference and sensed images to eight displacement parameters between prescribed matching points, as opposed to the usual techniques (feature extraction-description-matching-geometric restrictions). The projection transformation matrix (PTM) is then computed in the neural network and used to warp the sensed image, uniting all matching tasks under one framework. In this article, we offer a multimodal image fusion network with self-attention to merge the feature representation of the reference and sensed images. The integration information is then utilized to regress the prescribed points' displacement parameters to get PTM between the reference and sensed images. Finally, PTM is supplied into the spatial transformation network (STN), which warps the sensed image to the same coordinates as the reference image, achieving end-to-end matching. In addition, a dual-supervised loss function is proposed to optimize the network from both the prescribed point displacement and the overall pixel matching perspectives. The effectiveness of our method is validated by qualitative and quantitative experimental results on multimodal remote sensing image matching tasks. The code is available at: https://github.com/liliangzhi110/E2EIR.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] End-to-end multimodal image registration via reinforcement learning
    Hu, Jing
    Luo, Ziwei
    Wang, Xin
    Sun, Shanhui
    Yin, Youbing
    Cao, Kunlin
    Song, Qi
    Lyu, Siwei
    Wu, Xi
    MEDICAL IMAGE ANALYSIS, 2021, 68
  • [2] END-TO-END LEARNING OF POLYGONS FOR REMOTE SENSING IMAGE CLASSIFICATION
    Girard, Nicolas
    Tarabalka, Yuliya
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2083 - 2086
  • [3] An End-to-End Framework Based on Vision-Language Fusion for Remote Sensing Cross-Modal Text-Image Retrieval
    He, Liu
    Liu, Shuyan
    An, Ran
    Zhuo, Yudong
    Tao, Jian
    MATHEMATICS, 2023, 11 (10)
  • [4] End-to-End FusVAE for Face Image Fusion
    Li, Xiang
    Chen, Bo
    Wen, Meijin
    Wang, Haoshuang
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [5] An End-to-End Local-Global-Fusion Feature Extraction Network for Remote Sensing Image Scene Classification
    Lv, Yafei
    Zhang, Xiaohan
    Xiong, Wei
    Cui, Yaqi
    Cai, Mi
    REMOTE SENSING, 2019, 11 (24)
  • [6] End-to-end dynamic residual focal transformer network for multimodal medical image fusion
    Zhang W.
    Yu L.
    Wang H.
    Pedrycz W.
    Neural Computing and Applications, 2024, 36 (19) : 11579 - 11601
  • [7] AN END-TO-END ADVERSARIAL HASHING METHOD FOR UNSUPERVISED MULTISPECTRAL REMOTE SENSING IMAGE RETRIEVAL
    Chen, Xuelei
    Lu, Cunyue
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1536 - 1540
  • [8] Remote sensing image description based on word embedding and end-to-end deep learning
    Wang, Yuan
    Ma, Hongbing
    Alifu, Kuerban
    Lv, Yalong
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [9] Remote sensing image description based on word embedding and end-to-end deep learning
    Yuan Wang
    Hongbing Ma
    Kuerban Alifu
    Yalong Lv
    Scientific Reports, 11