Multibranch Joint Representation Learning Based on Information Fusion Strategy for Cross-View Geo-Localization

被引:0
|
作者
Ge, Fawei [1 ]
Zhang, Yunzhou [1 ]
Liu, Yixiu [2 ]
Wang, Guiyuan [3 ]
Coleman, Sonya [4 ]
Kerr, Dermot
Wang, Li [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China
[3] Jiangsu Shuguang Optoelect Co Ltd, Yangzhou 225000, Jiangsu, Peoples R China
[4] Ulster Univ, Intelligent Syst Res Ctr, Londonderry BT52 1SA, England
基金
中国国家自然科学基金;
关键词
Feature extraction; Task analysis; Representation learning; Location awareness; Deep learning; Context modeling; Layout; Geo-localization; hybrid information fusion strategies (IFSs); joint representation learning; multibranch; IMAGE; SIMILARITY; SCALE;
D O I
10.1109/TGRS.2024.3378453
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Cross-view geo-localization refers to recognizing images of the same geographic target obtained from different platforms (such as drone-view, satellite-view, and ground-view). However, cross-view geo-localization is challenging as image capture using different platforms coupled with extreme viewpoint variations can cause significant changes to the visual image content. Existing methods mainly focus on mining the fine-grained features or the contextual information in neighboring areas, but ignore the complete information of the entire image and the association of contextual information of adjacent regions. Therefore, a multibranch joint representation learning network model based on information fusion strategies (IFSs) is proposed to solve this cross-view geo-localization problem. First, we obtained feature information from the image through global information fusion (GIF) branch and local information fusion (LIF) branch to help the network learn the discernable information in the different images. In addition, a local-guided-GIF (LGGIF) branch is introduced to make local information assist global features to enhance the learning of potential information in the images. Second, we introduced different IFSs in each branch to increase the extraction of contextual information through expanding the global receptive field, thus improving the performance of the model. Finally, a series of experiments is carried out on four prevailing benchmark datasets, namely University-1652, SUES-200, CVUAS, and CVACT datasets. The quantitative comparisons from the experiments clearly indicate that the proposed network framework has great performance. For example, compared with some state-of-the-art methods, the quantitative improvements of the R@1 and AP on the University-1652 datasets are 1.91%, 2.18%, and 1.55%, 2.99% in both tasks, respectively.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [1] Joint Representation Learning and Keypoint Detection for Cross-View Geo-Localization
    Lin, Jinliang
    Zheng, Zhedong
    Zhong, Zhun
    Luo, Zhiming
    Li, Shaozi
    Yang, Yi
    Sebe, Nicu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3780 - 3792
  • [2] Cross-View Geo-Localization: A Survey
    Durgam, Abhilash
    Paheding, Sidike
    Dhiman, Vikas
    Devabhaktuni, Vijay
    [J]. IEEE Access, 2024, 12 : 192028 - 192050
  • [3] Multilevel Feedback Joint Representation Learning Network Based on Adaptive Area Elimination for Cross-View Geo-Localization
    Ge, Fawei
    Zhang, Yunzhou
    Wang, Li
    Liu, Wei
    Liu, Yixiu
    Coleman, Sonya
    Kerr, Dermot
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [4] Perceptual Feature Fusion Network for Cross-View Geo-Localization
    Wang, Jiayi
    Chen, Ziyang
    Yuan, Xiaochen
    Zhao, Genping
    [J]. Computer Engineering and Applications, 60 (03): : 255 - 262
  • [5] Fusing Geometric and Scene Information for Cross-View Geo-Localization
    Guo, Siyuan
    Liu, Tianying
    Li, Wengen
    Guan, Jihong
    Zhou, Shuigeng
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3978 - 3982
  • [6] GAMa: Cross-View Video Geo-Localization
    Vyas, Shruti
    Chen, Chen
    Shah, Mubarak
    [J]. COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 440 - 456
  • [7] Cross-View Image Sequence Geo-localization
    Zhang, Xiaohan
    Sultani, Waqas
    Wshah, Safwan
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2913 - 2922
  • [8] Cross-view geo-localization with evolving transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    [J]. arXiv, 2021,
  • [9] Cross-view Geo-localization Based on Cross-domain Matching
    Wu, Xiaokang
    Ma, Qianguang
    Li, Qi
    Yu, Yuanlong
    Liu, Wenxi
    [J]. ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 719 - 728
  • [10] Learning Cross-View Visual Geo-Localization Without Ground Truth
    Li, Haoyuan
    Xu, Chang
    Yang, Wen
    Yu, Huai
    Xia, Gui-Song
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62