Multibranch Joint Representation Learning Based on Information Fusion Strategy for Cross-View Geo-Localization

被引：0

作者：

Ge, Fawei ^{[1
]}

Zhang, Yunzhou ^{[1
]}

Liu, Yixiu ^{[2
]}

Wang, Guiyuan ^{[3
]}

Coleman, Sonya ^{[4
]}

Kerr, Dermot

Wang, Li ^{[1
]}

机构：

[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China

[2] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China

[3] Jiangsu Shuguang Optoelect Co Ltd, Yangzhou 225000, Jiangsu, Peoples R China

[4] Ulster Univ, Intelligent Syst Res Ctr, Londonderry BT52 1SA, England

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Task analysis; Representation learning; Location awareness; Deep learning; Context modeling; Layout; Geo-localization; hybrid information fusion strategies (IFSs); joint representation learning; multibranch; IMAGE; SIMILARITY; SCALE;

D O I：

10.1109/TGRS.2024.3378453

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Cross-view geo-localization refers to recognizing images of the same geographic target obtained from different platforms (such as drone-view, satellite-view, and ground-view). However, cross-view geo-localization is challenging as image capture using different platforms coupled with extreme viewpoint variations can cause significant changes to the visual image content. Existing methods mainly focus on mining the fine-grained features or the contextual information in neighboring areas, but ignore the complete information of the entire image and the association of contextual information of adjacent regions. Therefore, a multibranch joint representation learning network model based on information fusion strategies (IFSs) is proposed to solve this cross-view geo-localization problem. First, we obtained feature information from the image through global information fusion (GIF) branch and local information fusion (LIF) branch to help the network learn the discernable information in the different images. In addition, a local-guided-GIF (LGGIF) branch is introduced to make local information assist global features to enhance the learning of potential information in the images. Second, we introduced different IFSs in each branch to increase the extraction of contextual information through expanding the global receptive field, thus improving the performance of the model. Finally, a series of experiments is carried out on four prevailing benchmark datasets, namely University-1652, SUES-200, CVUAS, and CVACT datasets. The quantitative comparisons from the experiments clearly indicate that the proposed network framework has great performance. For example, compared with some state-of-the-art methods, the quantitative improvements of the R@1 and AP on the University-1652 datasets are 1.91%, 2.18%, and 1.55%, 2.99% in both tasks, respectively.

引用

页码：1 / 16

页数：16

共 50 条

[1] Joint Representation Learning and Keypoint Detection for Cross-View Geo-Localization
Lin, Jinliang
Zheng, Zhedong
Zhong, Zhun
Luo, Zhiming
Li, Shaozi
Yang, Yi
Sebe, Nicu
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3780 - 3792
[2] Cross-View Geo-Localization: A Survey
Durgam, Abhilash
Paheding, Sidike
Dhiman, Vikas
Devabhaktuni, Vijay
[J]. IEEE Access, 2024, 12 : 192028 - 192050
[3] Multilevel Feedback Joint Representation Learning Network Based on Adaptive Area Elimination for Cross-View Geo-Localization
Ge, Fawei
Zhang, Yunzhou
Wang, Li
Liu, Wei
Liu, Yixiu
Coleman, Sonya
Kerr, Dermot
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
[4] Perceptual Feature Fusion Network for Cross-View Geo-Localization
Wang, Jiayi
Chen, Ziyang
Yuan, Xiaochen
Zhao, Genping
[J]. Computer Engineering and Applications, 60 (03): : 255 - 262
[5] Fusing Geometric and Scene Information for Cross-View Geo-Localization
Guo, Siyuan
Liu, Tianying
Li, Wengen
Guan, Jihong
Zhou, Shuigeng
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3978 - 3982
[6] GAMa: Cross-View Video Geo-Localization
Vyas, Shruti
Chen, Chen
Shah, Mubarak
[J]. COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 440 - 456
[7] Cross-View Image Sequence Geo-localization
Zhang, Xiaohan
Sultani, Waqas
Wshah, Safwan
[J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2913 - 2922
[8] Cross-view geo-localization with evolving transformer
Yang, Hongji
Lu, Xiufan
Zhu, Yingying
[J]. arXiv, 2021,
[9] Cross-view Geo-localization Based on Cross-domain Matching
Wu, Xiaokang
Ma, Qianguang
Li, Qi
Yu, Yuanlong
Liu, Wenxi
[J]. ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 719 - 728
[10] Learning Cross-View Visual Geo-Localization Without Ground Truth
Li, Haoyuan
Xu, Chang
Yang, Wen
Yu, Huai
Xia, Gui-Song
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62

← 1 2 3 4 5 →