TransFG: A Cross-View Geo-Localization of Satellite and UAVs Imagery Pipeline Using Transformer-Based Feature Aggregation and Gradient Guidance

被引:8
|
作者
Zhao, Hu [1 ]
Ren, Keyan [1 ]
Yue, Tianyi [1 ]
Zhang, Chun [1 ]
Yuan, Shuai [1 ]
机构
[1] Beijing Univ Technol, Lab Low Altitude Intelligent Percept, Beijing 100124, Peoples R China
关键词
Cross-view geo-localization; feature aggregation (FA); image matching; transformer; unmanned aerial vehicles (UAVs);
D O I
10.1109/TGRS.2024.3352418
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Cross-view geo-localization of satellite and unmanned aerial vehicles (UAVs) imagery has attracted extensive attention due to its tremendous potential for global navigation satellite system (GNSS) denied navigation. However, inadequate feature representation across different views coupled with positional shifts and distance-scale uncertainty are key challenges. Most of the existing research mainly focused on extracting comprehensive and fine-grained information, yet effective feature representation and alignment should be imposed equal importance. In this article, we propose an innovative transformer-based pipeline TransFG for robust cross-view image matching, which incorporates feature aggregation (FA) and gradient guidance (GG) module. TransFG synergically takes advantage of FA and GG, achieving an effective balance in feature representation and alignment. Specifically, the proposed FA module implicitly learns salient features and dynamically aggregates contextual features from the vision transformer (ViT). The proposed GG module uses the gradient information of local features to further enhance the cross-view feature representation and aligns specific instances across different views. Extensive experiments demonstrate that our pipeline outperforms existing methods in cross-view geo-localization. It achieves an impressive improvement in R@1 and AP than the state-of-the-art (SOTA) methods.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [1] Cross-view geo-localization with evolving transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    arXiv, 2021,
  • [2] Cross-View Object Geo-Localization in a Local Region With Satellite Imagery
    Sun, Yuxi
    Ye, Yunming
    Kang, Jian
    Fernandez-Beltran, Ruben
    Feng, Shanshan
    Li, Xutao
    Luo, Chuyao
    Zhang, Puzhao
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [3] Combining OpenStreetMap with Satellite Imagery to Enhance Cross-View Geo-Localization
    Hu, Yuekun
    Liu, Yingfan
    Hui, Bin
    SENSORS, 2025, 25 (01)
  • [4] A Semantic Guidance and Transformer-Based Matching Method for UAVs and Satellite Images for UAV Geo-Localization
    Zhuang, Jiedong
    Chen, Xuruoyan
    Dai, Ming
    Lan, Wenbo
    Cai, Yongheng
    Zheng, Enhui
    IEEE ACCESS, 2022, 10 : 34277 - 34287
  • [5] Cross-View Geo-Localization for Autonomous UAV Using Locally-Aware Transformer-Based Network
    Bui, Duc Viet
    Kubo, Masao
    Sato, Hiroshi
    IEEE ACCESS, 2023, 11 : 104200 - 104210
  • [6] Spatial-Aware Feature Aggregation for Cross-View Image based Geo-Localization
    Shi, Yujiao
    Liu, Liu
    Yu, Xin
    Li, Hongdong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] Feature Relation Guided Cross-View Image Based Geo-Localization
    Hou, Qingfeng
    Lu, Jun
    Guo, Haitao
    Liu, Xiangyun
    Gong, Zhihui
    Zhu, Kun
    Ping, Yifan
    REMOTE SENSING, 2023, 15 (20)
  • [8] Cross-view Geo-localization with Layer-to-Layer Transformer
    Yang, Hongji
    Lu, Xiufan
    Zhu, Yingying
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] UAV-Satellite View Synthesis for Cross-View Geo-Localization
    Tian, Xiaoyang
    Shao, Jie
    Ouyang, Deqiang
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4804 - 4815
  • [10] Perceptual Feature Fusion Network for Cross-View Geo-Localization
    Wang, Jiayi
    Chen, Ziyang
    Yuan, Xiaochen
    Zhao, Genping
    Computer Engineering and Applications, 60 (03): : 255 - 262