VIGOR: Cross-View Image Geo-localization beyond One-to-one Retrieval

Cited by: 65
Authors
Zhu, Sijie [1]
Yang, Taojiannan [1]
Chen, Chen [1]
Affiliations
[1] Univ North Carolina Charlotte, Charlotte, NC 28223 USA
Funding
National Science Foundation (USA);
Keywords
DOI
10.1109/CVPR46437.2021.00364
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cross-view image geo-localization aims to determine the locations of street-view query images by matching with GPS-tagged reference images from aerial view. Recent works have achieved surprisingly high retrieval accuracy on city-scale datasets. However, these results rely on the assumption that there exists a reference image exactly centered at the location of any query image, which is not applicable for practical scenarios. In this paper, we redefine this problem with a more realistic assumption that the query image can be arbitrary in the area of interest and the reference images are captured before the queries emerge. This assumption breaks the one-to-one retrieval setting of existing datasets as the queries and reference images are not perfectly aligned pairs, and there may be multiple reference images covering one query location. To bridge the gap between this realistic setting and existing datasets, we propose a new large-scale benchmark, VIGOR, for cross-View Image Geo-localization beyond One-to-one Retrieval. We benchmark existing state-of-the-art methods and propose a novel end-to-end framework to localize the query in a coarse-to-fine manner. Apart from the image-level retrieval accuracy, we also evaluate the localization accuracy in terms of the actual distance (meters) using the raw GPS data. Extensive experiments are conducted under different application scenarios to validate the effectiveness of the proposed method. The results indicate that cross-view geo-localization in this realistic setting is still challenging, fostering new research in this direction. Our dataset and code will be released at https://github.com/Jeff-Zilence/VIGOR.
Pages: 3639-3648
Page count: 10