A Survey of the Research Progress in Image Geo-localization

被引:0
|
作者
Huang G. [1 ]
Zhou Y. [1 ]
Hu X. [1 ]
Zhao L. [1 ]
Zhang C. [1 ]
机构
[1] Institute of Geospatial Information, PLA Strategic Support Force Information Engineering University, Zhengzhou
关键词
cyberspace surveying and mapping; datasets; evaluation method; image geo- localization; image retrieval; multi- sources data;
D O I
10.12082/dqxxkx.2023.230073
中图分类号
学科分类号
摘要
Image geo-localization is a technique that obtains the geographic location information of an image through a series of methods, so as to establish a mapping relationship with the real geographic space. This technique is important for further image information mining and has potential application value in cyberspace surveying and mapping, intelligence acquisition, user outdoor positioning, and augmented reality. Despite the tremendous progress in the field of computer vision, high-precision automatic geo-localization of images still needs to be addressed due to the involvement of multiple fields such as image feature extraction, large-scale data retrieval, large-scale point cloud processing, deep learning, geographic information feature extraction, geometric modeling and reasoning, semantic scene understanding, context-based reasoning, and multiple data fusion. This paper reviews the progress of image geo-localization research, mainly including image geo-localization methods, image geo-localization datasets, image geo-localization evaluation methods, and summary and prospect of image geo-localization. Firstly, image geolocation methods are classified into three categories, i.e., image retrieval, 2D-3D matching, and cross-modal retrieval, according to the relevance of the research content. Secondly, the datasets and evaluation methods used for image geo-localization research are categorized and summarized. The geolocalization datasets include image datasets, cross-view datasets, Structure from Motion (SfM) datasets, and multimodal datasets, etc. The image geo-localization evaluation metrics include Top-k candidates, localization error, position and orientation error per video frame, and accuracy/recall. Finally, the current status of image geo-localization research is analyzed, and the future research directions of image geo-localization are outlined in terms of global geo-localization, natural area geo-localization, multi-method fusion for geo-localization, Point of Interest (POI) data-based geo-localization, and pre-selected location refinement. © 2023 Journal of Geo-Information Science. All rights reserved.
引用
收藏
页码:1336 / 1362
页数:26
相关论文
共 162 条
  • [1] Flatow D, Naaman M, Xie K E, Et al., On the accuracy of hyper- local geotagging of social media content, Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, pp. 127-136, (2015)
  • [2] Hays J, Efros A A., IM2GPS: estimating geographic information from a single image[C], 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, (2008)
  • [3] Bansal M, Sawhney H S, Cheng H, Et al., Geo-localization of street views with aerial image databases[C], Proceedings of the 19th ACM international conference on Multimedia - MM’11, (2011)
  • [4] Zamir A R, Ardeshir S, Shah M., GPS-tag refinement using random walks with an adaptive damping factor[C], 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 4280-4287, (2014)
  • [5] Baatz G, Saurer O, Koeser K, Et al., Large scale visual geolocalization of images in mountainous terrain[C], Computer Vision - ECCV 2012, 7573, pp. 517-530, (2012)
  • [6] Baboud L, Cadik M, Eisemann E, Et al., Automatic photo-to- terrain alignment for the annotation of mountain pictures[C], CVPR, pp. 41-48, (2011)
  • [7] Porzi L, Bulo S R, Valigi P, Et al., Learning contours for automatic annotations of mountains pictures on a smart-phone[C], Proceedings of the International Conference on Distributed Smart Cameras, pp. 1-6, (2014)
  • [8] Tzeng E, Zhai A, Clements M, Et al., User-driven geolocation of untagged desert imagery using digital elevation models[C], 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 237-244, (2013)
  • [9] Mithun N C, Sikka K, Chiu H P, Et al., RGB2LIDAR: Towards solving large-scale cross-modal visual localization [EB/OL], (2020)
  • [10] Weng L, Gouet- Brunet V, Soheilian B., Semantic signatures for large- scale visual localization[J], Multimedia Tools and Applications, 80, 15, pp. 22347-22372, (2021)