Visual Grounding in Remote Sensing Images

被引:23
|
作者
Sun, Yuxi [1 ]
Feng, Shanshan [1 ]
Li, Xutao [1 ]
Ye, Yunming [1 ]
Kang, Jian [2 ]
Huang, Xu [1 ]
机构
[1] Harbin Inst Technol, Shenzhen, Peoples R China
[2] Soochow Univ, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
dataset; object retrieval; visual grounding; remote sensing; referring expression;
D O I
10.1145/3503161.3548316
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Ground object retrieval from a large-scale remote sensing image is very important for lots of applications. We present a novel problem of visual grounding in remote sensing images. Visual grounding aims to locate the particular objects (in the form of the bounding box or segmentation mask) in an image by a natural language expression. The task already exists in the computer vision community. However, existing benchmark datasets and methods mainly focus on natural images rather than remote sensing images. Compared with natural images, remote sensing images contain large-scale scenes and the geographical spatial information of ground objects (e.g., longitude, latitude). The existing method cannot deal with these challenges. In this paper, we collect a new visual grounding dataset, called RSVG, and design a new method, namely GeoVG. In particular, the proposed method consists of a language encoder, image encoder, and fusion module. The language encoder is used to learn numerical geospatial relations and represent a complex expression as a geospatial relation graph. The image encoder is applied to learn large-scale remote sensing scenes with adaptive region attention. The fusion module is used to fuse the text and image feature for visual grounding. We evaluate the proposed method by comparing it to the state-of-the-art methods on RSVG. Experiments show that our method outperforms the previous methods on the proposed datasets. https://sunyuxi.github.io/publication/GeoVG
引用
收藏
页数:9
相关论文
共 50 条
  • [41] An indexing model of remote sensing images
    Carrara, P
    Pasi, G
    Pepe, M
    Rampini, A
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2004, 3115 : 517 - 525
  • [42] Images fusing in remote sensing mapping
    Qin, QM
    Liu, DP
    Liu, HT
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : A315 - A320
  • [43] Insightful Visualization of Remote Sensing Images
    Najim, Safa A.
    Ahmed, Basaeir Y.
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [44] On Interference Removal in Remote Sensing Images
    Zhang, Nannan
    Zhang, Guanbin
    Zhou, Kefa
    Chen, Wanwen
    INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING BIOMEDICAL ENGINEERING, AND INFORMATICS (SPBEI 2013), 2014, : 131 - 138
  • [45] Retrieving images for remote sensing applications
    Sawant, Neela
    Chandran, Sharat
    Mohan, B. Krishna
    COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2006, 4338 : 849 - +
  • [46] The Potential of Visual ChatGPT for Remote Sensing
    Osco, Lucas Prado
    de Lemos, Eduardo Lopes
    Goncalves, Wesley Nunes
    Ramos, Ana Paula Marques
    Marcato Jr, Jose
    REMOTE SENSING, 2023, 15 (13)
  • [47] A Multilevel Visual Feature-Based Approach for Measuring the Spatial Information in Remote Sensing Images
    Liu, Baoju
    Deng, Min
    Liu, Huimin
    Shi, Yan
    Zhao, Bin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (10) : 4110 - 4122
  • [48] Lossy DCT-based compression of remote sensing images with providing a desired visual quality
    Krivenko, Sergey S.
    Abramov, Sergey K.
    Lukin, Vladimir V.
    Vozel, Benoit
    Chehdi, Kacem
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXV, 2019, 11155
  • [49] Region of Interest Detection Based on Visual Saliency Analysis and Iteratively Clustering for Remote Sensing Images
    Sun, Yang
    Wang, Shuang
    Zhang, Libao
    AUTOMATED VISUAL INSPECTION AND MACHINE VISION III, 2019, 11061
  • [50] Remote Sensing Images Fusion based on Block Compressed Sensing
    Yang Sen-lin
    Wan Guo-bin
    Zhang Bian-lian
    Chong Xin
    INTERNATIONAL SYMPOSIUM ON PHOTOELECTRONIC DETECTION AND IMAGING 2013: IMAGING SPECTROMETER TECHNOLOGIES AND APPLICATIONS, 2013, 8910