Rotation-aware representation learning for remote sensing image retrieval

被引:12
|
作者
Wu, Zhi-Ze [1 ]
Zou, Chang [2 ]
Wang, Yan [3 ]
Tan, Ming [4 ]
Weise, Thomas [1 ]
机构
[1] Hefei Univ, Sch Artificial Intelligence & Big Data, Inst Appl Optimizat, Jinxiu Dadao 99, Hefei 230601, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[3] Anhui Jianzhu Univ, Sch Art, Jinzhai Rd 856, Hefei 230022, Anhui, Peoples R China
[4] Hefei Univ, Sch Artificial Intelligence & Big Data, Jinxiu Dadao 99, Hefei 230601, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Spatial transformer network; Rotation invariance; Deep learning; Deep features; Content-based remote sensing image retrieval; SCENE; SHAPE; SELECTION; NETWORK;
D O I
10.1016/j.ins.2021.04.078
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rising number and size of remote sensing (RS) image archives makes content-based RS image retrieval (CBRSIR) more important. Convolutional neural networks (CNNs) offer good CBRSIR performance, but the features they extract are not rotation-invariant. This is problematic as objects in RS images appear in arbitrary rotation angles. We develop and investigate two new rotation-aware CNN-based CBRSIR methods: 1) In the Feature Map Transformation Based Rotation-Aware Network (FMT-RAN), the last pooling layer is rotated in four different angles during training. Its outputs are passed through the same fully connected-, coding-, and classification layer, and the resulting losses are added. 2) The Spatial Transformer-based Rotation-Aware Network (ST-RAN) contains a spatial transformer network (STN) and a rotation aware network (RAN). For training, the original and a randomly rotated version of an image are fed into the ST-RAN. The STN generates a transformed version of the original to match the rotated image. The RAN extracts the features of all three images. We apply two-stage training, which first optimizes the STN and then the RAN. Both of our methods are efficient in terms of retrieval accuracy and time, but ST-RAN has the overall best performance. It outperforms the state-of-the-art CBRSIR methods. (c) 2021 Published by Elsevier Inc.
引用
收藏
页码:404 / 423
页数:20
相关论文
共 50 条
  • [31] Rotation-aware correlation filters for robust visual tracking
    Liao, Jiawen
    Qi, Chun
    Cao, Jianzhong
    Wang, Xiaofang
    Ren, Long
    Zhang, Chaoning
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83
  • [32] Cross-Modal Contrastive Learning With Spatiotemporal Context for Correlation-Aware Multiscale Remote Sensing Image Retrieval
    Zhu, Lilu
    Wang, Yang
    Hu, Yanfeng
    Su, Xiaolu
    Fu, Kun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [33] WRAPD: Weighted Rotation-aware ADMM for Parameterization and Deformation
    Brown, George E.
    Narain, Rahul
    ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04):
  • [34] Duplex-Hierarchy Representation Learning for Remote Sensing Image Classification
    Yuan, Xiaobin
    Zhu, Jingping
    Lei, Hao
    Peng, Shengjun
    Wang, Weidong
    Li, Xiaobin
    SENSORS, 2024, 24 (04)
  • [35] Remote Sensing Image Fusion Based on Dictionary Learning and Sparse Representation
    Yin, Fei
    Cao, Shuhua
    Xu, Xiaojie
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [36] MARTA GANs: Unsupervised Representation Learning for Remote Sensing Image Classification
    Lin, Daoyu
    Fu, Kun
    Wang, Yang
    Xu, Guangluan
    Sun, Xian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (11) : 2092 - 2096
  • [37] SEMANTIC DECOUPLED REPRESENTATION LEARNING FOR REMOTE SENSING IMAGE CHANGE DETECTION
    Chen, Hao
    Zao, Yifan
    Liu, Liqin
    Chen, Song
    Shi, Zhenwei
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1051 - 1054
  • [38] Remote sensing image-text retrieval based on layout semantic joint representation
    Zhang R.
    Nie J.
    Song N.
    Zheng C.
    Wei Z.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (02): : 671 - 683
  • [39] A rotation-invariant horizontal vertical pooled module for remote sensing image representation
    Sitaula, Chiranjibi
    Aryal, Jagannath
    Neural Computing and Applications, 2024, 36 (30) : 18661 - 18673
  • [40] Boundary-Aware Multiscale Learning Perception for Remote Sensing Image Segmentation
    You, Chao
    Jiao, Licheng
    Liu, Xu
    Li, Lingling
    Liu, Fang
    Ma, Wenping
    Yang, Shuyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61