Automated classification of remote sensing satellite images using deep learning based vision transformer

被引:0
|
作者
Adekanmi Adegun [1 ]
Serestina Viriri [2 ]
Jules-Raymond Tapamo [2 ]
机构
[1] University of Roehampton,Department of Computing, School of Arts, Humanities, and Social Sciences
[2] University of KwaZulu-Natal,School of Mathematics, Statistics and Computer Science
[3] University of KwaZulu-Natal,School of Engineering
关键词
Remote sensing; Deep learning; Vision transformer; Local self attention;
D O I
10.1007/s10489-024-05818-y
中图分类号
学科分类号
摘要
Automatic classification of remote sensing images using machine learning techniques is challenging due to the complex features of the images. The images are characterized by features such as multi-resolution, heterogeneous appearance and multi-spectral channels. Deep learning methods have achieved promising results in the analysis of remote sensing satellite images in the recent past. However, deep learning methods based on convolutional neural networks (CNN) experience difficulties in the analysis of intrinsic objects from satellite images. These techniques have not achieved optimum performance in the analysis of remote sensing satellite images due to their complex features, such as coarse resolution, cloud masking, varied sizes of embedded objects and appearance. The receptive fields in convolutional operations are not able to establish long-range dependencies and lack global contextual connectivity for effective feature extraction. To address this problem, we propose an improved deep learning-based vision transformer model for the efficient analysis of remote sensing images. The proposed model incorporates a multi-head local self-attention mechanism with patch shifting procedure to provide both local and global context for effective extraction of multi-scale and multi-resolution spatial features of remote sensing images. The proposed model is also enhanced by fine-tuning the hyper-parameters by introducing dropout modules and a decay linear learning rate scheduler. This approach leverages local self-attention for learning and extraction of the complex features in satellite images. Four distinct remote sensing image datasets, namely RSSCN, EuroSat, UC Merced (UCM) and SIRI-WHU, were subjected to experiments and analysis. The results show some improvement in the proposed vision transformer on the CNN-based methods.
引用
收藏
页码:13018 / 13037
页数:19
相关论文
共 50 条
  • [31] Deep Learning for Remote Sensed Target Classification in Maritime Satellite Radar Images
    Traore, Abdarahmane
    Jensen, Jeremy
    Akhloufi, Moulay A.
    OCEAN SENSING AND MONITORING XI, 2019, 11014
  • [32] Scene Classification of Optical High-resolution Remote Sensing Images Using Vision Transformer and Graph Convolutional Network
    Wang Jianan
    Gao Yue
    Shi Jun
    Liu Ziqi
    ACTA PHOTONICA SINICA, 2021, 50 (11)
  • [33] Classification Method of High-Resolution Remote Sensing Scene Image Based on Dictionary Learning and Vision Transformer
    He Xiaojun
    Liu Xuan
    Wei Xian
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (14)
  • [34] MITformer: A Multiinstance Vision Transformer for Remote Sensing Scene Classification
    Sha, Zongyao
    Li, Jianfeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [35] Detecting Wheat Heads from UAV Low-Altitude Remote Sensing Images Using Deep Learning Based on Transformer
    Zhu, Jiangpeng
    Yang, Guofeng
    Feng, Xuping
    Li, Xiyao
    Fang, Hui
    Zhang, Jinnuo
    Bai, Xiulin
    Tao, Mingzhu
    He, Yong
    REMOTE SENSING, 2022, 14 (20)
  • [36] Aircraft recognition in remote sensing images based on deep learning
    Lin, Jiaxi
    Li, Xinde
    Pan, Hong
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 895 - 899
  • [37] Retrieval of remote sensing images based on semisupervised deep learning
    Zhang H.
    Liu X.
    Yang S.
    Li Y.
    Li, Yu (liyu@radi.ac.cn), 1600, Science Press (21): : 406 - 414
  • [38] Remote Sensing Images Target Detection Based on Deep Learning
    Zhang, Yuan
    Zhao, Lingran
    Jia, Linjing
    Zhang, Yuhao
    Qu, Hongquan
    2021 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, INFORMATION AND COMMUNICATION ENGINEERING, 2021, 11933
  • [39] Classification of tobacco using remote sensing and deep learning techniques
    Qazi, Umama Khalid
    Ahmad, Iftikhar
    Minallah, Nasru
    Zeeshan, Muhammad
    AGRONOMY JOURNAL, 2024, 116 (03) : 839 - 847
  • [40] A Review on Image Classification of Remote Sensing Using Deep Learning
    Yao, Chuchu
    Luo, Xianxian
    Zhao, Yudan
    Zeng, Wei
    Chen, Xiaoyu
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1947 - 1955