A Vision Transformer Model for Convolution-Free Multilabel Classification of Satellite Imagery in Deforestation Monitoring

被引:65
|
作者
Kaselimi, Maria [1 ]
Voulodimos, Athanasios [2 ]
Daskalopoulos, Ioannis [3 ]
Doulamis, Nikolaos [1 ]
Doulamis, Anastasios [1 ]
机构
[1] Natl Tech Univ Athens, Sch Rural & Surveying Engn, Athens 15773, Greece
[2] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[3] Univ West Attica, Dept Informat & Comp Engn, Athens 15773, Greece
关键词
Forestry; Transformers; Satellites; Remote sensing; Monitoring; Earth; Artificial satellites; Deforestation; multilabel image classification; self-attention; vision transformers;
D O I
10.1109/TNNLS.2022.3144791
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding the dynamics of deforestation and land uses of neighboring areas is of vital importance for the design and development of appropriate forest conservation and management policies. In this article, we approach deforestation as a multilabel classification (MLC) problem in an endeavor to capture the various relevant land uses from satellite images. To this end, we propose a multilabel vision transformer model, ForestViT, which leverages the benefits of the self-attention mechanism, obviating any convolution operations involved in commonly used deep learning models utilized for deforestation detection. Experimental evaluation in open satellite imagery datasets yields promising results in the case of MLC, particularly for imbalanced classes, and indicates ForestViT's superiority compared with well-established convolutional structures (ResNET, VGG, DenseNet, and ModileNet neural networks). This superiority is more evident for minority classes.
引用
收藏
页码:3299 / 3307
页数:9
相关论文
共 50 条
  • [1] Multilabel classification of remote sensed satellite imagery
    Kumar, Ajay
    Abhishek, Kumar
    Singh, Amit Kumar
    Nerurkar, Pranav
    Chandane, Madhav
    Bhirud, Sunil
    Patel, Dhiren
    Busnel, Yann
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (07)
  • [2] Transformer-based Convolution-free Visual Place Recognition
    Urban, Anna
    Kwolek, Bogdan
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 161 - 166
  • [3] CTformer: convolution-free Token2Token dilated vision transformer for low-dose CT denoising
    Wang, Dayang
    Fan, Fenglei
    Wu, Zhan
    Liu, Rui
    Wang, Fei
    Yu, Hengyong
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (06):
  • [4] Convolution-Free Waveform Transformers for Multi-Lead ECG Classification
    Natarajan, Annamalai
    Boverman, Gregory
    Chang, Yale
    Antonescu, Corneliu
    Rubin, Jonathan
    2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
  • [5] TransDeepLab: Convolution-Free Transformer-Based DeepLab v3+for Medical Image Segmentation
    Azad, Reza
    Heidari, Moein
    Shariatnia, Moein
    Aghdam, Ehsan Khodapanah
    Karimijafarbigloo, Sanaz
    Adeli, Ehsan
    Merhof, Dorit
    PREDICTIVE INTELLIGENCE IN MEDICINE (PRIME 2022), 2022, 13564 : 91 - 102
  • [6] Convolution-Vision Transformer for Automatic Lung Sound Classification
    Neto, Jose
    Arrais, Nicksson
    Vinuto, Tiago
    Lucena, Joao
    2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 97 - 102
  • [7] Monitoring deforestation in Jordan using deep semantic segmentation with satellite imagery
    Alzu'bi, Ahmad
    Alsmadi, Lujain
    ECOLOGICAL INFORMATICS, 2022, 70
  • [8] BrownViTNet: Hybrid CNN-Vision Transformer Model for the Classification of Brownfields in Aerial Imagery
    Duerrbeck, Konrad
    Lasker, Asifuzzaman
    Gollapalli, Kiran
    Ghosh, Mridul
    Sk, Md Obaidullah
    Fischer, Roland
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 8189 - 8202
  • [9] EEG-VTTCNet: A loss joint training model based on the vision transformer and the temporal convolution network for EEG-based motor imagery classification
    Shi, Xingbin
    Li, Baojiang
    Wang, Wenlong
    Qin, Yuxin
    Wang, Haiyan
    Wang, Xichao
    NEUROSCIENCE, 2024, 556 : 42 - 51
  • [10] Multi-scale vision transformer classification model with self-supervised learning and dilated convolution
    Xing, Liping
    Jin, Hongmei
    Li, Hong-an
    Li, Zhanli
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103