Wildfire Segmentation Using Deep Vision Transformers

被引：56

作者：

Ghali, Rafik ^{[1
,2
]}

Akhloufi, Moulay A. ^{[1
]}

Jmal, Marwa ^{[3
]}

Mseddi, Wided Souidene ^{[2
]}

Attia, Rabah ^{[2
]}

机构：

[1] Univ Moncton, Percept Robot & Intelligent Machines Res Grp PRIM, Dept Comp Sci, 18 Antonine Maillet Ave, Moncton, NB E1A 3E9, Canada

[2] Univ Carthage, Ecole Polytech Tunisie, SERCOM Lab, La Marsa 77-1054, Carthage, Tunisia

[3] Telnet Holding, Telnet Innovat Labs, Parc Elghazela Technol Commun, Ariana 2088, Tunisia

来源：

REMOTE SENSING | 2021年 / 13卷 / 17期

基金：

加拿大自然科学与工程研究理事会;

关键词：

forest fire detection; fire segmentation; vision Transformer; TransUNet; MedT; wildfires; FIRE-DETECTION; BURNED AREA; IMAGE; ALGORITHM; SEQUENCES; SEVERITY; NETWORK; COLOR;

D O I：

10.3390/rs13173527

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

In this paper, we address the problem of forest fires' early detection and segmentation in order to predict their spread and help with fire fighting. Techniques based on Convolutional Networks are the most used and have proven to be efficient at solving such a problem. However, they remain limited in modeling the long-range relationship between objects in the image, due to the intrinsic locality of convolution operators. In order to overcome this drawback, Transformers, designed for sequence-to-sequence prediction, have emerged as alternative architectures. They have recently been used to determine the global dependencies between input and output sequences using the self-attention mechanism. In this context, we present in this work the very first study, which explores the potential of vision Transformers in the context of forest fire segmentation. Two vision-based Transformers are used, TransUNet and MedT. Thus, we design two frameworks based on the former image Transformers adapted to our complex, non-structured environment, which we evaluate using varying backbones and we optimize for forest fires' segmentation. Extensive evaluations of both frameworks revealed a performance superior to current methods. The proposed approaches achieved a state-of-the-art performance with an F1-score of 97.7% for TransUNet architecture and 96.0% for MedT architecture. The analysis of the results showed that these models reduce fire pixels mis-classifications thanks to the extraction of both global and local features, which provide finer detection of the fire's shape.

引用

下载

页数：24

共 50 条

[1] Semantic segmentation using Vision Transformers: A survey
Thisanke, Hans
Deshan, Chamli
Chamith, Kavindu
Seneviratne, Sachith
Vidanaarachchi, Rajith
Herath, Damayanthi
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[2] Remote Wildfire Detection using Multispectral Satellite Imagery and Vision Transformers
Rad, Ryan
ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
[3] Wildfire Segmentation using Deep-RegSeg Semantic Segmentation Architecture
Ghali, Rafik
Akhloufi, Moulay A.
Mseddi, Wided Souidene
Jmal, Marwa
19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 149 - 154
[4] Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy
Rozendo, Guilherme Botazzo
Roberto, Guilherme Freire
Zanchetta do Nascimento, Marcelo
Neves, Leandro Alves
Lumini, Alessandra
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 229 - 243
[5] SegViT: Semantic Segmentation with Plain Vision Transformers
Zhang, Bowen
Tian, Zhi
Tang, Quan
Chu, Xiangxiang
Wei, Xiaolin
Shen, Chunhua
Liu, Yifan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[6] Vision Transformers for Lung Segmentation on CXR Images
Ghali R.
Akhloufi M.A.
SN Computer Science, 4 (4)
[7] Object Detection Using Deep Learning, CNNs and Vision Transformers: A Review
Amjoud, Ayoub Benali
Amrouch, Mustapha
IEEE ACCESS, 2023, 11 : 35479 - 35516
[8] Wildfire Detection From Multisensor Satellite Imagery Using Deep Semantic Segmentation
Rashkovetsky, Dmitry
Mauracher, Florian
Langer, Martin
Schmitt, Michael
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 7001 - 7016
[9] Self-supervised vision transformers for semantic segmentation
Gu, Xianfan
Hu, Yingdong
Wen, Chuan
Gao, Yang
Computer Vision and Image Understanding, 2025, 251
[10] SimpleClick: Interactive Image Segmentation with Simple Vision Transformers
Liu, Qin
Xu, Zhenlin
Bertasius, Gedas
Niethammer, Marc
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 22233 - 22243

← 1 2 3 4 5 →