Wildfire Segmentation Using Deep Vision Transformers

被引:56
|
作者
Ghali, Rafik [1 ,2 ]
Akhloufi, Moulay A. [1 ]
Jmal, Marwa [3 ]
Mseddi, Wided Souidene [2 ]
Attia, Rabah [2 ]
机构
[1] Univ Moncton, Percept Robot & Intelligent Machines Res Grp PRIM, Dept Comp Sci, 18 Antonine Maillet Ave, Moncton, NB E1A 3E9, Canada
[2] Univ Carthage, Ecole Polytech Tunisie, SERCOM Lab, La Marsa 77-1054, Carthage, Tunisia
[3] Telnet Holding, Telnet Innovat Labs, Parc Elghazela Technol Commun, Ariana 2088, Tunisia
基金
加拿大自然科学与工程研究理事会;
关键词
forest fire detection; fire segmentation; vision Transformer; TransUNet; MedT; wildfires; FIRE-DETECTION; BURNED AREA; IMAGE; ALGORITHM; SEQUENCES; SEVERITY; NETWORK; COLOR;
D O I
10.3390/rs13173527
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In this paper, we address the problem of forest fires' early detection and segmentation in order to predict their spread and help with fire fighting. Techniques based on Convolutional Networks are the most used and have proven to be efficient at solving such a problem. However, they remain limited in modeling the long-range relationship between objects in the image, due to the intrinsic locality of convolution operators. In order to overcome this drawback, Transformers, designed for sequence-to-sequence prediction, have emerged as alternative architectures. They have recently been used to determine the global dependencies between input and output sequences using the self-attention mechanism. In this context, we present in this work the very first study, which explores the potential of vision Transformers in the context of forest fire segmentation. Two vision-based Transformers are used, TransUNet and MedT. Thus, we design two frameworks based on the former image Transformers adapted to our complex, non-structured environment, which we evaluate using varying backbones and we optimize for forest fires' segmentation. Extensive evaluations of both frameworks revealed a performance superior to current methods. The proposed approaches achieved a state-of-the-art performance with an F1-score of 97.7% for TransUNet architecture and 96.0% for MedT architecture. The analysis of the results showed that these models reduce fire pixels mis-classifications thanks to the extraction of both global and local features, which provide finer detection of the fire's shape.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Semantic segmentation using Vision Transformers: A survey
    Thisanke, Hans
    Deshan, Chamli
    Chamith, Kavindu
    Seneviratne, Sachith
    Vidanaarachchi, Rajith
    Herath, Damayanthi
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [2] Remote Wildfire Detection using Multispectral Satellite Imagery and Vision Transformers
    Rad, Ryan
    [J]. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [3] Wildfire Segmentation using Deep-RegSeg Semantic Segmentation Architecture
    Ghali, Rafik
    Akhloufi, Moulay A.
    Mseddi, Wided Souidene
    Jmal, Marwa
    [J]. 19TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2022, 2022, : 149 - 154
  • [4] Weeds Classification with Deep Learning: An Investigation Using CNN, Vision Transformers, Pyramid Vision Transformers, and Ensemble Strategy
    Rozendo, Guilherme Botazzo
    Roberto, Guilherme Freire
    Zanchetta do Nascimento, Marcelo
    Neves, Leandro Alves
    Lumini, Alessandra
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 229 - 243
  • [5] SegViT: Semantic Segmentation with Plain Vision Transformers
    Zhang, Bowen
    Tian, Zhi
    Tang, Quan
    Chu, Xiangxiang
    Wei, Xiaolin
    Shen, Chunhua
    Liu, Yifan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Vision Transformers for Lung Segmentation on CXR Images
    Ghali R.
    Akhloufi M.A.
    [J]. SN Computer Science, 4 (4)
  • [7] Object Detection Using Deep Learning, CNNs and Vision Transformers: A Review
    Amjoud, Ayoub Benali
    Amrouch, Mustapha
    [J]. IEEE ACCESS, 2023, 11 : 35479 - 35516
  • [8] Wildfire Detection From Multisensor Satellite Imagery Using Deep Semantic Segmentation
    Rashkovetsky, Dmitry
    Mauracher, Florian
    Langer, Martin
    Schmitt, Michael
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 7001 - 7016
  • [9] CellViT: Vision Transformers for precise cell segmentation and classification
    Hoerst, Fabian
    Rempe, Moritz
    Heine, Lukas
    Seibold, Constantin
    Keyl, Julius
    Baldini, Giulia
    Ugurel, Selma
    Siveke, Jens
    Gruenwald, Barbara
    Egger, Jan
    Kleesiek, Jens
    [J]. MEDICAL IMAGE ANALYSIS, 2024, 94
  • [10] Multimodal Semantic Segmentation Based On Improved Vision Transformers
    Qi, Weimin
    Chen, Hangong
    Wang, Zhiming
    Wang, Meng
    [J]. PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 565 - 569