Fire Detection Approach Based on Vision Transformer

被引:7
|
作者
Khudayberdiev, Otabek [1 ]
Zhang, Jiashu [1 ]
Elkhalil, Ahmed [1 ]
Balde, Lansana [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu, Peoples R China
关键词
Vision transformer; Self-attention; Convolutional neural networks; Fire detection; Image classification; CONVOLUTIONAL NEURAL-NETWORKS; SURVEILLANCE;
D O I
10.1007/978-3-031-06794-5_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering the rapid development of embedding surveillance video systems for fire monitoring, we need to distribute systems with high accuracy and detection speed. Recent progress in vision-based fire detection techniques achieved remarkable success by the powerful ability of deep convolutional neural networks. CNN's have long been the architecture of choice for computer vision tasks. However, current CNN-based methods consider fire classification entire image pixels as equal, ignoring regardless of information. Thus, this can cause a low accuracy rate and delay detection. To increase detection speed and achieve high accuracy, we propose a fire detection approach based on Vision Transformer as a viable alternative to CNN. Different from convolutional networks, transformers operate with images as a sequence of patches, selectively attending to different image parts based on context. In addition, the attention mechanism in the transformer solves the problem with a small flame, thereby provide detection fire in the early stage. Since transformers using global self-attention, which conducts complex computing, we utilize fine-tuned Swin Transformer as our backbone architecture that computes self-attention with local windows. Thus, solving the classification problems with high-resolution images. Experimental results conducted on the image fire dataset demonstrate the promising capability of the model compared to state-of-the-art methods. Specifically, Vision Transformer obtains a classification accuracy of 98.54% on the publicly available dataset.
引用
收藏
页码:41 / 53
页数:13
相关论文
共 50 条
  • [31] Fire Detection using Transformer Network
    Shahid, Mohammad
    Hua, Kai-lung
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 627 - 630
  • [32] Intelligent and vision-based fire detection systems: A survey
    Bu, Fengju
    Gharajeh, Mohammad Samadi
    IMAGE AND VISION COMPUTING, 2019, 91
  • [33] Nighttime fire smoke detection system based on machine vision
    Chao-Ching Ho
    Ming-Chen Chen
    International Journal of Precision Engineering and Manufacturing, 2012, 13 : 1369 - 1376
  • [34] Fire detection based on vision sensor and support vector machines
    Ko, Byoung Chul
    Cheong, Kwang-Ho
    Nam, Jae-Yeal
    FIRE SAFETY JOURNAL, 2009, 44 (03) : 322 - 329
  • [35] Nighttime fire smoke detection system based on machine vision
    Ho, Chao-Ching
    Chen, Ming-Chen
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2012, 13 (08) : 1369 - 1376
  • [36] Computer Vision Based Fire Detection with a Video Alert System
    Sathyakala, G.
    Kirthika, V.
    Aishwarya, B.
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 725 - 727
  • [37] Visual perception enhancement fall detection algorithm based on vision transformer
    Cai, Xi
    Wang, Xiangcheng
    Bao, Kexin
    Chen, Yinuo
    Jiao, Yin
    Han, Guang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [38] Vision Transformer-Based Emotion Detection in HCI for Enhanced Interaction
    Soni, Jayesh
    Prabakar, Nagarajan
    Upadhyay, Himanshu
    INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT I, 2024, 14531 : 76 - 86
  • [39] Vision Transformer Based Multi-class Lesion Detection in IVOCT
    Wang, Zixuan
    Shao, Yifan
    Sun, Jingyi
    Huang, Zhili
    Wang, Su
    Li, Qiyong
    Li, Jinsong
    Yu, Qian
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 327 - 336
  • [40] Detection model of sister chromatid cohesion defects based on Vision Transformer
    Matsumoto, Shinya
    Okubo, Kan
    Abe, Takuya
    Nishikawa, Kiyoshi
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 27 - 31