Hybrid Vision Transformers and CNNs for Enhanced Transmission Line Segmentation in Aerial Images

被引:0
|
作者
Nguyen, Hoanh [1 ]
Nguyen, Tuan Anh [1 ]
机构
[1] Ind Univ Ho Chi Minh City, Fac Elect Engn Technol, Ho Chi Minh City, Vietnam
关键词
Vision transformers; convolutional neural networks; transmission lines segmentation; hybrid model; feature fusion;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a novel architecture for the segmentation of transmission lines in aerial images, utilizing a hybrid model that combines the strengths of Vision Transformers (ViTs) and Convolutional Neural Networks (CNNs). The proposed method first employs a Swin Transformer backbone (Swin-B) that processes the input image through a hierarchical structure, effectively capturing multi-scale contextual information. Following this, an upsampling strategy is employed, wherein the features extracted by the transformer are refined through convolutional layers, ensuring that the resolution is maintained, and spatial details are recovered. To integrate multi-level feature maps, a feature fusion module with a squeeze-and-excitation (SE) layer is introduced, which consolidates the benefits of both high-level and low-level feature extractions. The SE layer plays a pivotal role in augmenting the feature channels, focusing the model's attention on the most informative features for transmission line detection. By leveraging the global receptive field of ViTs for comprehensive context and the local precision of CNNs for fine-grained detail, our method aims to set a new benchmark for transmission line segmentation in aerial imagery. The effectiveness of our approach is demonstrated through extensive experiments and comparisons with existing state-of-theart methods.
引用
收藏
页码:434 / 442
页数:9
相关论文
共 50 条
  • [1] SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS
    Marmanis, D.
    Wegner, J. D.
    Galliani, S.
    Schindler, K.
    Datcu, M.
    Stilla, U.
    [J]. XXIII ISPRS CONGRESS, COMMISSION III, 2016, 3 (03): : 473 - 480
  • [2] Vision Transformers for Lung Segmentation on CXR Images
    Ghali R.
    Akhloufi M.A.
    [J]. SN Computer Science, 4 (4)
  • [3] Enhancing Computer Vision Performance: A Hybrid Deep Learning Approach with CNNs and Vision Transformers
    Sardar, Abha Singh
    Ranjan, Vivek
    [J]. COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 591 - 602
  • [4] Distilling Segmenters From CNNs and Transformers for Remote Sensing Images' Semantic Segmentation
    Dong, Zhe
    Gao, Guoming
    Liu, Tianzhu
    Gu, Yanfeng
    Zhang, Xiangrong
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Vision transformers for segmentation of disc and cup in retinal fundus images
    Bazi, Yakoub
    Al Rahhal, Mohamad Mahmoud
    Elgibreen, Hebah
    Zuair, Mansour
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 91
  • [6] Transmission line detection in aerial images: An instance segmentation approach based on multitask neural networks
    Li, Bo
    Chen, Cheng
    Dong, Shiwen
    Qiao, Junfeng
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 96 (96)
  • [7] Efficient Hybrid Supervision for Instance Segmentation in Aerial Images
    Chen, Linwei
    Fu, Ying
    You, Shaodi
    Liu, Hongzhe
    [J]. REMOTE SENSING, 2021, 13 (02) : 1 - 22
  • [8] Comparison of CNNs and Vision Transformers-Based Hybrid Models Using Gradient Profile Loss for Classification of Oil Spills in SAR Images
    Basit, Abdul
    Siddique, Muhammad Adnan
    Bhatti, Muhammad Khurram
    Sarfraz, Muhammad Saquib
    [J]. REMOTE SENSING, 2022, 14 (09)
  • [9] Hybrid Multiple Attention Network for Semantic Segmentation in Aerial Images
    Niu, Ruigang
    Sun, Xian
    Tian, Yu
    Diao, Wenhui
    Chen, Kaiqiang
    Fu, Kun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] Parameter instance learning with enhanced vision transformers for aerial person re-identification
    Peng, Houfu
    Lu, Xing
    Xu, Lili
    Xia, Daoxun
    Xie, Xiaoyao
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (12):