Deep Learning Approach for Multi-class Semantic Segmentation of UAV Images

被引:0
|
作者
Chouhan, Avinash [1 ]
Chutia, Dibyajyoti [1 ]
Aggarwal, Shiv Prasad [1 ]
机构
[1] North Eastern Space Applicat Ctr, Umiam 793103, Meghalaya, India
关键词
Semantic segmentation; deep learning; UAV; FEATURES;
D O I
10.1142/S0218213023500331
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image understanding plays a very crucial role in remote sensing applications. For this, image semantic segmentation is one of the approaches where each pixel of an image is assigned to particular classes based on various features. Aerial semantic segmentation suffers from the class imbalance problem. Proper differentiation of least represented categories is challenging and a goal for the state-of-art approach. In this work, we present a novel deep learning method to perform this task. We proposed a lightweight encoder-decoder network residual depth separable UNet (RDS-UNet) and conditional random field for effective segmentation on very high-resolution aerial images. We proposed patch-with-multi-class sampling to handle the class imbalance problem without increasing the computational overhead during the training process. We created a semi-precise annotated UAV dataset named NESAC UAV Seg for the aerial semantic segmentation task. We demonstrated the efficacy of our model using the publicly available benchmark Drone Deploy dataset and our NESAC UAV Seg dataset. Our model required approximately half the number of trainable parameters and floating point operations compared to other methods. A detailed ablation study is presented to showcase the effectiveness of various modules utilized in our network.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Segmentation-based multi-class semantic object detection
    Vieux, Remi
    Benois-Pineau, Jenny
    Domenger, Jean-Philippe
    Braquelaire, Achille
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
  • [32] Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Ouyang, Wanli
    Bennamoun, Mohammed
    Boussaid, Farid
    Xu, Dan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4300 - 4309
  • [33] Segmentation-based multi-class semantic object detection
    Remi Vieux
    Jenny Benois-Pineau
    Jean-Philippe Domenger
    Achille Braquelaire
    Multimedia Tools and Applications, 2012, 60 : 305 - 326
  • [34] Efficient semantic image segmentation with multi-class ranking prior
    Pei, Deli
    Li, Zhenguo
    Ji, Rongrong
    Sun, Fuchun
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 120 : 81 - 90
  • [35] Multi-class semantic segmentation for identification of silicate island defects
    Ramachandran, Vishwath
    Elias, Susan
    Narayanan, Badri
    Thilagam, Ayyappan Uma Chandra
    Sridharann, Niyanth
    WELDING INTERNATIONAL, 2023, 37 (01) : 12 - 20
  • [36] Deep Learning-Based Multi-Class Segmentation of the Paranasal Sinuses of Sinusitis Patients Based on Computed Tomographic Images
    Whangbo, Jongwook
    Lee, Juhui
    Kim, Young Jae
    Kim, Seon Tae
    Kim, Kwang Gi
    SENSORS, 2024, 24 (06)
  • [37] Binary vs. Multi-Class Segmentation for Off-angle Iris Images using Deep Learning Frameworks
    Ghandour, Imad El Ddine
    Karakaya, Mahmut
    MULTIMODAL IMAGE EXPLOITATION AND LEARNING 2022, 2022, 12100
  • [38] Research progress on deep learning methods for object detection and semantic segmentation in UAV aerial images
    Luo X.
    Wu Y.
    Chen J.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (06):
  • [39] A deep learning based approach for semantic segmentation of small fires from UAV imagery
    Saxena, Vishu
    Jain, Yash
    Mittal, Sparsh
    REMOTE SENSING LETTERS, 2025, 16 (03) : 277 - 289
  • [40] SketchSegNet plus : An End-to-End Learning of RNN for Multi-Class Sketch Semantic Segmentation
    Qi, Yonggang
    Tan, Zheng-Hua
    IEEE ACCESS, 2019, 7 : 102717 - 102726