UAVid: A semantic segmentation dataset for UAV imagery

被引:149
|
作者
Lyu, Ye [1 ]
Vosselman, George [1 ]
Xia, Gui-Song [2 ]
Yilmaz, Alper [3 ]
Yang, Michael Ying [1 ]
机构
[1] Univ Twente, Fac Geoinformat Sci & Earth Observat ITC, Enschede, Netherlands
[2] Wuhan Univ, Sch Comp Sci, State Key Lab LIESMARS, Wuhan, Peoples R China
[3] Ohio State Univ, Dept Civil Environm & Geodet Engn, Columbus, OH 43210 USA
基金
中国国家自然科学基金;
关键词
UAV; Semantic segmentation; Deep learning; Dataset; LIDAR;
D O I
10.1016/j.isprsjprs.2020.05.009
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Semantic segmentation has been one of the leading research interests in computer vision recently. It serves as a perception foundation for many fields, such as robotics and autonomous driving. The fast development of semantic segmentation attributes enormously to the large scale datasets, especially for the deep learning related methods. There already exist several semantic segmentation datasets for comparison among semantic segmentation methods in complex urban scenes, such as the Cityscapes and CamVid datasets, where the side views of the objects are captured with a camera mounted on the driving car. There also exist semantic labeling datasets for the airborne images and the satellite images, where the nadir views of the objects are captured. However, only a few datasets capture urban scenes from an oblique Unmanned Aerial Vehicle (UAV) perspective, where both of the top view and the side view of the objects can be observed, providing more information for object recognition. In this paper, we introduce our UAVid dataset, a new high-resolution UAV semantic segmentation dataset as a complement, which brings new challenges, including large scale variation, moving object recognition and temporal consistency preservation. Our UAV dataset consists of 30 video sequences capturing high-resolution images in oblique views. In total, 300 images have been densely labeled with 8 classes for the semantic labeling task. We have provided several deep learning baseline methods with pre-training, among which the proposed Multi-Scale-Dilation net performs the best via multi-scale feature extraction, reaching a mean intersection-over-union (IoU) score around 50%. We have also explored the influence of spatial-temporal regularization for sequence data by leveraging on feature space optimization (FSO) and 3D conditional random field (CRF). Our UAVid website and the labeling tool have been published online (https://uavid.nl/).
引用
收藏
页码:108 / 119
页数:12
相关论文
共 50 条
  • [1] Semantic Segmentation of Underwater Imagery: Dataset and Benchmark
    Islam, Md Jahidul
    Edge, Chelsey
    Xiao, Yuyang
    Luo, Peigen
    Mehtaz, Muntaqim
    Morse, Christopher
    Enan, Sadman Sakib
    Sattar, Junaed
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 1769 - 1776
  • [2] An instance segmentation dataset of cabbages over the whole growing season for UAV imagery
    Yokoyama, Yui
    Matsui, Tsutomu
    Tanaka, Takashi S. T.
    [J]. DATA IN BRIEF, 2024, 55
  • [3] WTA/TLA: A UAV-captured Dataset for Semantic Segmentation of Energy Infrastructure
    Zampokas, Georgios
    Skartados, Evangelos
    Alexiou, Dimitrios
    Tsiakas, Kosmas
    Tzanakis, Ioannis
    Roussos, Nikolaos
    Giakoumis, Dimitrios
    Kostavelis, Ioannis
    Bouganis, Christos-Savvas
    Tzovaras, Dimitrios
    [J]. 2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, : 552 - 561
  • [4] Crop classification for UAV visible imagery using deep semantic segmentation methods
    Zhang, Shiqi
    Dai, Xiaoai
    Li, Jingzhong
    Gao, Xiaojie
    Zhang, Fuxi
    Gong, Fanxi
    Lu, Heng
    Wang, Meilian
    Ji, Fujiang
    Wang, Zekun
    Peng, Peihao
    [J]. GEOCARTO INTERNATIONAL, 2022, 37 (25) : 10033 - 10057
  • [5] Semantic Segmentation of UAV image using Combined U-net and heterogeneous UAV imagery datasets
    Song, A.
    [J]. REMOTE SENSING TECHNOLOGIES AND APPLICATIONS IN URBAN ENVIRONMENTS VII, 2022, 12269
  • [6] RescueNet: A High Resolution UAV Semantic Segmentation Dataset for Natural Disaster Damage Assessment
    Maryam Rahnemoonfar
    Tashnim Chowdhury
    Robin Murphy
    [J]. Scientific Data, 10
  • [7] RescueNet: A High Resolution UAV Semantic Segmentation Dataset for Natural Disaster Damage Assessment
    Rahnemoonfar, Maryam
    Chowdhury, Tashnim
    Murphy, Robin
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [8] Comprehensive Semantic Segmentation on High Resolution UAV Imagery for Natural Disaster Damage Assessment
    Chowdhury, Tashnim
    Rahnemoonfar, Maryam
    Murphy, Robin
    Fernandes, Odair
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3904 - 3913
  • [9] Memory-Constrained Semantic Segmentation for Ultra-High Resolution UAV Imagery
    Li, Qi
    Cai, Jiaxin
    Luo, Jiexin
    Yu, Yuanlong
    Gu, Jason
    Pan, Jia
    Liu, Wenxi
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1708 - 1715
  • [10] A pothole video dataset for semantic segmentation
    Ihsan, Muhammad
    Amrizal, Muhammad Alfian
    Harjoko, Agus
    [J]. DATA IN BRIEF, 2024, 53