UAVid: A semantic segmentation dataset for UAV imagery

被引:149
|
作者
Lyu, Ye [1 ]
Vosselman, George [1 ]
Xia, Gui-Song [2 ]
Yilmaz, Alper [3 ]
Yang, Michael Ying [1 ]
机构
[1] Univ Twente, Fac Geoinformat Sci & Earth Observat ITC, Enschede, Netherlands
[2] Wuhan Univ, Sch Comp Sci, State Key Lab LIESMARS, Wuhan, Peoples R China
[3] Ohio State Univ, Dept Civil Environm & Geodet Engn, Columbus, OH 43210 USA
基金
中国国家自然科学基金;
关键词
UAV; Semantic segmentation; Deep learning; Dataset; LIDAR;
D O I
10.1016/j.isprsjprs.2020.05.009
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Semantic segmentation has been one of the leading research interests in computer vision recently. It serves as a perception foundation for many fields, such as robotics and autonomous driving. The fast development of semantic segmentation attributes enormously to the large scale datasets, especially for the deep learning related methods. There already exist several semantic segmentation datasets for comparison among semantic segmentation methods in complex urban scenes, such as the Cityscapes and CamVid datasets, where the side views of the objects are captured with a camera mounted on the driving car. There also exist semantic labeling datasets for the airborne images and the satellite images, where the nadir views of the objects are captured. However, only a few datasets capture urban scenes from an oblique Unmanned Aerial Vehicle (UAV) perspective, where both of the top view and the side view of the objects can be observed, providing more information for object recognition. In this paper, we introduce our UAVid dataset, a new high-resolution UAV semantic segmentation dataset as a complement, which brings new challenges, including large scale variation, moving object recognition and temporal consistency preservation. Our UAV dataset consists of 30 video sequences capturing high-resolution images in oblique views. In total, 300 images have been densely labeled with 8 classes for the semantic labeling task. We have provided several deep learning baseline methods with pre-training, among which the proposed Multi-Scale-Dilation net performs the best via multi-scale feature extraction, reaching a mean intersection-over-union (IoU) score around 50%. We have also explored the influence of spatial-temporal regularization for sequence data by leveraging on feature space optimization (FSO) and 3D conditional random field (CRF). Our UAVid website and the labeling tool have been published online (https://uavid.nl/).
引用
收藏
页码:108 / 119
页数:12
相关论文
共 50 条
  • [21] A Lightweight CNN-Transformer Network With Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation
    Lu, Wen
    Zhang, Zhiqi
    Nguyen, Minh
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 20
  • [22] Consumer-grade UAV imagery facilitates semantic segmentation of species-rich savanna tree layers
    Popp, Manuel R.
    Kalwij, Jesse M.
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01):
  • [23] Synthetic Data for Semantic Segmentation in Underwater Imagery
    Pergeorelis, Michael
    Bazik, Maxim
    Saponaro, Philip
    Kim, Joong
    Kambhamettu, Chandra
    [J]. 2022 OCEANS HAMPTON ROADS, 2022,
  • [24] HSI-TransUNet: A transformer based semantic segmentation model for crop mapping from UAV hyperspectral imagery
    Niu, Bowen
    Feng, Quanlong
    Chen, Boan
    Ou, Cong
    Liu, Yiming
    Yang, Jianyu
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 201
  • [25] Consumer-grade UAV imagery facilitates semantic segmentation of species-rich savanna tree layers
    Manuel R. Popp
    Jesse M. Kalwij
    [J]. Scientific Reports, 13 (1)
  • [26] Semantic segmentation of citrus-orchard using deep neural networks and multispectral UAV-based imagery
    Lucas Prado Osco
    Keiller Nogueira
    Ana Paula Marques Ramos
    Mayara Maezano Faita Pinheiro
    Danielle Elis Garcia Furuya
    Wesley Nunes Gonçalves
    Lucio André de Castro Jorge
    José Marcato Junior
    Jefersson Alex dos Santos
    [J]. Precision Agriculture, 2021, 22 : 1171 - 1188
  • [27] Semantic Segmentation of Objects from Airborne Imagery
    Thuy Thi Nguyen
    Sang Viet Dinh
    Nguyen Tien Quang
    Huynh Thi Thanh Binh
    [J]. 2017 FOURTH ASIAN CONFERENCE ON DEFENCE TECHNOLOGY - JAPAN (ACDT), 2017, : 140 - 145
  • [28] SHIPNET FOR SEMANTIC SEGMENTATION ON VHR MARITIME IMAGERY
    Sun, Shihao
    Lu, Zexin
    Liu, Wenjie
    Hu, Wei
    Li, Ruirui
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 6911 - 6914
  • [29] YUTO SEMANTIC: A LARGE SCALE AERIAL LIDAR DATASET FOR SEMANTIC SEGMENTATION
    Yoo, S.
    Ko, C.
    Sohn, G.
    Lee, H.
    [J]. GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 209 - 215
  • [30] Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline
    Ji, Wei
    Li, Jingjing
    Bian, Cheng
    Zhou, Zongwei
    Zhao, Jiaying
    Yuille, Alan
    Cheng, Li
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 1094 - 1104