SynDrone - Multi-modal UAV Dataset for Urban Scenarios

Cited by: 2
Authors
Rizzoli, Giulia [1]
Barbato, Francesco [1]
Caligiuri, Matteo [1]
Zanuttigh, Pietro [1]
Affiliation
[1] Univ Padua, Dept Informat Engn, Via Gradenigo 6 b, Padua, Italy
Keywords
SEMANTIC SEGMENTATION; NETWORK;
DOI
10.1109/ICCVW60793.2023.00235
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The development of computer vision algorithms for Unmanned Aerial Vehicle (UAV) imagery relies heavily on the availability of annotated high-resolution aerial data. However, large-scale real datasets with pixel-level annotations are scarce, and the limited number of images in existing datasets hinders deep learning models, which require large amounts of training data. In this paper, we propose a multimodal synthetic dataset containing both images and 3D data taken at multiple flying heights to address these limitations. In addition to object-level annotations, the provided data also include pixel-level labeling in 28 classes, enabling exploration of the potential advantages in tasks like semantic segmentation. In total, our dataset contains 72k labeled samples, which allow for effective training of deep architectures and show promising results in synthetic-to-real adaptation. The dataset will be made publicly available to support the development of novel computer vision methods targeting UAV applications.
Pages: 2202-2212
Number of pages: 11