WildScenes: A benchmark for 2D and 3D semantic segmentation in large-scale natural environments

被引:0
|
作者
Vidanapathirana, Kavisha [1 ,2 ]
Knights, Joshua [1 ,2 ]
Hausler, Stephen [1 ]
Cox, Mark [1 ]
Ramezani, Milad [1 ]
Jooste, Jason [1 ]
Griffiths, Ethan [1 ,2 ]
Mohamed, Shaheer [1 ,2 ]
Sridharan, Sridha [2 ]
Fookes, Clinton [2 ]
Moghadam, Peyman [1 ,2 ]
机构
[1] CSIRO, CSIRO Robot, Data61, 1 Technology Ct, Pullenvale, Qld 4069, Australia
[2] Queensland Univ Technol, Brisbane, Qld, Australia
关键词
Semantic scene understanding; performance evaluation and benchmarking; data sets for robotic vision; data sets for robot learning; DATASET;
D O I
10.1177/02783649241278369
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Recent progress in semantic scene understanding has primarily been enabled by the availability of semantically annotated bi-modal (camera and LiDAR) datasets in urban environments. However, such annotated datasets are also needed for natural, unstructured environments to enable semantic perception for applications, including conservation, search and rescue, environment monitoring, and agricultural automation. Therefore, we introduce WildScenes, a bi-modal benchmark dataset consisting of multiple large-scale, sequential traversals in natural environments, including semantic annotations in high-resolution 2D images and dense 3D LiDAR point clouds, and accurate 6-DoF pose information. The data is (1) trajectory-centric with accurate localization and globally aligned point clouds, (2) calibrated and synchronized to support bi-modal training and inference, and (3) containing different natural environments over 6 months to support research on domain adaptation. Our 3D semantic labels are obtained via an efficient, automated process that transfers the human-annotated 2D labels from multiple views into 3D point cloud sequences, thus circumventing the need for expensive and time-consuming human annotation in 3D. We introduce benchmarks on 2D and 3D semantic segmentation and evaluate a variety of recent deep-learning techniques to demonstrate the challenges in semantic segmentation in natural environments. We propose train-val-test splits for standard benchmarks as well as domain adaptation benchmarks and utilize an automated split generation technique to ensure the balance of class label distributions. The WildScenes benchmark webpage is https://csiro-robotics.github.io/WildScenes, and the data is publicly available at https://data.csiro.au/collection/csiro:61541.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans
    Dai, Angela
    Ritchie, Daniel
    Bokeloh, Martin
    Reed, Scott
    Sturm, Juergen
    Niessner, Matthias
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4578 - 4587
  • [2] CONSEL: Connectivity-based Segmentation in Large-Scale 2D/3D Sensor Networks
    Jiang, Hongbo
    Yu, Tianlong
    Tian, Chen
    Tan, Guang
    Wang, Chonggang
    [J]. 2012 PROCEEDINGS IEEE INFOCOM, 2012, : 2086 - 2094
  • [3] MedMNIST v2-A large-scale lightweight benchmark for 2D and 3D biomedical image classification
    Yang, Jiancheng
    Shi, Rui
    Wei, Donglai
    Liu, Zequan
    Zhao, Lin
    Ke, Bilian
    Pfister, Hanspeter
    Ni, Bingbing
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [4] MedMNIST v2 - A large-scale lightweight benchmark for 2D and 3D biomedical image classification
    Jiancheng Yang
    Rui Shi
    Donglai Wei
    Zequan Liu
    Lin Zhao
    Bilian Ke
    Hanspeter Pfister
    Bingbing Ni
    [J]. Scientific Data, 10
  • [5] Joint 2D and 3D Semantic Segmentation with Consistent Instance Semantic
    Wan, Yingcai
    Fang, Lijin
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2024, E107A (08) : 1309 - 1318
  • [6] A 3D Semantic Segmentation Method for Large-Scale Point Cloud on Deep Learning
    Liu, Sihan
    Zhang, Wenyu
    Zhang, Yujun
    Wang, Zhijian
    Gao, Dongxiang
    [J]. ENGINEERING LETTERS, 2023, 31 (04) : 1667 - 1674
  • [7] Transition to turbulence over 2D and 3D periodic large-scale roughnesses
    Hamed, A. M.
    Sadowski, M.
    Zhang, Z.
    Chamorro, L. P.
    [J]. JOURNAL OF FLUID MECHANICS, 2016, 804 : R6
  • [8] Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation
    Robert, Damien
    Vallet, Bruno
    Landrieu, Loic
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5565 - 5574
  • [9] SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds
    Hu, Qingyong
    Yang, Bo
    Fang, Guangchi
    Guo, Yulan
    Leonardis, Ales
    Trigoni, Niki
    Markham, Andrew
    [J]. COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 600 - 619
  • [10] Mining local geometric structure for large-scale 3D point clouds semantic segmentation
    Shao, Yuyuan
    Tong, Guofeng
    Peng, Hao
    [J]. NEUROCOMPUTING, 2022, 500 : 191 - 202