WildScenes: A benchmark for 2D and 3D semantic segmentation in large-scale natural environments

被引：0

作者：

Vidanapathirana, Kavisha ^{[1
,2
]}

Knights, Joshua ^{[1
,2
]}

Hausler, Stephen ^{[1
]}

Cox, Mark ^{[1
]}

Ramezani, Milad ^{[1
]}

Jooste, Jason ^{[1
]}

Griffiths, Ethan ^{[1
,2
]}

Mohamed, Shaheer ^{[1
,2
]}

Sridharan, Sridha ^{[2
]}

Fookes, Clinton ^{[2
]}

Moghadam, Peyman ^{[1
,2
]}

机构：

[1] CSIRO, CSIRO Robot, Data61, 1 Technology Ct, Pullenvale, Qld 4069, Australia

[2] Queensland Univ Technol, Brisbane, Qld, Australia

来源：

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH | 2024年

关键词：

Semantic scene understanding; performance evaluation and benchmarking; data sets for robotic vision; data sets for robot learning; DATASET;

D O I：

10.1177/02783649241278369

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Recent progress in semantic scene understanding has primarily been enabled by the availability of semantically annotated bi-modal (camera and LiDAR) datasets in urban environments. However, such annotated datasets are also needed for natural, unstructured environments to enable semantic perception for applications, including conservation, search and rescue, environment monitoring, and agricultural automation. Therefore, we introduce WildScenes, a bi-modal benchmark dataset consisting of multiple large-scale, sequential traversals in natural environments, including semantic annotations in high-resolution 2D images and dense 3D LiDAR point clouds, and accurate 6-DoF pose information. The data is (1) trajectory-centric with accurate localization and globally aligned point clouds, (2) calibrated and synchronized to support bi-modal training and inference, and (3) containing different natural environments over 6 months to support research on domain adaptation. Our 3D semantic labels are obtained via an efficient, automated process that transfers the human-annotated 2D labels from multiple views into 3D point cloud sequences, thus circumventing the need for expensive and time-consuming human annotation in 3D. We introduce benchmarks on 2D and 3D semantic segmentation and evaluate a variety of recent deep-learning techniques to demonstrate the challenges in semantic segmentation in natural environments. We propose train-val-test splits for standard benchmarks as well as domain adaptation benchmarks and utilize an automated split generation technique to ensure the balance of class label distributions. The WildScenes benchmark webpage is https://csiro-robotics.github.io/WildScenes, and the data is publicly available at https://data.csiro.au/collection/csiro:61541.

引用

页数：18

共 50 条

[41] 2D TO 3D LABEL PROPAGATION FOR THE SEMANTIC SEGMENTATION OF HERITAGE BUILDING POINT CLOUDS
Pellis, E.
Murtiyoso, A.
Masiero, A.
Tucci, G.
Betti, M.
Grussenmeyer, P.
[J]. XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 861 - 867
[42] SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks
Boulch, Alexandre
Guerry, Yids
Le Saux, Bertrand
Audebert, Nicolas
[J]. COMPUTERS & GRAPHICS-UK, 2018, 71 : 189 - 198
[43] A Benchmark for 3D Mesh Segmentation
Chen, Xiaobai
Golovinskiy, Aleksey
Funkhouser, Thomas
[J]. ACM TRANSACTIONS ON GRAPHICS, 2009, 28 (03):
[44] Multimodal interaction for 2D and 3D environments
Cohen, P
McGee, D
Oviatt, S
Wu, LZ
Clow, J
King, R
Julier, S
Rosenblum, L
[J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1999, 19 (04) : 10 - 13
[45] Current Progress and Challenges in Large-Scale 3D Mitochondria Instance Segmentation
Franco-Barranco, Daniel
Lin, Zudi
Jang, Won-Dong
Wang, Xueying
Shen, Qijia
Yin, Wenjie
Fan, Yutian
Li, Mingxing
Chen, Chang
Xiong, Zhiwei
Xin, Rui
Liu, Hao
Chen, Huai
Li, Zhili
Zhao, Jie
Chen, Xuejin
Pape, Constantin
Conrad, Ryan
Nightingale, Luke
de Folter, Joost
Jones, Martin L.
Liu, Yanling
Ziaei, Dorsa
Huschauer, Stephan
Arganda-Carreras, Ignacio
Pfister, Hanspeter
Wei, Donglai
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3956 - 3971
[46] On Prioritization Mechanisms for Large-Scale 3D Streaming in Distributed Virtual Environments
Jia, Jinyuan
Wang, Mingfei
Wang, Wei
Hei, Xiaojun
[J]. 2016 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2016), 2016, : 465 - 472
[47] Large-Scale Supervised Learning For 3D Point Cloud Labeling: Semantic3d.Net
Hackel, Timo
Wegner, Jan D.
Savinov, Nikolay
Ladicky, Lubor
Schindler, Konrad
Pollefeys, Marc
[J]. PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2018, 84 (05): : 297 - 308
[48] Recognition-Driven 3D Navigation in Large-Scale Virtual Environments
Guan, Wei
You, Suya
Neumann, Ulrich
[J]. 2011 IEEE VIRTUAL REALITY CONFERENCE (VR), 2011, : 71 - 74
[49] Large-scale data analysis of bioactivity information in PubChem using 2D and 3D chemical similarity
Bolton, Evan
[J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2011, 242
[50] Efficient and multifidelity terrain modeling for 3D large-scale and unstructured environments
Liu, Xu
Li, Decai
He, Yuqing
Gu, Feng
[J]. JOURNAL OF FIELD ROBOTICS, 2022, 39 (08) : 1286 - 1322

← 1 2 3 4 5 →