Zenseact Open Dataset: A large-scale and diverse multimodal dataset for autonomous driving

Authors
Alibeigi, Mina [1]
Ljungbergh, William [1]
Tonderski, Adam [1]
Hess, Georg [1]
Lilja, Adam [1]
Lindstrom, Carl [1]
Motorniuk, Daria [1]
Fu, Junsheng [1]
Widahl, Jenny [1]
Petersson, Christoffer [1]
Affiliations
[1] Zenseact, Gothenburg, Sweden
DOI
10.1109/ICCV51070.2023.01846
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing datasets for autonomous driving (AD) often lack diversity and long-range capabilities, focusing instead on 360° perception and temporal reasoning. To address this gap, we introduce Zenseact Open Dataset (ZOD), a large-scale and diverse multimodal dataset collected over two years in various European countries, covering an area 9x that of existing datasets. ZOD boasts the highest range and resolution sensors among comparable datasets, coupled with detailed keyframe annotations for 2D and 3D objects (up to 245m), road instance/semantic segmentation, traffic sign recognition, and road classification. We believe that this unique combination will facilitate breakthroughs in long-range perception and multi-task learning. The dataset is composed of Frames, Sequences, and Drives, designed to encompass both data diversity and support for spatio-temporal learning, sensor fusion, localization, and mapping. Frames consist of 100k curated camera images with two seconds of other supporting sensor data, while the 1473 Sequences and 29 Drives include the entire sensor suite for 20 seconds and a few minutes, respectively. ZOD is the only large-scale AD dataset released under a permissive license, allowing for both research and commercial use. More information, and an extensive devkit, can be found at zod.zenseact.com.
Pages: 20121-20131
Page count: 11