SELMA: SEmantic Large-Scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints

被引:9
|
作者
Testolina, Paolo [1 ]
Barbato, Francesco [1 ]
Michieli, Umberto [1 ]
Giordani, Marco [1 ]
Zanuttigh, Pietro [1 ]
Zorzi, Michele [1 ]
机构
[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
Cameras; Sensors; Semantics; Meteorology; Autonomous vehicles; Task analysis; Synthetic data; Synthetic dataset; CARLA; autonomous driving; domain adaptation; semantic segmentation; sensor fusion; UNSUPERVISED DOMAIN ADAPTATION; CHALLENGES; BENCHMARK; NETWORKS;
D O I
10.1109/TITS.2023.3257086
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Accurate scene understanding from multiple sensors mounted on cars is a key requirement for autonomous driving systems. Nowadays, this task is mainly performed through data-hungry deep learning techniques that need very large amounts of data to be trained. Due to the high cost of performing segmentation labeling, many synthetic datasets have been proposed. However, most of them miss the multi-sensor nature of the data, and do not capture the significant changes introduced by the variation of daytime and weather conditions. To fill these gaps, we introduce SELMA, a novel synthetic dataset for semantic segmentation that contains more than 30K unique waypoints acquired from 24 different sensors including RGB, depth, semantic cameras and LiDARs, in 27 different weather and daytime conditions, for a total of more than 20M samples. SELMA is based on CARLA, an open-source simulator for generating synthetic data in autonomous driving scenarios, that we modified to increase the variability and the diversity in the scenes and class sets, and to align it with other benchmark datasets. As shown by the experimental evaluation, SELMA allows the efficient training of standard and multi-modal deep learning architectures, and achieves remarkable results on real-world data. SELMA is free and publicly available, thus supporting open science and research.
引用
收藏
页码:7012 / 7024
页数:13
相关论文
共 50 条
  • [41] Multimodal and Multilingual Embeddings for Large-Scale Speech Mining
    Duquenne, Paul-Ambroise
    Gong, Hongyu
    Schwenk, Holger
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [42] Learning Fused Representations for Large-Scale Multimodal Classification
    Nawaz, Shah
    Calefati, Alessandro
    Janjua, Muhammad Kamran
    Anwaar, Muhammad Umer
    Gallo, Ignazio
    IEEE SENSORS LETTERS, 2019, 3 (01)
  • [43] Large-scale multimodal surface neural interfaces for primates
    Belloir, Tiphaine
    Montalvo-Vargo, Sergio
    Ahmed, Zabir
    Griggs, Devon J.
    Fisher, Shawn
    Brown, Timothy
    Chamanzar, Maysamreza
    Yazdan-Shahmorad, Azadeh
    ISCIENCE, 2023, 26 (01)
  • [44] Market Impacts of Large-Scale Variable Generation
    Hajos, Attila
    IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
  • [45] Variable optical attenuator for large-scale integration
    Garner, SM
    Caracci, S
    IEEE PHOTONICS TECHNOLOGY LETTERS, 2002, 14 (11) : 1560 - 1562
  • [46] Images Don't Lie: Transferring Deep Visual Semantic Features to Large-Scale Multimodal Learning to Rank
    Lynch, Corey
    Aryafar, Kamelia
    Attenberg, Josh
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 541 - 548
  • [47] Lexicon Propagation for Learning a Large-scale Semantic Parser
    Xie, Jiongkun
    Chen, Xiaoping
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 900 - 905
  • [48] Large-scale Semantic Mapping and Reasoning with Heterogeneous Modalities
    Pronobis, Andrzej
    Jensfelt, Patric
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 3515 - 3522
  • [49] Knowledge Representation and Acquisition for Large-Scale Semantic Memory
    Szymanski, Julian
    Duch, Wlodzislaw
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3118 - +
  • [50] Graph theoretic modeling of large-scale semantic networks
    Bales, Michael E.
    Johnson, Stephen B.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2006, 39 (04) : 451 - 464