Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets

被引:5
|
作者
Cortes, Andoni [1 ]
Rodriguez, Clemente [1 ]
Velez, Gorka [2 ]
Barandiaran, Javier [2 ]
Nieto, Marcos [2 ]
机构
[1] Univ Pais Vasco UPV, Comp Architecture & Technol Dept, San Sebastian 20018, Spain
[2] Vicomtech Fdn, Basque Res & Technol Alliance BRTA, Intelligent Transportat Syst & Engn Dept, San Sebastian 20009, Spain
基金
欧盟地平线“2020”;
关键词
Training; Data models; Machine learning; Pipelines; Detectors; Vehicles; Synthetic datasets; deep learning; traffic sign recognition; TRAFFIC SIGN RECOGNITION; DEEP NEURAL-NETWORK;
D O I
10.1109/TITS.2020.3009186
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
A major challenges of deep learning (DL) is the necessity to collect huge amounts of training data. Often, the lack of a sufficiently large dataset discourages the use of DL in certain applications. Typically, acquiring the required amounts of data costs considerable time, material and effort. To mitigate this problem, the use of synthetic images combined with real data is a popular approach, widely adopted in the scientific community to effectively train various detectors. In this study, we examined the potential of synthetic data-based training in the field of intelligent transportation systems. Our focus is on camera-based traffic sign recognition applications for advanced driver assistance systems and autonomous driving. The proposed augmentation pipeline of synthetic datasets includes novel augmentation processes such as structured shadows and gaussian specular highlights. A well-known DL model was trained with different datasets to compare the performance of synthetic and real image-based trained models. Additionally, a new, detailed method to objectively compare these models is proposed. Synthetic images are generated using a semi-supervised errors-guide method which is also described. Our experiments showed that a synthetic image-based approach outperforms in most cases real image-based training when applied to cross-domain test datasets (+10% precision for GTSRB dataset) and consequently, the generalization of the model is improved decreasing the cost of acquiring images.
引用
收藏
页码:190 / 199
页数:10
相关论文
共 50 条
  • [1] Mining Cross-Domain Rating Datasets from Structured Data on Twitter
    Dooms, Simon
    De Pessemier, Toon
    Martens, Luc
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 621 - 624
  • [2] Bridge Data: Boosting Generalization of Robotic Skills with Cross-Domain Datasets
    Ebert, Frederik
    Yang, Yanlai
    Schmeckpeper, Karl
    Bucher, Bernadette
    Georgakis, Georgios
    Daniilidis, Kostas
    Finn, Chelsea
    Levine, Sergey
    ROBOTICS: SCIENCE AND SYSTEM XVIII, 2022,
  • [3] DaGzang: a synthetic data generator for cross-domain recommendation services
    Nguyen, Luong Vuong
    Vo, Nam D.
    Jung, Jason J.
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [4] Cross-Domain Knowledge Transfer Using High Dynamic Range Imaging in Synthetic Datasets
    Peleka, Georgia
    Sarafis, Dimitrios
    Mariolis, Ioannis
    Tzovaras, Dimitrios
    CYBERNETICS AND SYSTEMS, 2023, 54 (03) : 372 - 386
  • [5] Cross-Domain Transformation for Outlier Detection on Tabular Datasets
    Herurkar, Dayananda
    Sattarov, Timur
    Hees, Joern
    Palacio, Sebastian
    Raue, Federico
    Dengel, Andreas
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [6] Cross-Domain Data Fusion
    Yang, Qiang
    COMPUTER, 2016, 49 (04) : 18 - 18
  • [7] USING CLASSIFIER DISCREPANCY FOR CROSS-DOMAIN IMAGE RETRIEVAL
    Zhao, Longjiao
    Wang, Yu
    Kato, Jien
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3314 - 3318
  • [8] Explaining Cross-domain Recognition with Interpretable Deep Classifier
    Zhang, Yiheng
    Yao, Ting
    Qiu, Zhaofan
    Mei, Tao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (03)
  • [9] Improving Cross-Domain Brain Tissue Segmentation in Fetal MRI with Synthetic Data
    Zalevskyi, Vladyslav
    Sanchez, Thomas
    Roulet, Margaux
    Verdera, Jordina Aviles
    Hutter, Jana
    Kebiri, Hamza
    Cuadra, Meritxell Bach
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT I, 2024, 15001 : 437 - 447
  • [10] A Cross-domain Data Marketplace for Data Sharing
    Mavrogiorgou, Argyro
    Koukos, Vasileios
    Kouremenou, Eleftheria
    Kiourtis, Athanasios
    Raikos, Alexandros
    Manias, George
    Kyriazis, Dimosthenis
    PROCEEDINGS OF 2022 THE 3RD EUROPEAN SYMPOSIUM ON SOFTWARE ENGINEERING, ESSE 2022, 2022, : 72 - 79