Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets

被引:5
|
作者
Cortes, Andoni [1 ]
Rodriguez, Clemente [1 ]
Velez, Gorka [2 ]
Barandiaran, Javier [2 ]
Nieto, Marcos [2 ]
机构
[1] Univ Pais Vasco UPV, Comp Architecture & Technol Dept, San Sebastian 20018, Spain
[2] Vicomtech Fdn, Basque Res & Technol Alliance BRTA, Intelligent Transportat Syst & Engn Dept, San Sebastian 20009, Spain
基金
欧盟地平线“2020”;
关键词
Training; Data models; Machine learning; Pipelines; Detectors; Vehicles; Synthetic datasets; deep learning; traffic sign recognition; TRAFFIC SIGN RECOGNITION; DEEP NEURAL-NETWORK;
D O I
10.1109/TITS.2020.3009186
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
A major challenges of deep learning (DL) is the necessity to collect huge amounts of training data. Often, the lack of a sufficiently large dataset discourages the use of DL in certain applications. Typically, acquiring the required amounts of data costs considerable time, material and effort. To mitigate this problem, the use of synthetic images combined with real data is a popular approach, widely adopted in the scientific community to effectively train various detectors. In this study, we examined the potential of synthetic data-based training in the field of intelligent transportation systems. Our focus is on camera-based traffic sign recognition applications for advanced driver assistance systems and autonomous driving. The proposed augmentation pipeline of synthetic datasets includes novel augmentation processes such as structured shadows and gaussian specular highlights. A well-known DL model was trained with different datasets to compare the performance of synthetic and real image-based trained models. Additionally, a new, detailed method to objectively compare these models is proposed. Synthetic images are generated using a semi-supervised errors-guide method which is also described. Our experiments showed that a synthetic image-based approach outperforms in most cases real image-based training when applied to cross-domain test datasets (+10% precision for GTSRB dataset) and consequently, the generalization of the model is improved decreasing the cost of acquiring images.
引用
收藏
页码:190 / 199
页数:10
相关论文
共 50 条
  • [41] Matching Cross-Domain Data with Cooperative Training of Triplet Networks: A Case Study on Underwater Robotics
    De Giacomo, Giovanni G.
    dos Santos, Matheus M.
    Drews-Jr, Paulo L. J.
    Botelho, Silvia S. C.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (03)
  • [42] Classification of Big Velocity Data via Cross-Domain Canonical Correlation Analysis
    Zhang, Bo
    Shi, Zhong-Zhi
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [43] Cross-domain replay spoofing attack detection using domain adversarial training
    Wang, Hongji
    Dinkel, Heinrich
    Wang, Shuai
    Qian, Yanmin
    Yu, Kai
    INTERSPEECH 2019, 2019, : 2938 - 2942
  • [44] Adaptive Training Instance Selection for Cross-Domain Emotion Identification
    Wang, Wenbo
    Chen, Lu
    Chen, Keke
    Thirunarayan, Krishnaprasad
    Sheth, Amit P.
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 525 - 532
  • [45] Cross-Domain Sentiment Analysis: An Empirical Investigation
    Heredia, Brian
    Khoshgoftaar, Taghi M.
    Prusa, Joseph
    Crawford, Michael
    PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, : 160 - 165
  • [46] QUALITY ANALYSIS OF A CROSS-DOMAIN REFERENCE ARCHITECTURE
    Dobrica, Liliana
    Ovaska, Eila
    ICSOFT 2009: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL 1, 2009, : 157 - +
  • [47] Cross-Domain Data Traceability Mechanism Based on Blockchain
    Zhao, Shoucai
    Cao, Lifeng
    Li, Jinhui
    Wan, Jiling
    Bai, Jinlong
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 76 (02): : 2531 - 2549
  • [48] ADDRESSING UNCERTAINTY AND CONFLICTS IN CROSS-DOMAIN DATA PROVENANCE
    Moitra, Abha
    Barnett, Bruce
    Crapo, Andrew
    Dill, Stephen J.
    MILITARY COMMUNICATIONS CONFERENCE, 2010 (MILCOM 2010), 2010, : 912 - 917
  • [49] Identifying intentions in forum posts with cross-domain data
    Tu Minh Phuong
    Le Cong Linh
    Ngo Xuan Bach
    Journal of Heuristics, 2022, 28 : 171 - 192
  • [50] Data Loss Prevention for Cross-Domain Instant Messaging
    Kongsgard, Kyrre Wahl
    Nordbotten, Nils Agne
    Mancini, Federico
    Engelstad, Paal E.
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3565 - 3572