Synthetic Data Generator for Classification Rules Learning

被引:0
|
作者
Liu, Runzong [1 ]
Fang, Bin [1 ]
Tang, Yuan Yan [2 ]
Chan, Patrick P. K. [3 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Macau, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Guangdong, Peoples R China
关键词
Synthetic data; Automatic decision support; Data mining; Decision tree; DECISION TREE;
D O I
10.1109/CCBD.2016.78
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A standard data set is useful to empirically evaluate classification rules learning algorithms. However, there is still no standard data set which is common enough for various situations. Data sets from the real world are limited to specific applications. The sizes of attributes, the rules and samples of the real data are fixed. A data generator is proposed here to produce synthetic data set which can be as big as the experiments demand. The size of attributes, rules, and samples of the synthetic data sets can be easily changed to meet the demands of evaluation on different learning algorithms. In the generator, related attributes are created at first. And then, rules are created based on the attributes. Samples are produced following the rules. Three decision tree algorithms are evaluated used synthetic data sets produced by the proposed data generator.
引用
收藏
页码:357 / 361
页数:5
相关论文
共 50 条
  • [31] From Data to Classification Rules and Actions
    Ras, Zbigniew W.
    Dardzinska, Agnieszka
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2011, 26 (06) : 572 - 590
  • [32] Evaluation of Synthetic Video Data in Machine Learning Approaches for Parking Space Classification
    Horn, Daniela
    Houben, Sebastian
    [J]. 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 2157 - 2162
  • [33] Synthetic data augmentation for surface defect detection and classification using deep learning
    Jain, Saksham
    Seth, Gautam
    Paruthi, Arpit
    Soni, Umang
    Kumar, Girish
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (04) : 1007 - 1020
  • [34] Synthetic Data Generation for Steel Defect Detection and Classification Using Deep Learning
    Boikov, Aleksei
    Payor, Vladimir
    Savelev, Roman
    Kolesnikov, Alexandr
    [J]. SYMMETRY-BASEL, 2021, 13 (07):
  • [35] Synthetic Aperture Radar image classification based on constrictive learning with limited data
    Zhu, Wenbin
    Gu, Hong
    Zhu, Xiaochun
    [J]. IET RADAR SONAR AND NAVIGATION, 2022, 16 (09): : 1530 - 1537
  • [36] Emergency Shutdown Valve damage classification by machine learning using synthetic data
    de Gouveia, S. M.
    Correa, L. de Abreu
    Teles, D. B.
    Oliveira, M.
    Clarke, T. G. R.
    [J]. ENGINEERING FAILURE ANALYSIS, 2024, 156
  • [37] Synthetic data augmentation for surface defect detection and classification using deep learning
    Saksham Jain
    Gautam Seth
    Arpit Paruthi
    Umang Soni
    Girish Kumar
    [J]. Journal of Intelligent Manufacturing, 2022, 33 : 1007 - 1020
  • [38] Combining rough sets and data-driven fuzzy learning for generation of classification rules
    Shen, Q
    Chouchoulas, A
    [J]. PATTERN RECOGNITION, 1999, 32 (12) : 2073 - 2076
  • [39] XyGen: Synthetic data generator for feature selection
    Kamalov, Firuz
    Elnaffar, Said
    Sulieman, Hana
    Cherukuri, Aswani Kumar
    [J]. SOFTWARE IMPACTS, 2023, 15
  • [40] SynTiSeD - Synthetic Time Series Data Generator
    Meiser, Michael
    Duppe, Benjamin
    Zinnikus, Ingo
    [J]. 2023 11TH WORKSHOP ON MODELLING AND SIMULATION OF CYBER-PHYSICAL ENERGY SYSTEMS, MSCPES, 2023,