DataGen: a generator of datasets for evaluation of classification algorithms

被引:13
|
作者
Rachkovskij, DA [1 ]
Kussul, EM [1 ]
机构
[1] Ukrainian Acad Sci, Cybernet Ctr, UA-252650 Kiev, Ukraine
关键词
benchmarking; evaluation; classification; supervised learning; datasets; data generator; synthetic data;
D O I
10.1016/S0167-8655(98)00053-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dataset generators are useful for the evaluation of an algorithm's performance because they allow control of the characteristics and amount of data used for benchmarking. We propose a dataset generator called DataGen that allows varying the number of input features and output classes, the complexity and realizations of class regions, the distributions of data samples, the noise level, the number of data samples. A C language listing of basic DataCen version is provided. (C) 1998 Published by Elsevier Science B.V. All rights reserved.
引用
收藏
页码:537 / 544
页数:8
相关论文
共 50 条
  • [1] DataGen: A generator of datasets for evaluation of classification algorithms
    Natl Ukrainian Acad of Sciences, Kiev, Ukraine
    Pattern Recognit Lett, 7 (537-544):
  • [2] Classification algorithms for biomedical volume datasets
    Cerquides, Jesus
    Lopez-Sanchez, Maite
    Ontanon, Santi
    Puertas, Eloi
    Puig, Anna
    Pujol, Oriol
    Tost, Dani
    CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2006, 4177 : 143 - 152
  • [3] Use of Genetic Algorithms for Classification of Datasets
    Shanabog, Nandish C. S.
    Ashwinkumar, U. M.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 2016 - 2020
  • [4] Reanalysis of Classification Algorithms on Different Datasets
    Yue, Peng-fei
    Wu, Qin-ge
    Zhu, Jian-gang
    Cheng, Wen-fang
    Qian, Xiao-liang
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MANUFACTURING ENGINEERING AND INTELLIGENT MATERIALS (ICMEIM 2017), 2017, 100 : 124 - 127
  • [5] Performance Comparison of Classification Algorithms on Medical Datasets
    Ramana, Bendi Venkata
    Boddu, Raja Sarath Kumar
    2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 140 - 145
  • [6] Analysis of cancer datasets using Classification Algorithms
    Kumar, Parvesh
    Wasan, Siri Krishan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (06): : 175 - 182
  • [7] A Comparative Analysis of Classification Algorithms on Diverse Datasets
    Alghobiri, Muhammad
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2018, 8 (02) : 2790 - 2795
  • [8] Metric Structures on Datasets: Stability and Classification of Algorithms
    Memoli, Facundo
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS: 14TH INTERNATIONAL CONFERENCE, CAIP 2011, PT 2, 2011, 6855 : 1 - 33
  • [9] Classification and Analysis of Clustering Algorithms for Large Datasets
    Badase, P. S.
    Deshbhratar, G. P.
    Bhagat, A. P.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [10] Evaluation of Deep Learning and Machine Learning Algorithms for Building Occupancy Classification on Open Datasets
    Cretu, Georgiana
    Stamatescu, Iulia
    Stamatescu, Grigore
    2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 575 - 580