Analysis of Learning Influence of Training Data Selected by Distribution Consistency

被引:2
|
作者
Hwang, Myunggwon [1 ,2 ]
Jeong, Yuna [1 ]
Sung, Won-Kyung [1 ,2 ]
机构
[1] Korea Inst Sci & Technol Informat, Intelligent Infrastruct Technol Res Ctr, Daejeon 34141, South Korea
[2] Univ Sci & Technol, Dept Data & HPC Sci, Daejeon 34113, South Korea
关键词
learning influence; machine learning; training data similarity; distribution consistency;
D O I
10.3390/s21041045
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This study suggests a method to select core data that will be helpful for machine learning. Specifically, we form a two-dimensional distribution based on the similarity of the training data and compose grids with fixed ratios on the distribution. In each grid, we select data based on the distribution consistency (DC) of the target class data and examine how it affects the classifier. We use CIFAR-10 for the experiment and set various grid ratios from 0.5 to 0.005. The influences of these variables were analyzed with the use of different training data sizes selected based on high-DC, low-DC (inverse of high DC), and random (no criteria) selections. As a result, the average point accuracy at 0.95% (+/- 0.65) and the point accuracy at 1.54% (+/- 0.59) improved for the grid configurations of 0.008 and 0.005, respectively. These outcomes justify an improved performance compared with that of the existing approach (data distribution search). In this study, we confirmed that the learning performance improved when the training data were selected for very small grid and high-DC settings.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Consistency Analysis of Sensor Data Distribution
    Reali, Gianluca
    Femminella, Mauro
    2013 9TH INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2013, : 1442 - 1447
  • [2] Spam Filtering: the Influence of the Temporal Distribution of Training Data
    Bryl, Anton
    STAIRS 2006, 2006, 142 : 249 - 250
  • [3] SELECTED EARLY CHILDHOOD AFFECTIVE LEARNING PROGRAMS - ANALYSIS OF THEORIES, STRUCTURE, AND CONSISTENCY
    MARTORELLA, PH
    YOUNG CHILDREN, 1975, 30 (04): : 289 - 301
  • [4] Unsupervised Data Augmentation for Consistency Training
    Xie, Qizhe
    Dai, Zihang
    Hovy, Eduard
    Luong, Minh-Thang
    Le, Quoc V.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [5] Training data influence analysis and estimation: a survey
    Zayd Hammoudeh
    Daniel Lowd
    Machine Learning, 2024, 113 : 2351 - 2403
  • [6] Training data influence analysis and estimation: a survey
    Hammoudeh, Zayd
    Lowd, Daniel
    MACHINE LEARNING, 2024, 113 (05) : 2351 - 2403
  • [7] On the Influence of Selected Game Design Elements on Learning Performance in Digital Spelling Training
    Mueller, Hans-Georg
    ZEITSCHRIFT FUR PADAGOGIK, 2023, 69 (01): : 111 - 130
  • [8] CONSISTENCY ANALYSIS OF BILEVEL DATA-DRIVEN LEARNING IN INVERSE PROBLEMS
    Chada, Neil K.
    Schillings, Claudia
    Tong, Xin T.
    Weissmann, Simon
    COMMUNICATIONS IN MATHEMATICAL SCIENCES, 2022, 20 (01) : 123 - 164
  • [9] Analysis of the Influence of Outward Bound Training Based on Data Analysis in College Physical Training
    Wang, Diliang
    Huang, Guoyang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [10] Convolutional Analysis Operator Learning: Dependence on Training Data
    Chun, Il Yong
    Hong, David
    Adcock, Ben
    Fessler, Jeffrey A.
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (08) : 1137 - 1141