Analysis of Learning Influence of Training Data Selected by Distribution Consistency

被引:2
|
作者
Hwang, Myunggwon [1 ,2 ]
Jeong, Yuna [1 ]
Sung, Won-Kyung [1 ,2 ]
机构
[1] Korea Inst Sci & Technol Informat, Intelligent Infrastruct Technol Res Ctr, Daejeon 34141, South Korea
[2] Univ Sci & Technol, Dept Data & HPC Sci, Daejeon 34113, South Korea
关键词
learning influence; machine learning; training data similarity; distribution consistency;
D O I
10.3390/s21041045
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This study suggests a method to select core data that will be helpful for machine learning. Specifically, we form a two-dimensional distribution based on the similarity of the training data and compose grids with fixed ratios on the distribution. In each grid, we select data based on the distribution consistency (DC) of the target class data and examine how it affects the classifier. We use CIFAR-10 for the experiment and set various grid ratios from 0.5 to 0.005. The influences of these variables were analyzed with the use of different training data sizes selected based on high-DC, low-DC (inverse of high DC), and random (no criteria) selections. As a result, the average point accuracy at 0.95% (+/- 0.65) and the point accuracy at 1.54% (+/- 0.59) improved for the grid configurations of 0.008 and 0.005, respectively. These outcomes justify an improved performance compared with that of the existing approach (data distribution search). In this study, we confirmed that the learning performance improved when the training data were selected for very small grid and high-DC settings.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [41] Learning the Distribution of Data for Embedding
    Shen, Yunpeng
    Ren, Pengfei
    Zhang, Taiping
    Tang, Yuan Yan
    2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2016, : 46 - 51
  • [42] On learning control with limited training data
    Ou, Y
    Xu, Y
    2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS, 2003, : 4148 - 4153
  • [43] Resolving Quantitative MRI Model Degeneracy with Machine Learning via Training Data Distribution Design
    Guerreri, Michele
    Epstein, Sean
    Azadbakht, Hojjat
    Zhang, Hui
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2023, 2023, 13939 : 3 - 14
  • [44] Learning from Discriminatory Training Data
    Grabowicz, Przemyslaw
    Perello, Nicholas
    Takatsu, Kenta
    PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 752 - 763
  • [45] Segmentation Consistency Training: Out-of-Distribution Generalization for Medical Image Segmentation
    Torpmann-Hagen, Birk
    Thambawita, Vajira
    Riegler, Michael A.
    Halvorsen, Pal
    Glette, Kyrre
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 42 - 49
  • [46] Leverage Learning Behaviour Data for Students' Learning Performance Prediction and Influence Factor Analysis
    Ni Q.
    Zhu Y.
    Zhang L.
    Lu X.
    Zhang L.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 2422 - 2433
  • [47] Geometric deep learning as an enabler for data consistency and interoperability in manufacturing
    Bruendl, Patrick
    Scheffler, Benedikt
    Straub, Christopher
    Stoidner, Micha
    Nguyen, Huong Giang
    Franke, Joerg
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2025, 44
  • [48] Negative Data in Data Sets for Machine Learning Training
    Maloney, Michael P.
    Coley, Connor W.
    Genheden, Samuel
    Carson, Nessa
    Helquist, Paul
    Norrby, Per-Ola
    Wiest, Olaf
    ORGANIC LETTERS, 2023, 25 (17) : 2945 - 2947
  • [49] Learning Data Consistency and its Application to Dynamic MR Imaging
    Cheng, Jing
    Cui, Zhuo-Xu
    Huang, Wenqi
    Ke, Ziwen
    Ying, Leslie
    Wang, Haifeng
    Zhu, Yanjie
    Liang, Dong
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (11) : 3140 - 3153
  • [50] Data-Efficient Deep Reinforcement Learning with Symmetric Consistency
    Zhang, Xianchao
    Yang, Wentao
    Zhang, Xiaotong
    Liu, Han
    Wang, Guanglu
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2430 - 2436