Data reduction based on NN-kNN measure for NN classification and regression

被引:9
|
作者
An, Shuang [1 ]
Hu, Qinghua [2 ]
Wang, Changzhong [3 ]
Guo, Ge [1 ]
Li, Piyu [1 ]
机构
[1] Northeastern Univ, Shenyang 110819, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
[3] Bohai Univ, Jinzhou 121013, Peoples R China
基金
中国国家自然科学基金;
关键词
Data quality; Sample reduction; kNN; Local evaluation; Robust classification and regression; OUTLIER DETECTION; ALGORITHMS; SELECTION;
D O I
10.1007/s13042-021-01327-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction processes are designed not only to reduce the amount of data, but also to reduce noise interference. In this study, we focus on researching sample reduction algorithms for the classification and regression data. A sample quality evaluation measure denoted by NN-kNN, which is inspired by human social behavior, is proposed. This measure is a local evaluation method that can accurately evaluate the quality of samples under uneven and irregular data distribution. Additionally, the measure is easy to understand and applies to both supervised and unsupervised data. Consequently, it respectively studies the sample reduction algorithms based on the NN-kNN measure for classification and regression data. Experiments are carried out to verify the proposed quality evaluation measure and data reduction algorithms. Experimental results show that NN-kNN can evaluate data quality effectively. High quality samples selected by the reduction algorithms can generate high classification and prediction performance. Furthermore, the robustness of the sample reduction algorithms is also validated.
引用
收藏
页码:765 / 781
页数:17
相关论文
共 50 条
  • [1] Data reduction based on NN-kNN measure for NN classification and regression
    Shuang An
    Qinghua Hu
    Changzhong Wang
    Ge Guo
    Piyu Li
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 765 - 781
  • [2] An Empirical Analysis of Data Reduction Techniques for k-NN Classification
    Eleftheriadis, Stylianos
    Evangelidis, Georgios
    Ougiaroglou, Stefanos
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024, 2024, 714 : 83 - 97
  • [3] Sensitive Data Detection Using NN and KNN from Big Data
    Adhikari, Binod Kumar
    Zuo, Wan Li
    Maharjan, Ramesh
    Guo, Lin
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT IV, 2018, 11337 : 628 - 642
  • [4] Efficient -NN classification based on homogeneous clusters
    Ougiaroglou, Stefanos
    Evangelidis, Georgios
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2014, 42 (03) : 491 - 513
  • [5] EEG signal classification based on PCA and NN
    Oh, Changmok
    Kim, Min-Soeng
    Lee, Ju-Jang
    [J]. 2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 4760 - +
  • [6] RHC: a non-parametric cluster-based data reduction for efficient k-NN classification
    Ougiaroglou, Stefanos
    Evangelidis, Georgios
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (01) : 93 - 109
  • [7] The Canonical Distortion Measure in feature space and 1-NN classification
    Baxter, J
    Bartlett, P
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 245 - 251
  • [8] k-NN Regression on Functional Data with Incomplete Observations
    Reddi, Sashank J.
    Poczos, Barnabis
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 692 - 701
  • [10] Wi-NN: Human Gesture Recognition System Based on Weighted KNN
    Zhang, Yajun
    Yuan, Bo
    Yang, Zhixiong
    Li, Zijian
    Liu, Xu
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (06):