Data reduction based on NN-kNN measure for NN classification and regression

Cited by: 9
Authors
An, Shuang [1]
Hu, Qinghua [2]
Wang, Changzhong [3]
Guo, Ge [1]
Li, Piyu [1]
Affiliations
[1] Northeastern Univ, Shenyang 110819, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
[3] Bohai Univ, Jinzhou 121013, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data quality; Sample reduction; kNN; Local evaluation; Robust classification and regression; Outlier detection; Algorithms; Selection
DOI
10.1007/s13042-021-01327-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Data reduction aims not only to reduce the amount of data but also to reduce noise interference. In this study, we focus on sample reduction algorithms for classification and regression data. We propose NN-kNN, a sample quality evaluation measure inspired by human social behavior. It is a local evaluation method that can accurately evaluate sample quality even when the data distribution is uneven and irregular. The measure is also easy to understand and applies to both supervised and unsupervised data. Based on the NN-kNN measure, we study sample reduction algorithms for classification data and for regression data separately. Experiments are carried out to verify the proposed quality evaluation measure and data reduction algorithms. The results show that NN-kNN evaluates data quality effectively and that the high-quality samples selected by the reduction algorithms yield high classification and prediction performance. The robustness of the sample reduction algorithms is also validated.
Pages: 765-781
Number of pages: 17