Data reduction based on NN-kNN measure for NN classification and regression

被引:9
|
作者
An, Shuang [1 ]
Hu, Qinghua [2 ]
Wang, Changzhong [3 ]
Guo, Ge [1 ]
Li, Piyu [1 ]
机构
[1] Northeastern Univ, Shenyang 110819, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
[3] Bohai Univ, Jinzhou 121013, Peoples R China
基金
中国国家自然科学基金;
关键词
Data quality; Sample reduction; kNN; Local evaluation; Robust classification and regression; OUTLIER DETECTION; ALGORITHMS; SELECTION;
D O I
10.1007/s13042-021-01327-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data reduction processes are designed not only to reduce the amount of data, but also to reduce noise interference. In this study, we focus on researching sample reduction algorithms for the classification and regression data. A sample quality evaluation measure denoted by NN-kNN, which is inspired by human social behavior, is proposed. This measure is a local evaluation method that can accurately evaluate the quality of samples under uneven and irregular data distribution. Additionally, the measure is easy to understand and applies to both supervised and unsupervised data. Consequently, it respectively studies the sample reduction algorithms based on the NN-kNN measure for classification and regression data. Experiments are carried out to verify the proposed quality evaluation measure and data reduction algorithms. Experimental results show that NN-kNN can evaluate data quality effectively. High quality samples selected by the reduction algorithms can generate high classification and prediction performance. Furthermore, the robustness of the sample reduction algorithms is also validated.
引用
收藏
页码:765 / 781
页数:17
相关论文
共 50 条
  • [21] Musical Genre Classification on the Marsyas Audio Data Using Convolution NN
    Ahmed, Md Sabbir
    Mahmud, Md Zalish
    Akhter, Shamim
    [J]. 2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
  • [22] A complete system for NN classification based on a VLSI array processor
    Ferrari, A
    Borgatti, M
    Guerrieri, R
    [J]. PATTERN RECOGNITION, 2000, 33 (12) : 2083 - 2093
  • [23] Classification of File Data Based on Confidentiality in Cloud Computing using K-NN Classifier
    Zardari, Munwar Ali
    Jung, Low Tang
    [J]. INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS, 2016, 3 (02) : 61 - 78
  • [24] NN-BASED ORDINAL REGRESSION FOR ASSESSING FLUENCY OF ESL SPEECH
    Mao, Shaoguang
    Wu, Zhiyong
    Jiang, Jingshuai
    Liu, Peiyun
    Soong, Frank K.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7420 - 7424
  • [25] A two-dimensional matrix image based feature extraction method for classification of sEMG: A comparative analysis based on SVM, KNN and RBF-NN
    Wen, Tingxi
    Zhang, Zhongnan
    Qiu, Ming
    Zeng, Ming
    Luo, Weizhen
    [J]. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2017, 25 (02) : 287 - 300
  • [26] A polynomial fitting and k-NN based approach for improving classification of motor imagery BCI data
    Kayikcioglu, Temel
    Aydemir, Onder
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (11) : 1207 - 1215
  • [27] An Optimized k-NN Approach for Classification on Imbalanced Datasets with Missing Data
    Ozan, Ezgi Can
    Riabchenko, Ekaterina
    Kiranyaz, Serkan
    Gabbouj, Moncef
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XV, 2016, 9897 : 387 - 392
  • [28] Evaluation of normalization methods for cDNA microarray data by k-NN classification
    Wei Wu
    Eric P Xing
    Connie Myers
    I Saira Mian
    Mina J Bissell
    [J]. BMC Bioinformatics, 6
  • [29] Content Based Component Retrieval Based on Neural Network (NN) Classification Method
    Garg, Rupali
    Bajwa, Jagpuneet Kaur
    [J]. ADVANCES IN COMPUTING AND DATA SCIENCES, ICACDS 2016, 2017, 721 : 577 - 584
  • [30] Gene selection for enhanced classification on microarray data using a weighted k-NN based algorithm
    Ventura-Molina, Elias
    Alarcon-Paredes, Antonio
    Aldape-Perez, Mario
    Yanez-Marquez, Cornelio
    Adolfo Alonso, Gustavo
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (01) : 241 - 253