A Taxonomy and Experimental Study on Prototype Generation for Nearest Neighbor Classification

被引:192
|
作者
Triguero, Isaac [1 ]
Derrac, Joaquin [1 ]
Garcia, Salvador [2 ]
Herrera, Francisco [1 ]
机构
[1] Univ Granada, Res Ctr Informat & Commun Technol, Dept Comp Sci & Artificial Intelligence, E-18071 Granada, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen 23071, Spain
关键词
Classification; learning vector quantization (LVQ); nearest neighbor (NN); prototype generation (PG); taxonomy; VECTOR QUANTIZATION; STATISTICAL COMPARISONS; FINDING PROTOTYPES; REDUCTION; DESIGN; CLASSIFIERS; PERFORMANCE; ALGORITHM; SELECTION; BOOTSTRAP;
D O I
10.1109/TSMCC.2010.2103939
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The nearest neighbor (NN) rule is one of the most successfully used techniques to resolve classification and pattern recognition tasks. Despite its high classification accuracy, this rule suffers from several shortcomings in time response, noise sensitivity, and high storage requirements. These weaknesses have been tackled by many different approaches, including a good and well-known solution that we can find in the literature, which consists of the reduction of the data used for the classification rule (training data). Prototype reduction techniques can be divided into two different approaches, which are known as prototype selection and prototype generation (PG) or abstraction. The former process consists of choosing a subset of the original training data, whereas PG builds new artificial prototypes to increase the accuracy of the NN classification. In this paper, we provide a survey of PG methods specifically designed for the NN rule. From a theoretical point of view, we propose a taxonomy based on the main characteristics presented in them. Furthermore, from an empirical point of view, we conduct a wide experimental study that involves small and large datasets to measure their performance in terms of accuracy and reduction capabilities. The results are contrasted through nonparametrical statistical tests. Several remarks are made to understand which PG models are appropriate for application to different datasets.
引用
收藏
页码:86 / 100
页数:15
相关论文
共 50 条
  • [1] Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study
    Garcia, Salvador
    Derrac, Joaquin
    Ramon Cano, Jose
    Herrera, Francisco
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (03) : 417 - 435
  • [2] Using gravitational search algorithm in prototype generation for nearest neighbor classification
    Rezaei, Mohadese
    Nezamabadi-pour, Hossein
    [J]. NEUROCOMPUTING, 2015, 157 : 256 - 263
  • [3] Prototype optimization for nearest-neighbor classification
    Huang, YS
    Chiang, CC
    Shieh, JW
    Grimson, E
    [J]. PATTERN RECOGNITION, 2002, 35 (06) : 1237 - 1245
  • [4] Efficient prototype reordering in nearest neighbor classification
    Bandyopadhyay, S
    Maulik, U
    [J]. PATTERN RECOGNITION, 2002, 35 (12) : 2791 - 2799
  • [5] Prototype Generation Using Multiobjective Particle Swarm Optimization for Nearest Neighbor Classification
    Hu, Weiwei
    Tan, Ying
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (12) : 2719 - 2731
  • [6] IPADE: Iterative Prototype Adjustment for Nearest Neighbor Classification
    Triguero, Isaac
    Garcia, Salvador
    Herrera, Francisco
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (12): : 1984 - 1990
  • [7] Prototype generation in the string space via approximate median for data reduction in nearest neighbor classification
    Castellanos, Francisco J.
    Valero-Mas, Jose J.
    Calvo-Zaragoza, Jorge
    [J]. SOFT COMPUTING, 2021, 25 (24) : 15403 - 15415
  • [8] Prototype generation in the string space via approximate median for data reduction in nearest neighbor classification
    Francisco J. Castellanos
    Jose J. Valero-Mas
    Jorge Calvo-Zaragoza
    [J]. Soft Computing, 2021, 25 : 15403 - 15415
  • [9] Performance evaluation of prototype selection algorithms for nearest neighbor classification
    Sánchez, JS
    Barandela, R
    Alejo, R
    Marqués, AI
    [J]. XIV BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2001, : 44 - 50
  • [10] Prototype, nearest neighbor and hybrid algorithms for time series classification
    Wisotzki, C
    Wysotzki, F
    [J]. MACHINE LEARNING: ECML-95, 1995, 912 : 364 - 367