Adaptive κ-nearest-neighbor classification using a dynamic number of nearest neighbors

Citations: 0
Authors
Ougiaroglou, Stefanos [1]
Nanopoulos, Alexandros [1]
Papadopoulos, Apostolos N. [1]
Manolopoulos, Yannis [1]
Welzer-Druzovec, Tatjana [2]
Affiliations
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
[2] Univ Maribor, Fac Elect Engn & Comp Sci, SLO-2000 Maribor, Slovenia
Keywords
kNN classification; multidimensional data; performance
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
Classification based on k-nearest neighbors (kNN classification) is one of the most widely used classification methods. The number k of nearest neighbors needed to achieve high classification accuracy is given in advance and depends heavily on the data set. If the data set is large, a sequential or binary search for the nearest neighbors is impractical because of the increased computational cost. Therefore, indexing schemes are frequently used to speed up the classification process. If the required number of nearest neighbors is high, however, an index alone may not be adequate to achieve high performance. In this paper, we demonstrate that the execution of the nearest neighbor search algorithm can be interrupted if certain criteria are satisfied. In this way, a decision can be made without computing all k nearest neighbors of a new object. Three heuristics are studied for enhancing the nearest neighbor algorithm with an early-break capability. These heuristics aim at (i) reducing computation and I/O costs as much as possible and (ii) maintaining classification accuracy at a high level. Experimental results on real-life data sets illustrate the applicability of the proposed method in achieving better performance than existing methods.
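The abstract does not spell out the three heuristics, so the following is only a minimal sketch of the general early-break idea, not the authors' method. It assumes one simple stopping rule: neighbors are retrieved one at a time in order of distance, and retrieval stops once the leading class is ahead by more votes than there are neighbors left to examine. The function name `early_break_knn`, the heap used in place of a real index, and the sample data are all hypothetical.

```python
from collections import Counter
import heapq
import math


def early_break_knn(train, query, k):
    """Majority-vote kNN that stops early once the winner is decided.

    `train` is a list of (point, label) pairs. Returns (label, examined),
    where `examined` is the number of neighbors actually retrieved.
    """
    if not train or k < 1:
        raise ValueError("need a non-empty training set and k >= 1")

    # A heap stands in for an index that returns neighbors incrementally.
    # The position i breaks distance ties so labels are never compared.
    heap = [(math.dist(x, query), i, label)
            for i, (x, label) in enumerate(train)]
    heapq.heapify(heap)

    votes = Counter()
    examined = 0
    while heap and examined < k:
        _, _, label = heapq.heappop(heap)
        votes[label] += 1
        examined += 1

        ranked = votes.most_common(2)
        lead = ranked[0][1]
        runner_up = ranked[1][1] if len(ranked) > 1 else 0
        remaining = k - examined

        # Assumed stopping rule: even if every remaining neighbor voted
        # for the runner-up, it could not catch the leading class.
        if lead - runner_up > remaining:
            break

    return votes.most_common(1)[0][0], examined


# Hypothetical two-class data set in the plane.
train = [((0, 0), "a"), ((0, 1), "a"), ((1, 0), "a"), ((1, 1), "a"),
         ((5, 5), "b"), ((5, 6), "b"), ((6, 5), "b")]
label, examined = early_break_knn(train, (0.5, 0.5), k=5)
```

This particular rule never changes the majority-vote result, so it saves work only when one class dominates the nearest neighbors. Heuristics that accept a small loss in accuracy, as the abstract suggests, could stop sooner.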
Pages: 66 / +
Number of pages: 3