K-nearest neighbor finding using MaxNearestDist

被引:92
|
作者
Samet, Hanan [1 ]
机构
[1] Univ Maryland, Inst Adv Comp Studies, Ctr Automat Res, Dept Comp Sci, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
k-nearest neighbors; similarity searching; metric spaces; depth-first nearest neighbor finding; best-first nearest neighbor finding;
D O I
10.1109/TPAMI.2007.1182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Similarity searching often reduces to finding the k nearest neighbors to a query object. Finding the k nearest neighbors is achieved by applying either a depth-first or a best-first algorithm to the search hierarchy containing the data. These algorithms are generally applicable to any index based on hierarchical clustering. The idea is that the data is partitioned into clusters that are aggregated to form other clusters, with the total aggregation being represented as a tree. These algorithms have traditionally used a lower bound corresponding to the minimum distance at which a nearest neighbor can be found (termed MINDIST) to prune the search process by avoiding the processing of some of the clusters, as well as individual objects when they can be shown to be farther from the query object q than all of the current k nearest neighbors of q. An alternative pruning technique that uses an upper bound corresponding to the maximum possible distance at which a nearest neighbor is guaranteed to be found (termed MAXNEARESTDIST) is described. The MAXNEARESTDIST upper bound is adapted to enable its use for finding the k nearest neighbors instead of just the nearest neighbor (that is, k 1) as in its previous uses. Both the depth-first and best-first k-nearest neighbor algorithms are modified to use MAXNEARESTDIST, which is shown to enhance both algorithms by overcoming their shortcomings. In particular, for the depth-first algorithm, the number of clusters in the search hierarchy that must be examined is not increased thereby potentially lowering its execution time, while for the best-first algorithm, the number of clusters in the search hierarchy that must be retained in the priority queue used to control the ordering of processing of the clusters is also not increased, thereby potentially lowering its storage requirements.
引用
收藏
页码:243 / 252
页数:10
相关论文
共 50 条
  • [1] Depth-first K-nearest neighbor finding using the MaxNearestDist estimator
    Samet, H
    [J]. 12TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2003, : 486 - 491
  • [2] Fuzzy Monotonic K-Nearest Neighbor Versus Monotonic Fuzzy K-Nearest Neighbor
    Zhu, Hong
    Wang, Xizhao
    Wang, Ran
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (09) : 3501 - 3513
  • [3] Intrusion Detection Using k-Nearest Neighbor
    Govindarajan, M.
    Chandrasekaran, R. M.
    [J]. FIRST INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING 2009 (ICAC 2009), 2009, : 13 - +
  • [4] Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification
    Okfalisa
    Mustakim
    Gazalba, Ikbal
    Reza, Nurul Gayatri Indah
    [J]. 2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 294 - 298
  • [5] The k-Nearest Neighbor Algorithm Using MapReduce Paradigm
    Anchalia, Prajesh P.
    Roy, Kaushik
    [J]. PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 513 - 518
  • [6] k-Nearest Neighbor Classification Using Dissimilarity Increments
    Aidos, Helena
    Fred, Ana
    [J]. IMAGE ANALYSIS AND RECOGNITION, PT I, 2012, 7324 : 27 - 33
  • [7] Diagnosis of Arthritis Using K-Nearest Neighbor Approach
    Kaur, Rupinder
    Madaan, Vishu
    Agrawal, Prateek
    [J]. ADVANCED INFORMATICS FOR COMPUTING RESEARCH, PT I, 2019, 1075 : 160 - 171
  • [8] Navigating K-Nearest Neighbor Graphs to Solve Nearest Neighbor Searches
    Chavez, Edgar
    Sadit Tellez, Eric
    [J]. ADVANCES IN PATTERN RECOGNITION, 2010, 6256 : 270 - 280
  • [9] MKNN: Modified K-Nearest Neighbor
    Parvin, Hamid
    Alizadeh, Hoscin
    Minael-Bidgoli, Behrouz
    [J]. WCECS 2008: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, 2008, : 831 - 834
  • [10] A GENERALIZED K-NEAREST NEIGHBOR RULE
    PATRICK, EA
    FISCHER, FP
    [J]. INFORMATION AND CONTROL, 1970, 16 (02): : 128 - &