A Fast Exact k-Nearest Neighbors Algorithm for High Dimensional Search Using k-Means Clustering and Triangle Inequality

被引:0
|
作者
Wang, Xueyi [1 ]
机构
[1] NW Nazarene Univ, Dept Math & Comp Sci, Nampa, ID 83642 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k-nearest neighbors (k-NN) algorithm is a widely used machine learning method that finds nearest neighbors of a test object in a feature space. We present a new exact k-NN algorithm called kMkNN (k-Means for k-Nearest Neighbors) that uses the k-means clustering and the triangle inequality to accelerate the searching for nearest neighbors in a high dimensional space. The kMkNN algorithm has two stages. In the buildup stage, instead of using complex tree structures such as metric trees, kd-trees, or ball-tree, kMkNN uses a simple k-means clustering method to preprocess the training dataset. In the searching stage, given a query object, kMkNN finds nearest training objects starting from the nearest cluster to the query object and uses the triangle inequality to reduce the distance calculations. Experiments show that the performance of kMkNN is surprisingly good compared to the traditional k-NN algorithm and tree-based k-NN algorithms such as kd-trees and ball-trees. On a collection of 20 datasets with up to 10(6) records and 10(4) dimensions, kMkNN shows a 2- to 80-fold reduction of distance calculations and a 2- to 60-fold speedup over the traditional k-NN algorithm for 16 datasets. Furthermore, kMkNN performs significant better than a kd-tree based k-NN algorithm for all datasets and performs better than a ball-tree based k-NN algorithm for most datasets. The results show that kMkNN is effective for searching nearest neighbors in high dimensional spaces.
引用
收藏
页码:1293 / 1299
页数:7
相关论文
共 50 条
  • [11] Efficient k-Nearest Neighbors Search in High Dimensions using MapReduce
    Zhu, Pingfei
    Zhan, Xiangwen
    Qiu, Wenming
    [J]. PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, : 23 - 30
  • [12] Efficient Hand Movement Detection Using k-Means Clustering and k-Nearest Neighbor Algorithms
    Erhan Bergil
    Canan Oral
    Engin Ufuk Ergul
    [J]. Journal of Medical and Biological Engineering, 2021, 41 : 11 - 24
  • [13] Efficient Hand Movement Detection Using k-Means Clustering and k-Nearest Neighbor Algorithms
    Bergil, Erhan
    Oral, Canan
    Ergul, Engin Ufuk
    [J]. JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2021, 41 (01) : 11 - 24
  • [14] An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio
    Raneem Qaddoura
    Hossam Faris
    Ibrahim Aljarah
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 675 - 714
  • [15] Feature Selection for High Dimensional Data Using Weighted K-Nearest Neighbors and Genetic Algorithm
    Li, Shuangjie
    Zhang, Kaixiang
    Chen, Qianru
    Wang, Shuqin
    Zhang, Shaoqiang
    [J]. IEEE ACCESS, 2020, 8 : 139512 - 139528
  • [16] Information theoretic clustering using a k-nearest neighbors approach
    Vikjord, Vidar V.
    Jenssen, Robert
    [J]. PATTERN RECOGNITION, 2014, 47 (09) : 3070 - 3081
  • [17] Soft Inference as a Voting Mechanism in k-Nearest Neighbors Clustering Algorithm
    Bury, Tomasz
    Kacprzak, Aleksandra
    Zerdzinski, Piotr
    [J]. INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2023, 2024, 1979 : 309 - 318
  • [18] An efficient clustering algorithm based on the k-nearest neighbors with an indexing ratio
    Qaddoura, Raneem
    Faris, Hossam
    Aljarah, Ibrahim
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (03) : 675 - 714
  • [19] An adaptive k-nearest neighbors clustering algorithm for complex distribution dataset
    Zhang, Yan
    Jia, Yan
    Huang, Xiaobin
    Zhou, Bin
    Gu, Jian
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2007, 4682 : 398 - 407
  • [20] A UNIMODAL CLUSTERING-ALGORITHM BASED ON THE K-NEAREST NEIGHBORS METHOD
    KOVALENKO, AP
    [J]. AUTOMATION AND REMOTE CONTROL, 1993, 54 (05) : 794 - 798