A new fast search algorithm for exact k-nearest neighbors based on optimal triangle-inequality-based check strategy

被引:22
|
作者
Pan, Yiwei [1 ]
Pan, Zhibin [1 ]
Wang, Yikun [1 ]
Wang, Wei [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
关键词
Exact k-nearest neighbors; Fast search algorithm; Clustering; Triangle inequality; Optimal check strategy;
D O I
10.1016/j.knosys.2019.105088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k-nearest neighbor (KNN) algorithm has been widely used in pattern recognition, regression, outlier detection and other data mining areas. However, it suffers from the large distance computation cost, especially when dealing with big data applications. In this paper, we propose a new fast search (FS) algorithm for exact k-nearest neighbors based on optimal triangle-inequality-based (OTI) check strategy. During the procedure of searching exact k-nearest neighbors for any query, the OTI check strategy can eliminate more redundant distance computations for the instances located in the marginal area of neighboring clusters compared with the original TI check strategy. Considering the large space complexity and extra time complexity of OTI, we also propose an efficient optimal triangle-inequality-based (EOTI) check strategy. The experimental results demonstrate that our proposed two algorithms (OTI and EOTI) achieve the best performance compared with other related KNN fast search algorithms, especially in the case of dealing with high-dimensional datasets. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A Fast Exact k-Nearest Neighbors Algorithm for High Dimensional Search Using k-Means Clustering and Triangle Inequality
    Wang, Xueyi
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 1293 - 1299
  • [2] Fast, k nearest neighbors search algorithm based on wavelet transform
    Qiao, Yu-Long
    Lu, Zhe-Ming
    Sun, Sheng-He
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2006, E89A (08) : 2239 - 2243
  • [3] Chameleon algorithm based on mutual k-nearest neighbors
    Yuru Zhang
    Shifei Ding
    Lijuan Wang
    Yanru Wang
    Ling Ding
    [J]. Applied Intelligence, 2021, 51 : 2031 - 2044
  • [4] Chameleon algorithm based on mutual k-nearest neighbors
    Zhang, Yuru
    Ding, Shifei
    Wang, Lijuan
    Wang, Yanru
    Ding, Ling
    [J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2031 - 2044
  • [5] A NEW FUZZY K-NEAREST NEIGHBORS ALGORITHM
    Li, Chengjie
    Pei, Zheng
    Li, Bo
    Zhang, Zhen
    [J]. INTELLIGENT DECISION MAKING SYSTEMS, VOL. 2, 2010, : 246 - +
  • [6] A New Version of the Dendritic Cell Immune Algorithm Based on the K-Nearest Neighbors
    Ben Ali, Kaouther
    Chelly, Zeineb
    Elouedi, Zied
    [J]. NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 688 - 695
  • [7] Density Peak Clustering Algorithm Based on K-nearest Neighbors and Optimized Allocation Strategy
    Sun L.
    Qin X.-Y.
    Xu J.-C.
    Xue Z.-A.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (04): : 1390 - 1411
  • [8] Relative density based K-nearest neighbors clustering algorithm
    Liu, QB
    Deng, S
    Lu, CH
    Wang, B
    Zhou, YF
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 133 - 137
  • [9] Tea category identification based on optimal wavelet entropy and weighted k-Nearest Neighbors algorithm
    Wu, Xueyan
    Yang, Jiquan
    Wang, Shuihua
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 3745 - 3759
  • [10] Tea category identification based on optimal wavelet entropy and weighted k-Nearest Neighbors algorithm
    Xueyan Wu
    Jiquan Yang
    Shuihua Wang
    [J]. Multimedia Tools and Applications, 2018, 77 : 3745 - 3759