K-nearest Neighbor Search by Random Projection Forests

Cited by: 0
Authors
Yan, Donghui [1, 2]
Wang, Yingjie [3 ]
Wang, Jin [3 ]
Wang, Honggang [3 ]
Li, Zhenpeng [4 ]
Affiliations
[1] Univ Massachusetts, Dept Math, Dartmouth, MA 02747 USA
[2] Univ Massachusetts, Program Data Sci, Dartmouth, MA 02747 USA
[3] Univ Massachusetts, Dept Elect & Comp Engn, Dartmouth, MA 02747 USA
[4] Dali Univ, Dept Appl Stat, Dali 671000, Yunnan, Peoples R China
Keywords
k-nearest neighbors; random projection forests; ensemble; unsupervised learning; ALGORITHMS;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
K-nearest neighbor (kNN) search has wide applications in many areas, including data mining, machine learning, statistics and many applied domains. Inspired by the success of ensemble methods and the flexibility of tree-based methodology, we propose random projection forests, rpForests, for kNN search. rpForests finds kNNs by aggregating results from an ensemble of random projection trees, each constructed recursively through a series of carefully chosen random projections. rpForests achieves remarkable accuracy: both the missing rate of kNNs and the discrepancy in the kNN distances decay quickly. rpForests has very low computational complexity. The ensemble nature of rpForests makes it easy to run in parallel on multicore or clustered computers; the running time is expected to be nearly inversely proportional to the number of cores or machines. We give theoretical insights by showing that the probability that neighboring points are separated by an ensemble of random projection trees decays exponentially as the ensemble size increases. Our theory can be used to refine the choice of random projections in the growth of trees, and experiments show that the effect is remarkable.
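The aggregation idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `build_tree` and `rp_forest_knn`, the Gaussian random directions, the median split point, and the `leaf_size` and `n_trees` defaults are all assumptions made for illustration, and the "carefully chosen" projection rule from the paper is not reproduced here. Each tree partitions the data by projecting onto a random direction and splitting; points that share a leaf in any tree become kNN candidates, and the final neighbors are the closest candidates.

```python
import numpy as np


def build_tree(X, idx, leaf_size, rng):
    """Recursively partition the point indices `idx`; return a list of leaves."""
    if len(idx) <= leaf_size:
        return [idx]
    # Assumption: a plain Gaussian direction and a median split point.
    direction = rng.standard_normal(X.shape[1])
    proj = X[idx] @ direction
    split = np.median(proj)
    left, right = idx[proj <= split], idx[proj > split]
    if len(left) == 0 or len(right) == 0:
        # Degenerate split (e.g. duplicated points): stop and keep one leaf.
        return [idx]
    return (build_tree(X, left, leaf_size, rng)
            + build_tree(X, right, leaf_size, rng))


def rp_forest_knn(X, k, n_trees=10, leaf_size=30, seed=0):
    """Approximate kNN indices for every row of X, shape (n, k).

    Entries are -1 where fewer than k candidates were found.
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    candidates = [set() for _ in range(n)]
    for _ in range(n_trees):
        for leaf in build_tree(X, np.arange(n), leaf_size, rng):
            members = leaf.tolist()
            for i in members:
                # Every point sharing a leaf with i is a candidate neighbor.
                candidates[i].update(members)
    neighbors = np.full((n, k), -1, dtype=int)
    for i in range(n):
        cand = np.array(sorted(candidates[i] - {i}), dtype=int)
        dists = np.linalg.norm(X[cand] - X[i], axis=1)
        order = np.argsort(dists)[:k]
        neighbors[i, :len(order)] = cand[order]
    return neighbors


if __name__ == "__main__":
    data = np.random.default_rng(1).standard_normal((500, 8))
    print(rp_forest_knn(data, k=5, n_trees=20).shape)
```

Because the trees are built independently, the loop over `n_trees` is the natural place to parallelize, which matches the abstract's remark about multicore or clustered execution. Adding trees can only enlarge each point's candidate set, so under this sketch the chance of missing a true neighbor does not increase with ensemble size.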
Pages: 4775-4781
Number of pages: 7