Is it possible to find the single nearest neighbor of a query in high dimensions?

Cited by: 0
Authors
Ting, Kai Ming [1]
Washio, Takashi [2]
Zhu, Ye [3]
Xu, Yang [1]
Zhang, Kaifeng [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol & Sch Artifici, Nanjing 210023, Peoples R China
[2] Kansai Univ, Fac Business & Commerce, Osaka 5648680, Japan
[3] Deakin Univ, Sch Informat Technol, Geelong 3125, Australia
Funding
National Natural Science Foundation of China
Keywords
Curse of dimensionality; Isolation kernel; High dimensions; Nearest neighbor search; Indexed search for exact nearest neighbor search; Anomaly detection using kernel density estimation and t-SNE visualization
DOI
10.1016/j.artint.2024.104206
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We investigate an open question in the study of the curse of dimensionality: Is it possible to find the single nearest neighbor of a query in high dimensions? Using the notion of (in)distinguishability to examine whether the feature map of a kernel is able to distinguish two distinct points in high dimensions, we analyze this ability of a metric-based Lipschitz continuous kernel as well as that of the recently introduced Isolation Kernel. Between the two kernels, we show that only Isolation Kernel has distinguishability and it performs consistently well in four tasks: indexed search for exact nearest neighbor search, anomaly detection using kernel density estimation, t-SNE visualization and SVM classification in both low and high dimensions, compared with distance, Gaussian and three other existing kernels.
Pages: 24