Is it possible to find the single nearest neighbor of a query in high dimensions?

Authors
Ting, Kai Ming [1]
Washio, Takashi [2]
Zhu, Ye [3]
Xu, Yang [1]
Zhang, Kaifeng [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol & Sch Artifici, Nanjing 210023, Peoples R China
[2] Kansai Univ, Fac Business & Commerce, Osaka 5648680, Japan
[3] Deakin Univ, Sch Informat Technol, Geelong 3125, Australia
Funding
National Natural Science Foundation of China
Keywords
Curse of dimensionality; Isolation kernel; High dimensions; Nearest neighbor search; Indexed search for exact nearest neighbor search; Anomaly detection using kernel density estimation and t-SNE visualization
DOI
10.1016/j.artint.2024.104206
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We investigate an open question in the study of the curse of dimensionality: Is it possible to find the single nearest neighbor of a query in high dimensions? Using the notion of (in)distinguishability to examine whether the feature map of a kernel is able to distinguish two distinct points in high dimensions, we analyze this ability of a metric-based Lipschitz continuous kernel as well as that of the recently introduced Isolation Kernel. Between the two kernels, we show that only Isolation Kernel has distinguishability and it performs consistently well in four tasks: indexed search for exact nearest neighbor search, anomaly detection using kernel density estimation, t-SNE visualization and SVM classification in both low and high dimensions, compared with distance, Gaussian and three other existing kernels.
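
The curse of dimensionality that the abstract refers to is often illustrated by distance concentration: as the dimension grows, the farthest and nearest points from a query become almost equally far away. The following minimal sketch shows that effect for Euclidean distance on uniformly random data. It is not the paper's method and does not implement Isolation Kernel; the function name `relative_contrast`, the sample size, and the uniform data distribution are all assumptions chosen for illustration.

```python
import math
import random


def relative_contrast(dim, n_points=200, seed=0):
    """Return (d_max - d_min) / d_min for one random query.

    Hypothetical helper for illustration only: the query and the
    n_points data points are drawn uniformly from the unit cube [0, 1]^dim.
    """
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))  # Euclidean distance
    return (max(dists) - min(dists)) / min(dists)


if __name__ == "__main__":
    # The printed contrast should shrink as the dimension grows.
    for dim in (2, 10, 100, 1000):
        print(dim, round(relative_contrast(dim), 3))
```

A small relative contrast means that the nearest neighbor is barely closer than any other point, which is the setting in which the abstract asks whether a single nearest neighbor can still be identified.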
Pages: 24