Effectiveness of the Euclidean distance in high dimensional spaces

被引:35
|
作者
Xia, Shuyin [1 ]
Xiong, Zhongyang [1 ]
Luo, Yueguo [1 ,3 ]
Xu, Wei [2 ]
Zhang, Guanghua [1 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[2] South Cent Univ Nationalities, Coll Comp Sci, Wuhan 400015, Peoples R China
[3] Yangtze Normal Univ, Network Informat Ctr, Chongqing 408100, Peoples R China
来源
OPTIK | 2015年 / 126卷 / 24期
关键词
Euclidean distance; High dimensional space; Independent and identical distributed;
D O I
10.1016/j.ijleo.2015.09.093
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
This paper presents analysis of applicability and performance of the Euclidean distance in relation to the dimensionality of the space. The effect of dimensionality on the behavior of Euclidean distance is explored; Furthermore, it is shown that the minimum distance approaches the maximum distance under a broader set of conditions without requiring the calculation of variance of random variables. It is demonstrated that the minimum distance approaches the maximum distance even for some low dimensional distributions, such as normal distribution. Many proposed measures not based directly on Euclidean distance cannot enlarger the difference between closest point and farthest point. The analysis has been performed on a wide range of artificial and publicly available datasets. As the variables of different distributions have different convergence rates, the results should not be interpreted to mean that the Euclidean distance is not applicable. In fact, it is shown in experiments that the Euclidean distance is very useful in the noncentral t-distribution even for the dimensionality higher than 10,000. Furthermore, it is observed that the behavior of Euclidean distance becomes more useful with increased number of samples. (C) 2015 Elsevier GmbH. All rights reserved.
引用
收藏
页码:5614 / 5619
页数:6
相关论文
共 50 条
  • [1] On Distance Mapping from non-Euclidean Spaces to Euclidean Spaces
    Ren, Wei
    Miche, Yoan
    Oliver, Ian
    Holtmanns, Silke
    Bjork, Kaj-Mikael
    Lendasse, Amaury
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, CD-MAKE 2017, 2017, 10410 : 3 - 13
  • [2] Packing hyperspheres in high-dimensional Euclidean spaces
    Skoge, Monica
    Donev, Aleksandar
    Stillinger, Frank H.
    Torquato, Salvatore
    [J]. PHYSICAL REVIEW E, 2006, 74 (04)
  • [3] Differentially Private Clustering in High-Dimensional Euclidean Spaces
    Balcan, Maria-Florina
    Dick, Travis
    Liang, Yingyu
    Mou, Wenlong
    Zhang, Hongyang
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [4] On stochastic generation of ultrametrics in high-dimensional Euclidean spaces
    Zubarev A.P.
    [J]. P-Adic Numbers, Ultrametric Analysis, and Applications, 2014, 6 (2) : 155 - 165
  • [5] Comment on "Packing hyperspheres in high-dimensional Euclidean spaces"
    Zamponi, Francesco
    [J]. PHYSICAL REVIEW E, 2007, 75 (04):
  • [6] An effective method for approximating the Euclidean distance in high-dimensional space
    Jeong, Seungdo
    Kim, Sang-Wook
    Kim, Kidong
    Choi, Byung-Uk
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, 4080 : 863 - 872
  • [7] Weak curvatures of irregular curves in high-dimensional Euclidean spaces
    Mucci, Domenico
    Saracco, Alberto
    [J]. ANNALS OF GLOBAL ANALYSIS AND GEOMETRY, 2021, 60 (02) : 181 - 216
  • [8] Homology of moduli spaces of linkages in high-dimensional Euclidean space
    Schuetz, Dirk
    [J]. ALGEBRAIC AND GEOMETRIC TOPOLOGY, 2013, 13 (02): : 1183 - 1224
  • [9] Weak curvatures of irregular curves in high-dimensional Euclidean spaces
    Domenico Mucci
    Alberto Saracco
    [J]. Annals of Global Analysis and Geometry, 2021, 60 : 181 - 216
  • [10] SMALLNESS OF THE SET OF CRITICAL VALUES OF DISTANCE FUNCTIONS IN TWO-DIMENSIONAL EUCLIDEAN AND RIEMANNIAN SPACES
    Rataj, Jan
    Zajicek, Ludek
    [J]. MATHEMATIKA, 2020, 66 (02) : 297 - 324