Distance-based depths for directional data

被引:18
|
作者
Pandolfo, Giuseppe [1 ]
Paindaveine, Davy [2 ,3 ]
Porzio, Giovanni C. [4 ]
机构
[1] Univ Naples Federico II, Dept Ind Engn, I-80125 Naples, Italy
[2] Univ Libre Bruxelles, ECARES, B-1050 Brussels, Belgium
[3] Univ Libre Bruxelles, Dept Math, B-1050 Brussels, Belgium
[4] Univ Cassino & Southern Lazio, Dept Econ & Law, I-03043 Cassino, Italy
关键词
Arc distance depth; chord distance depth; cosine distance depth; hyperspheres; spherical location; statistical depth; supervised classification; MULTIVARIATE;
D O I
10.1002/cjs.11479
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Directional data are constrained to lie on the unit sphere of Double-struck capital R-q for some q >= 2. To address the lack of a natural ordering for such data, depth functions have been defined on spheres. However, the depths available either lack flexibility or are so computationally expensive that they can only be used for very small dimensions q. In this work, we improve on this by introducing a class of distance-based depths for directional data. Irrespective of the distance adopted, these depths can easily be computed in high dimensions too. We derive the main structural properties of the proposed depths and study how they depend on the distance used. We discuss the asymptotic and robustness properties of the corresponding deepest points. We show the practical relevance of the proposed depths in two applications, related to (i) spherical location estimation and (ii) supervised classification. For both problems, we show through simulation studies that distance-based depths have strong advantages over their competitors. The Canadian Journal of Statistics 46: 593-609; 2018 (c) 2018 Societe statistique du Canada
引用
收藏
页码:593 / 609
页数:17
相关论文
共 50 条
  • [41] A Distance-Based Data Aggregation Technique for Periodic Sensor Networks
    Harb, Hassan
    Makhoul, Abdallah
    Laiymani, David
    Jaber, Ali
    [J]. ACM TRANSACTIONS ON SENSOR NETWORKS, 2017, 13 (04)
  • [42] Kernel Learning Method for Distance-Based Classification of Categorical Data
    Chen, Lifei
    Guo, Gongde
    Wang, Shengrui
    Kong, Xiangzeng
    [J]. 2014 14TH UK WORKSHOP ON COMPUTATIONAL INTELLIGENCE (UKCI), 2014, : 58 - 64
  • [43] Data mining of chess chunks: a novel distance-based structure
    Linhares, A
    [J]. DATA MINING IV, 2004, 7 : 595 - 604
  • [44] Robust Distance-Based Clustering with Applications to Spatial Data Mining
    V. Estivill-Castro
    M. E. Houle
    [J]. Algorithmica, 2001, 30 : 216 - 242
  • [45] Distance-Based Continuous Skylines on Geo-Textual Data
    Chen, Jialiang
    Zheng, Jiping
    Jiang, Shunqing
    Qiu, Xianhong
    [J]. WEB TECHNOLOGIES AND APPLICATIONS: APWEB 2016 WORKSHOPS, WDMA, GAP, AND SDMA, 2016, 9865 : 228 - 240
  • [46] Continuous Monitoring of Distance-Based Outliers over Data Streams
    Kontaki, Maria
    Gounaris, Anastasios
    Papadopoulos, Apostolos N.
    Tsichlas, Kostas
    Manolopoulos, Yannis
    [J]. IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 135 - 146
  • [47] Little data is often enough for distance-based outlier detection
    Muhr, David
    Affenzeller, Michael
    [J]. 3RD INTERNATIONAL CONFERENCE ON INDUSTRY 4.0 AND SMART MANUFACTURING, 2022, 200 : 984 - 992
  • [48] A Euclidean distance-based measure of efficiency in data envelopment analysis
    Amirteimoori, Alireza
    Kordrostami, Sohrab
    [J]. OPTIMIZATION, 2010, 59 (07) : 985 - 996
  • [49] Re-designing distance functions and distance-based applications for high dimensional data
    Aggarwal, CC
    [J]. SIGMOD RECORD, 2001, 30 (01) : 13 - 18
  • [50] Reducing distance computations for distance-based outliers
    Angiulli, Fabrizio
    Basta, Stefano
    Lodi, Stefano
    Sartori, Claudio
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 147