Outlier detection in cylindrical data based on Mahalanobis distance

被引:0
|
作者
Dhamale, Prashant S. [1 ,2 ]
Kashikar, Akanksha S. [1 ]
机构
[1] Savitribai Phule Pune Univ, Dept Stat, Pune, India
[2] SVKMS NMIMS Deemed Univ, Nilkamal Sch Math Appl Stat & Analyt, Dept Stat, Mumbai, India
关键词
Bootstrap; Cylindrical data; Johnson-Wehrly distribution; Mahalanobis distance; Outliers; DISCORDANCY;
D O I
10.1080/03610918.2023.2252630
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Cylindrical data are bivariate data formed from the combination of circular and linear variables. Identifying outliers is a crucial step in any data analysis work. This paper proposes a new distribution-free procedure to detect outliers in cylindrical data using the Mahalanobis distance concept. The use of Mahalanobis distance incorporates the correlation between the components of the cylindrical distribution, which had not been accounted for in the earlier papers on outlier detection in cylindrical data. The threshold for declaring an observation to be an outlier can be obtained via parametric or non-parametric bootstrap, depending on whether the underlying distribution is known or unknown. The performance of the proposed method is examined via extensive simulations from the Johnson-Wehrly distribution. The proposed method is applied to two real datasets, and the outliers are identified in those datasets.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Hardware Trojan Detection Based on Cluster Analysis of Mahalanobis Distance
    Cui, Qi
    Zhang, Lei
    Sun, Kewang
    Li, Dongxu
    Wang, Sixiang
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 1, 2016, : 234 - 238
  • [42] Sensor Fault Detection Based on Particle Filter and Mahalanobis Distance
    Li, Tianzhi
    Liu, Gang
    Zhang, Liangliang
    JORDAN JOURNAL OF CIVIL ENGINEERING, 2019, 13 (04) : 501 - 507
  • [43] Fast Distance-based Outlier Detection in Data Streams based on Micro-clusters
    Tran, Luan
    Fan, Liyue
    Shahabi, Cyrus
    SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 162 - 169
  • [44] Research on the detection method of driver fatigue based on Mahalanobis Distance
    Qi Yu-ming
    Deng San-peng
    Wang Qian
    Miao De-hua
    Guo Shi-jie
    2011 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION AND INDUSTRIAL APPLICATION (ICIA2011), VOL IV, 2011, : 75 - 78
  • [45] DISCRIMINATIVE TRAINING BASED ON MAHALANOBIS DISTANCE FOR PATHOLOGIC VOICE DETECTION
    Sarria-Paja, M.
    Castellanos-Dominguez, G.
    DYNA-COLOMBIA, 2010, 77 (164): : 220 - 228
  • [46] The Application of Mahalanobis Distance Based on the Ridge Estimation in Data with Multicollinearity
    Tao, Jian-Bo
    Cheng, Long-Sheng
    INTERNATIONAL CONFERENCE ON MECHANICS AND CONTROL ENGINEERING (MCE 2015), 2015, : 344 - 349
  • [47] Distance-based outlier detection for high dimension, low sample size data
    Ahn, Jeongyoun
    Lee, Myung Hee
    Lee, Jung Ae
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (01) : 13 - 29
  • [48] Distance Ratio-based Weighted Rank Outlier Detection on Wearable Health Data
    Wang, Kang
    Thou, Zhiping
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 583 - 588
  • [49] An Unbiased Distance-Based Outlier Detection Approach for High-Dimensional Data
    Hoang Vu Nguyen
    Gopalkrishnan, Vivekanand
    Assent, Ira
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, 2011, 6587 : 138 - +
  • [50] GPU Strategies for Distance-Based Outlier Detection
    Angiulli, Fabrizio
    Basta, Stefano
    Lodi, Stefano
    Sartori, Claudio
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (11) : 3256 - 3268