Comparative Study of Clustering-Based Outliers Detection Methods in Circular- Circular Regression Model

被引:2
|
作者
Satari, Siti Zanariah [1 ]
Di, Nur Faraidah Muhammad [1 ]
Zubairi, Yong Zulina [2 ]
Hussin, Abdul Ghapor [3 ]
机构
[1] Univ Malaysia Pahang, Coll Comp & Appl Sci, Ctr Math Sci, Kuantan 26300, Pahang Darul Ma, Malaysia
[2] Univ Malaya, Ctr Fdn Studies Sci, Kuala Lumpur 50603, Federal Territo, Malaysia
[3] Natl Def Univ Malaysia, Fac Def Sci & Technol, Sungai Besi Camp, Kuala Lumpur 57000, Federal Territo, Malaysia
来源
SAINS MALAYSIANA | 2021年 / 50卷 / 06期
关键词
Circular distance; circular-circular regression model; clustering; outliers; stopping rule; FUNCTIONAL-RELATIONSHIP MODEL;
D O I
10.17576/jsm-2021-5006-24
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper is a comparative study of several algorithms for detecting multiple outliers in circular-circular regression model based on the clustering algorithms. Three measures of similarity based on the circular distance were used to obtain a cluster tree using the agglomerative hierarchical methods. A stopping rule for the cluster tree based on the mean direction and circular standard deviation of the tree height was used as the cutoff point and classifier to the cluster group that exceeded the stopping rule as potential outliers. The performances of the algorithms have been demonstrated using the simulation studies that consider several outlier scenarios with a certain degree of contamination. Application to real data using wind data and a simulated data set are given for illustrative purposes. Thus, it has been found that Satari's algorithm (S-SL algorithm) performs well for any values of sample size n and error concentration parameter. The algorithms are good in identifying outliers which are not limited to one or few outliers only, but the presence of multiple outliers at one time.
引用
收藏
页码:1787 / 1798
页数:12
相关论文
共 50 条
  • [41] A clustering-based survival comparison procedure designed to study the Caenorhabditis elegans model
    Paul-Marie Grollemund
    Cyril Poupet
    Élise Comte
    Muriel Bonnet
    Philippe Veisseire
    Stéphanie Bornes
    Scientific Reports, 14 (1)
  • [43] Detection of different outlier scenarios in circular regression model using single-linkage method
    Di, N. F. M.
    Satari, S. Z.
    Zakaria, R.
    1ST INTERNATIONAL CONFERENCE ON APPLIED & INDUSTRIAL MATHEMATICS AND STATISTICS 2017 (ICOAIMS 2017), 2017, 890
  • [44] Comparative study on determination methods of resistance curves of circular joints based on single edge notched tensile specimens
    Gong B.
    Tian R.
    Liu X.
    Deng C.
    Wang D.
    Hanjie Xuebao/Transactions of the China Welding Institution, 2022, 43 (05): : 21 - 28
  • [45] Model-Based Detection and Localization of Circular Landmarks in Aerial Images
    Christian Drewniok
    Karl Rohr
    International Journal of Computer Vision, 1997, 24 : 187 - 217
  • [46] Model-based detection and localization of circular landmarks in aerial images
    Drewniok, C
    Rohr, K
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 1997, 24 (03) : 187 - 217
  • [47] Model-based clustering of multivariate skew data with circular components and missing values
    Lagona, Francesco
    Picone, Marco
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (05) : 927 - 945
  • [48] Model-based clustering for noisy longitudinal circular data, with application to animal movement
    Ranalli, M.
    Maruotti, A.
    ENVIRONMETRICS, 2020, 31 (02)
  • [49] A study on fuzzy C-means clustering-based systems in automatic spike detection
    Inan, Z. Hilal
    Kuntalp, Mehmet
    COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (08) : 1160 - 1166
  • [50] Improvement of the regression model for spindle thermal elongation by a Boosting-based outliers detection approach
    Lei, Mohan
    Jiang, Gedong
    Yang, Jun
    Mei, Xuesong
    Xia, Ping
    Shi, Hu
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2018, 99 (5-8): : 1389 - 1403