Comparative Study of Clustering-Based Outliers Detection Methods in Circular-Circular Regression Model

Cited by: 2
Authors
Satari, Siti Zanariah [1]
Di, Nur Faraidah Muhammad [1]
Zubairi, Yong Zulina [2]
Hussin, Abdul Ghapor [3]
Affiliations
[1] Univ Malaysia Pahang, Coll Comp & Appl Sci, Ctr Math Sci, Kuantan 26300, Pahang Darul Ma, Malaysia
[2] Univ Malaya, Ctr Fdn Studies Sci, Kuala Lumpur 50603, Federal Territo, Malaysia
[3] Natl Def Univ Malaysia, Fac Def Sci & Technol, Sungai Besi Camp, Kuala Lumpur 57000, Federal Territo, Malaysia
Source
SAINS MALAYSIANA | 2021 / Vol. 50 / No. 06
Keywords
Circular distance; circular-circular regression model; clustering; outliers; stopping rule; FUNCTIONAL-RELATIONSHIP MODEL;
DOI
10.17576/jsm-2021-5006-24
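Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code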
07 ; 0710 ; 09 ;
Abstract
This paper presents a comparative study of several clustering-based algorithms for detecting multiple outliers in the circular-circular regression model. Three measures of similarity based on the circular distance were used to obtain a cluster tree using agglomerative hierarchical methods. A stopping rule for the cluster tree, based on the mean direction and circular standard deviation of the tree height, was used as the cutoff point, and cluster groups that exceeded the stopping rule were classified as potential outliers. The performance of the algorithms was demonstrated through simulation studies that consider several outlier scenarios with a certain degree of contamination. Applications to wind data and to a simulated data set are given for illustrative purposes. Satari's algorithm (S-SL algorithm) was found to perform well for any values of sample size n and error concentration parameter. The algorithms are good at identifying not just one or a few outliers but also multiple outliers present at the same time.
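The pipeline described in the abstract (circular distance, agglomerative cluster tree, stopping rule on the tree heights) can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several choices are assumptions, since the abstract does not specify them: the similarity measure is taken to be the sum of circular distances in the two variables, the linkage is single linkage, clustering is done on the raw (u, v) pairs, the cutoff is the circular mean of the heights plus a hypothetical multiplier `alpha` times their circular standard deviation, and every cluster other than the largest is flagged.

```python
import math

def circ_dist(a, b):
    """Shortest arc between two angles in radians; result lies in [0, pi]."""
    return math.pi - abs(math.pi - abs(a - b) % (2 * math.pi))

def pair_dist(p, q):
    # Assumed similarity measure: sum of circular distances in u and in v.
    return circ_dist(p[0], q[0]) + circ_dist(p[1], q[1])

def agglomerate(points, cut=math.inf):
    """Naive single-linkage merging; stops once the next merge height exceeds cut."""
    clusters = [[i] for i in range(len(points))]
    heights = []
    while len(clusters) > 1:
        # Closest pair of clusters under single linkage.
        d, i, j = min(
            (min(pair_dist(points[p], points[q])
                 for p in clusters[i] for q in clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        if d > cut:
            break
        heights.append(d)
        merged = clusters.pop(j)  # j > i, so index i stays valid
        clusters[i].extend(merged)
    return clusters, heights

def cut_height(heights, alpha=3.0):
    """Assumed stopping rule: circular mean of heights + alpha * circular SD."""
    c = sum(math.cos(h) for h in heights) / len(heights)
    s = sum(math.sin(h) for h in heights) / len(heights)
    mean_dir = math.atan2(s, c) % (2 * math.pi)
    r = math.hypot(c, s)  # mean resultant length; assumed nonzero here
    circ_sd = math.sqrt(max(0.0, -2.0 * math.log(r)))
    return mean_dir + alpha * circ_sd

def flag_potential_outliers(u, v, alpha=3.0):
    """Indices of observations outside the largest cluster after cutting the tree."""
    points = list(zip(u, v))
    _, heights = agglomerate(points)  # full tree, to collect all merge heights
    clusters, _ = agglomerate(points, cut_height(heights, alpha))
    largest = max(clusters, key=len)
    return sorted(i for c in clusters if c is not largest for i in c)

# Made-up data: ten points with v = u, plus two points shifted by 3 rad in v.
u = [0.1 * i for i in range(10)] + [0.5, 0.6]
v = [0.1 * i for i in range(10)] + [3.5, 3.6]
flagged = flag_potential_outliers(u, v)
print(flagged)
```

The naive merge loop is cubic in the number of observations, which is acceptable for a small illustration; a library routine for hierarchical clustering would be the natural substitute for larger samples.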
Pages: 1787-1798
Number of pages: 12
Related papers
50 in total
  • [31] Comparative analysis of learning methods of fuzzy clustering-based neural network pattern classifier
    Kim E.-H.
    Oh S.-K.
    Kim H.-K.
    Oh, Sung-Kwun (ohsk@suwon.ac.kr), 1600, Korean Institute of Electrical Engineers (65): : 1541 - 1550
  • [32] Outlier detection in circular regression model using minimum spanning tree method
    Di, Nur Faraidah Muhammad
    Satari, Siti Zanariah
    Zakaria, Roslinazairimah
    2ND INTERNATIONAL CONFERENCE ON APPLIED & INDUSTRIAL MATHEMATICS AND STATISTICS, 2019, 1366
  • [33] A Bayesian regression model for circular data based on the projected normal distribution
    Nunez-Antonio, Gabriel
    Gutierrez-Pena, Eduardo
    Escarela, Gabriel
    STATISTICAL MODELLING, 2011, 11 (03) : 185 - 201
  • [34] Prediction of the Rock Mass Diggability Index by Using Fuzzy Clustering-Based, ANN and Multiple Regression Methods
    Omid Saeidi
    Seyed Rahman Torabi
    Mohammad Ataei
    Rock Mechanics and Rock Engineering, 2014, 47 : 717 - 732
  • [35] Prediction of the Rock Mass Diggability Index by Using Fuzzy Clustering-Based, ANN and Multiple Regression Methods
    Saeidi, Omid
    Torabi, Seyed Rahman
    Ataei, Mohammad
    ROCK MECHANICS AND ROCK ENGINEERING, 2014, 47 (02) : 717 - 732
  • [36] A NOVEL CENTROID UPDATE APPROACH FOR CLUSTERING-BASED SUPERPIXEL METHODS AND SUPERPIXEL-BASED EDGE DETECTION
    Zhang, Houwang
    Wu, Chong
    Zhang, Le
    Zheng, Hanying
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 693 - 697
  • [37] Detection of Influential Observations in Spatial Regression Model Based on Outliers and Bad Leverage Classification
    Baba, Ali Mohammed
    Midi, Habshah
    Adam, Mohd Bakri
    Rahman, Nur Haizum Abd
    SYMMETRY-BASEL, 2021, 13 (11):
  • [38] Comparing Metaheuristic Search Techniques in Addressing the Effectiveness of Clustering-Based DDoS Attack Detection Methods
    Zeinalpour, Alireza
    McElroy, Charles P.
    ELECTRONICS, 2024, 13 (05)
  • [39] A Clustering-Based Hybrid Support Vector Regression Model to Predict Container Volume at Seaport Sanitary Facilities
    Jesus Ruiz-Aguilar, Juan
    Antonio Moscoso-Lopez, Jose
    Urda, Daniel
    Gonzalez-Enrique, Javier
    Turias, Ignacio
    APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 17
  • [40] On semantic clustering and adaptive robust regression based energy-aware communication with true outliers detection in WSN
    Chowdhury, Srijit
    Roy, Ambarish
    Benslimane, Abderrahim
    Giri, Chandan
    AD HOC NETWORKS, 2019, 94