Novel Clustering-Based Approach for Local Outlier Detection

被引:0
|
作者
Du, Haizhou [1 ]
Zhao, Shengjie [1 ]
Zhang, Daqiang [2 ]
Wu, Jinsong [3 ]
机构
[1] Tongji Univ, Minist Educ, Key Lab Embedded Syst & Serv Comp, Shanghai, Peoples R China
[2] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China
[3] Univ Chile, Dept Elect Engn, Santiago, Chile
关键词
Outlier detection; Big data; Data mining; Clustering-based; ALGORITHMS; SELECTION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the rapid expansion of data scale, big data mining and analysis have attracted increasing attention. Outlier detection as an important task of data mining is widely used in many applications. However, conventional outlier detection methods have difficulty handling large-scale datasets. In addition, most of them typically can only identify global outliers and are over sensitive to parameters variation. In this paper, we propose a novel method for robust local outlier detection with statistical parameters, which incorporates the clustering-based ideas in dealing with big data. Firstly, this method finds some density peaks of dataset by 3 sigma standard. Secondly, each remaining data object in the dataset is assigned to the same cluster as its nearest neighbor of higher density. Finally, we use Chebyshev's inequality and density peak reachability to identify local outliers of each group. The experimental results demonstrate the efficiency and accuracy of the proposed method in identifying both global and local outliers. Moreover, the method is also proved to be more stability analysis than typical outlier detection methods, such as LOF(Local Outlier Factor) and DBSCAN(Density-Based Spatial Clustering of Applications with Noise).
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Fuzzy Clustering-Based Approach for Outlier Detection
    Al-Zoubi, Moh'd Belal
    Ali, Al-Dahoud
    Yahya, Abdelfatah A.
    RECENT ADVANCES AND APPLICATIONS OF COMPUTER ENGINEERING: PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE (ACE 10), 2010, : 192 - +
  • [2] Clustering-Based Outlier Detection Method
    Jiang, Sheng-yi
    An, Qing-bo
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 429 - 433
  • [3] Clustering-Based Trajectory Outlier Detection
    Eldawy, Eman O.
    Mokhtar, Hoda M. O.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 133 - 139
  • [4] An Efficient Outlier Detection and Classification Clustering-Based Approach for WSN
    Al Samara, Mustafa
    Bennis, Ismail
    Abouaissa, Abdelhafid
    Lorenz, Pascal
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [5] A comparative evaluation of clustering-based outlier detection
    Vinces, Braulio V. Sanchez
    Schubert, Erich
    Zimek, Arthur
    Cordeiro, Robson L. F.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2025, 39 (02)
  • [6] A clustering-based method for outlier detection under concept drift
    Tahir, Mahjabeen
    Abdullah, Azizol
    Udzir, Nur Izura
    Kasmiran, Khairul Azhar
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2024, 43 (03) : 205 - 218
  • [7] Random clustering-based outlier detector
    Kiersztyn A.
    Pylak D.
    Horodelski M.
    Kiersztyn K.
    Urbanovich P.
    Information Sciences, 2024, 667
  • [8] A novel approach to noise clustering for outlier detection
    Rehm, Frank
    Klawonn, Frank
    Kruse, Rudolf
    SOFT COMPUTING, 2007, 11 (05) : 489 - 494
  • [9] A Novel Approach for Outlier Detection and Clustering Improvement
    Ahmed, Mohiuddin
    Mahmood, Abdun Naser
    PROCEEDINGS OF THE 2013 IEEE 8TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2013, : 577 - 582
  • [10] A Novel Approach to Noise Clustering for Outlier Detection
    Frank Rehm
    Frank Klawonn
    Rudolf Kruse
    Soft Computing, 2007, 11 : 489 - 494