A Novel Density-Based Clustering Approach for Outlier Detection in High-Dimensional Data

被引:2
|
作者
Messaoud, Thouraya Aouled [1 ]
Smiti, Abir [2 ]
Louati, Aymen [1 ]
机构
[1] Univ Jendouba, Inst Super Informat Kef, Jendouba, Tunisia
[2] Inst Super Gest Tunis, LARODEC, Tunis, Tunisia
关键词
Outliers; Feature selection; Clustering; DBSCAN;
D O I
10.1007/978-3-030-29859-3_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection is a primary aspect in data-mining and machine learning applications, also known as outlier mining. The importance of outlier detection in medical data came from the fact that outliers may carry some precious information however outlier detection can show very bad performance in the presence of high dimensional data. In this paper, a new outlier detection technique is proposed based on a feature selection strategy to avoid the curse of dimensionality, named Infinite Feature Selection DBSCAN. The main purpose of our proposed method is to reduce the dimensions of a high dimensional data set in order to efficiently identify outliers using clustering techniques. Simulations on real databases proved the effectiveness of our method taking into account the accuracy, the error-rate, F-score and the retrieval time of the algorithm.
引用
收藏
页码:322 / 331
页数:10
相关论文
共 50 条
  • [21] Enhancing density-based clustering: Parameter reduction and outlier detection
    Cassisi, Carmelo
    Ferro, Alfredo
    Giugno, Rosalba
    Pigola, Giuseppe
    Pulvirenti, Alfredo
    [J]. INFORMATION SYSTEMS, 2013, 38 (03) : 317 - 330
  • [22] Research on Outlier Detection for High-Dimensional Data Based on PPCLOF
    Chen, Chen
    Luo, Kaiwen
    Min, Lan
    Li, Shenglin
    [J]. JOURNAL OF WEB ENGINEERING, 2021, 20 (03): : 743 - 758
  • [23] Thresholding-based outlier detection for high-dimensional data
    Yang, Xiaona
    Wang, Zhaojun
    Zi, Xuemin
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (11) : 2170 - 2184
  • [24] Efficient Outlier Detection for High-Dimensional Data
    Liu, Huawen
    Li, Xuelong
    Li, Jiuyong
    Zhang, Shichao
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (12): : 2451 - 2461
  • [25] On eigenfunction approach to data mining: outlier detection in high-dimensional data sets
    Nagar, AK
    Muyeba, MK
    [J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 251 - 256
  • [26] DWOF: A Robust Density-Based Outlier Detection Approach
    Momtaz, Rana
    Mohssen, Nesma
    Gowayyed, Mohammad A.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2013, 2013, 7887 : 517 - 525
  • [27] Density-Based Local Outlier Detection on Uncertain Data
    Cao, Keyan
    Shi, Lingxu
    Wang, Guoren
    Han, Donghong
    Bai, Mei
    [J]. WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 67 - 71
  • [28] Hierarchical density-based clustering in high-dimensional spaces using topographic maps
    Gautama, T
    Van Hulle, MM
    [J]. NEURAL NETWORKS FOR SIGNAL PROCESSING X, VOLS 1 AND 2, PROCEEDINGS, 2000, : 251 - 260
  • [29] Using Projection-Based Clustering to Find Distance- and Density-Based Clusters in High-Dimensional Data
    Michael C. Thrun
    Alfred Ultsch
    [J]. Journal of Classification, 2021, 38 : 280 - 312
  • [30] Using Projection-Based Clustering to Find Distance- and Density-Based Clusters in High-Dimensional Data
    Thrun, Michael C.
    Ultsch, Alfred
    [J]. JOURNAL OF CLASSIFICATION, 2021, 38 (02) : 280 - 312