A Framework for Outlier Detection in Evolving Data Streams by Weighting Attributes in Clustering

被引:9
|
作者
Yogita [1 ]
Toshniwal, Durga [1 ]
机构
[1] IIT Roorkee, Dept Elect & Comp Engn, Roorkee 247667, Uttar Pradesh, India
关键词
Data Streams; Outlier Detection; Concept Evolution; Irrelevant Attribute;
D O I
10.1016/j.protcy.2012.10.026
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Outlier detection in streaming data is a very challenging problem. This is because of the fact that data streams cannot be scanned multiple times. Also new concepts may keep evolving. Irrelevant attributes can be termed as noisy attributes and such attributes further magnify the challenge of working with data streams. In this paper, we propose a clustering based framework for outlier detection in evolving data streams that assigns weights to attributes depending upon their respective relevance. Weighted attributes are helpful to reduce or remove the effect of noisy attributes in mining tasks. Keeping in view the challenges of data stream mining, the proposed framework is incremental and adaptive to concept evolution. Experimental results on synthetic and real world data sets show that our proposed approach outperforms other existing approaches in terms of outlier detection rate, false alarm rate, running time and with increasing percentages of outliers. (C) 2012 The Authors. Published by Elsevier Ltd. Selection and/or peer-review under responsibility of the Department of Computer Science & Engineering, National Institute of Technology Rourkela
引用
收藏
页码:214 / 222
页数:9
相关论文
共 50 条
  • [1] Statistical hierarchical clustering algorithm for outlier detection in evolving data streams
    Dalibor Krleža
    Boris Vrdoljak
    Mario Brčić
    [J]. Machine Learning, 2021, 110 : 139 - 184
  • [2] Statistical hierarchical clustering algorithm for outlier detection in evolving data streams
    Krleza, Dalibor
    Vrdoljak, Boris
    Brcic, Mario
    [J]. MACHINE LEARNING, 2021, 110 (01) : 139 - 184
  • [3] An Effective Ensemble-based Framework for Outlier Detection in Evolving Data Streams
    Hassan, Asmaa F.
    Barakat, Sherif
    Rezk, Amira
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 315 - 329
  • [4] A Hybrid Clustering Algorithm for Outlier Detection in Data Streams
    Vijayarani, S.
    Jothi, P.
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (11): : 285 - 295
  • [5] Outlier Detection in Data Streams Using Various Clustering Approaches
    Makkar, Kusum
    Sharma, Meghna
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 690 - 693
  • [6] An Outlier Detection Algorithm for Data Streams Based on Fuzzy Clustering
    Su, Xiaoke
    Qin, Yuming
    Wan, Renxia
    [J]. PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, 2008, : 109 - 112
  • [7] An Adaptive Clustering Approach for Distributed Outlier Detection in Data Streams
    Della Monaca, Andrea
    Cafaro, Massimo
    Pulimeno, Marco
    Epicoco, Italo
    [J]. 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2023, 583 : 86 - 99
  • [8] Online Clustering for Evolving Data Streams with Online Anomaly Detection
    Chenaghlou, Milad
    Moshtaghi, Masud
    Leckie, Christopher
    Salehi, Mahsa
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, 2018, 10938 : 506 - 519
  • [9] Dynamically Evolving Clustering for Data Streams
    Baruah, Rashmi Dutta
    Angelov, Plamen
    Baruah, Diganta
    [J]. 2014 IEEE CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS (EAIS), 2014,
  • [10] Outlier Detection on Uncertain Data Streams
    Zhu, Bin
    Zhong, Yuling
    Wang, Xite
    Bai, Mei
    [J]. Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2020, 47 (02): : 134 - 140