A Framework for Outlier Detection in Evolving Data Streams by Weighting Attributes in Clustering

被引:9
|
作者
Yogita [1 ]
Toshniwal, Durga [1 ]
机构
[1] IIT Roorkee, Dept Elect & Comp Engn, Roorkee 247667, Uttar Pradesh, India
关键词
Data Streams; Outlier Detection; Concept Evolution; Irrelevant Attribute;
D O I
10.1016/j.protcy.2012.10.026
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Outlier detection in streaming data is a very challenging problem. This is because of the fact that data streams cannot be scanned multiple times. Also new concepts may keep evolving. Irrelevant attributes can be termed as noisy attributes and such attributes further magnify the challenge of working with data streams. In this paper, we propose a clustering based framework for outlier detection in evolving data streams that assigns weights to attributes depending upon their respective relevance. Weighted attributes are helpful to reduce or remove the effect of noisy attributes in mining tasks. Keeping in view the challenges of data stream mining, the proposed framework is incremental and adaptive to concept evolution. Experimental results on synthetic and real world data sets show that our proposed approach outperforms other existing approaches in terms of outlier detection rate, false alarm rate, running time and with increasing percentages of outliers. (C) 2012 The Authors. Published by Elsevier Ltd. Selection and/or peer-review under responsibility of the Department of Computer Science & Engineering, National Institute of Technology Rourkela
引用
收藏
页码:214 / 222
页数:9
相关论文
共 50 条
  • [41] Analysis and Evaluation of Outlier Detection Algorithms in Data Streams
    Shukla, Madhu
    Kosta, Y. P.
    Chauhan, Prashant
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONTROL (IC4), 2015,
  • [42] Continuous adaptive outlier detection on distributed data streams
    Su, Liang
    Han, Weihong
    Yang, Shuqiang
    Zou, Peng
    Jia, Yan
    [J]. HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2007, 4782 : 74 - 85
  • [43] A Review of Local Outlier Factor Algorithms for Outlier Detection in Big Data Streams
    Alghushairy, Omar
    Alsini, Raed
    Soule, Terence
    Ma, Xiaogang
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (01) : 1 - 24
  • [44] XSTREAM: Outlier Dete'x'ion in Feature-Evolving Data STREAMS
    Manzoor, Emaad
    Lamba, Hemank
    Akoglu, Leman
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1963 - 1972
  • [45] Feature Drift Detection in Evolving Data Streams
    Zhao, Di
    Koh, Yun Sing
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT II, 2020, 12392 : 335 - 349
  • [46] Detection and classification of changes in evolving data streams
    Gaber, Mohamed Medhat
    Yu, Philip S.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2006, 5 (04) : 659 - 670
  • [47] Weighted Clustering and Evolutionary Analysis of Hybrid Attributes Data Streams
    Chen Xinquan
    [J]. JOURNAL OF COMPUTERS, 2008, 3 (12) : 60 - 67
  • [48] An intuitive framework for understanding changes in evolving data streams
    Aggarwal, CC
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 261 - 261
  • [49] A framework for on-demand classification of evolving data streams
    Aggarwal, CC
    Han, JW
    Wang, JY
    Yu, PS
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (05) : 577 - 589
  • [50] Time-sensitive clustering evolving textual data streams
    Ammar, Mohamed
    Hidri, Adel
    Sassi Hidri, Minyar
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 63 (1-2) : 25 - 40