A Framework for Outlier Detection in Evolving Data Streams by Weighting Attributes in Clustering

被引:9
|
作者
Yogita [1 ]
Toshniwal, Durga [1 ]
机构
[1] IIT Roorkee, Dept Elect & Comp Engn, Roorkee 247667, Uttar Pradesh, India
关键词
Data Streams; Outlier Detection; Concept Evolution; Irrelevant Attribute;
D O I
10.1016/j.protcy.2012.10.026
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Outlier detection in streaming data is a very challenging problem. This is because of the fact that data streams cannot be scanned multiple times. Also new concepts may keep evolving. Irrelevant attributes can be termed as noisy attributes and such attributes further magnify the challenge of working with data streams. In this paper, we propose a clustering based framework for outlier detection in evolving data streams that assigns weights to attributes depending upon their respective relevance. Weighted attributes are helpful to reduce or remove the effect of noisy attributes in mining tasks. Keeping in view the challenges of data stream mining, the proposed framework is incremental and adaptive to concept evolution. Experimental results on synthetic and real world data sets show that our proposed approach outperforms other existing approaches in terms of outlier detection rate, false alarm rate, running time and with increasing percentages of outliers. (C) 2012 The Authors. Published by Elsevier Ltd. Selection and/or peer-review under responsibility of the Department of Computer Science & Engineering, National Institute of Technology Rourkela
引用
收藏
页码:214 / 222
页数:9
相关论文
共 50 条
  • [21] Trajectory Outlier Detection on Trajectory Data Streams
    Cao, Keyan
    Liu, Yefan
    Meng, Gongjie
    Liu, Haoli
    Miao, Anchen
    Xu, Jingke
    [J]. IEEE Access, 2020, 8 : 34187 - 34196
  • [22] Incremental local outlier detection for data streams
    Pokrajac, Dragojub
    Lazarevic, Aleksandar
    Latecki, Longin Jan
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 504 - 515
  • [23] Continuous Outlier Detection on Uncertain Data Streams
    Shaikh, Salman Ahmed
    Kitagawa, Hiroyuki
    [J]. 2014 IEEE NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (IEEE ISSNIP 2014), 2014,
  • [24] Outlier detection over data streams: Survey
    Brahmi, Zaki
    Souiden, Imen
    [J]. International Journal of Business Intelligence and Data Mining, 2021, 19 (04) : 481 - 507
  • [25] Trajectory Outlier Detection on Trajectory Data Streams
    Cao, Keyan
    Liu, Yefan
    Meng, Gongjie
    Liu, Haoli
    Miao, Anchen
    Xu, Jingke
    [J]. IEEE ACCESS, 2020, 8 : 34187 - 34196
  • [26] An Adaptive Framework for Clustering Data Streams
    Chandrika
    Kumar, K. R. Ananda
    [J]. ADVANCES IN COMPUTING AND COMMUNICATIONS, PT I, 2011, 190 : 704 - +
  • [27] SCLOPE: An algorithm for clustering data streams of categorical attributes
    Ong, KL
    Li, WY
    Ng, WK
    Lim, EP
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2004, 3181 : 209 - 218
  • [28] Online Sparse Representation Clustering for Evolving Data Streams
    Chen, Jie
    Yang, Shengxiang
    Fahy, Conor
    Wang, Zhu
    Guo, Yinan
    Chen, Yingke
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [29] Robust Clustering for Tracking Noisy Evolving Data Streams
    Nasraoui, Olfa
    Rojas, Carlos
    [J]. PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 619 - 623
  • [30] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    [J]. DISCOVERY SCIENCE, 2013, 8140 : 79 - 93