Unsupervised Outlier Detection in Streaming Data Using Weighted Clustering

被引:0
|
作者
Thakran, Yogita [1 ]
Toshniwal, Durga [1 ]
机构
[1] Indian Inst Technol, Dept Elect & Comp Engn, Roorkee, Uttar Pradesh, India
关键词
Concept Evolution; Irrelevant Attributes; Streaming Data; Unsupervised Outlier Detection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection is a very important task in many fields like network intrusion detection, credit card fraud detection, stock market analysis, detecting outlying cases in medical data etc. Outlier detection in streaming data is very challenging because streaming data cannot be scanned multiple times and also new concepts may keep evolving in coming data over time. Irrelevant attributes can be termed as noisy attributes and such attributes further magnify the challenge of working with data streams. In this paper, we propose an unsupervised outlier detection scheme for streaming data. This scheme is based on clustering as clustering is an unsupervised data mining task and it does not require labeled data. In proposed scheme both density based and partitioning clustering method are combined to take advantage of both density based and distance based outlier detection. Proposed scheme also assigns weights to attributes depending upon their respective relevance in mining task and weights are adaptive in nature. Weighted attributes are helpful to reduce or remove the effect of noisy attributes. Keeping in view the challenges of streaming data, the proposed scheme is incremental and adaptive to concept evolution. Experimental results on synthetic and real world data sets show that our proposed approach outperforms other existing approach (CORM) in terms of outlier detection rate, false alarm rate, and increasing percentages of outliers.
引用
收藏
页码:947 / 952
页数:6
相关论文
共 50 条
  • [1] Unsupervised clustering in streaming data
    Tasoulis, Dimitris K.
    Adams, Niall M.
    Hand, David J.
    [J]. ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 638 - +
  • [2] Unsupervised Feature Selection for Outlier Detection on Streaming Data to Enhance Network Security
    Heigl, Michael
    Weigelt, Enrico
    Fiala, Dalibor
    Schramm, Martin
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [3] Unsupervised outlier detection in multidimensional data
    Atiq ur Rehman
    Samir Brahim Belhaouari
    [J]. Journal of Big Data, 8
  • [4] Unsupervised outlier detection in multidimensional data
    Ur Rehman, Atiq
    Belhaouari, Samir Brahim
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [5] Outlier Detection for Categorial Data Using Clustering Algorithms
    Nowak-Brzezinska, Agnieszka
    Lazarz, Weronika
    [J]. COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 714 - 727
  • [6] An Efficient Outlier Detection Approach for Streaming Sensor Data Based on Neighbor Difference and Clustering
    Cai, Saihua
    Chen, Jinfu
    Yin, Baoquan
    Sun, Ruizhi
    Zhang, Chi
    Chen, Haibo
    Chen, Jingyi
    Lin, Min
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [7] Outlier Detection in Streaming Data A research Perspective
    Chugh, Neeraj
    Chugh, Mitali
    Agarwal, Alok
    [J]. 2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 429 - 432
  • [8] Outlier Detection in Data Streams Using Various Clustering Approaches
    Makkar, Kusum
    Sharma, Meghna
    [J]. 2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 690 - 693
  • [9] Outlier Detection in Spatial Databases Using Clustering Data Mining
    Karmaker, Amitava
    Rahman, Syed M.
    [J]. PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, VOLS 1-3, 2009, : 1657 - +
  • [10] Unsupervised clustering of mammograms for outlier detection and breast density estimation
    Tlusty, Tal
    Amit, Guy
    Ben-Ari, Rami
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3808 - 3813