An evolving approach to data streams clustering based on typicality and eccentricity data analytics

被引:37
|
作者
Bezerra, Clauber Gomes [1 ]
Jales Costa, Bruno Sielly [2 ]
Guedes, Luiz Affonso [3 ]
Angelov, Plamen Parvanov [4 ]
机构
[1] Fed Inst Educ Sci & Technol Rio Grande Norte do N, Campus Natal Zona Leste, BR-59015000 Natal, RN, Brazil
[2] Fed Inst Educ Sci & Technol Rio Grande Norte do N, Campus Natal Zona Norte Rua Brusque 2926, BR-59112490 Natal, RN, Brazil
[3] Fed Inst Educ Sci & Technol Rio Grande Norte do N, Dept Comp Engn & Automat, DCA Campus Univ, BR-59078900 Natal, RN, Brazil
[4] Univ Lancaster, Sch Comp & Commun, Data Sci Grp, Lancaster LA1 4WA, England
关键词
Online clustering; Data stream; Eccentricity; Typicality; Anomaly detection; FAULT-DETECTION; FUZZY; IDENTIFICATION; CLASSIFICATION;
D O I
10.1016/j.ins.2019.12.022
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we propose an algorithm for online clustering of data streams. This algorithm is called AutoCloud and is based on the recently introduced concept of Typicality and Eccentricity Data Analytics, mainly used for anomaly detection tasks. AutoCloud is an evolving, online and recursive technique that does not need training or prior knowledge about the data set. Thus, AutoCloud is fully online, requiring no offline processing. It allows creation and merging of clusters autonomously as new data observations become available. The clusters created by AutoCloud are called data clouds, which are structures without pre-defined shape or boundaries. AutoCloud allows each data sample to belong to multiple data clouds simultaneously using fuzzy concepts. AutoCloud is also able to handle concept drift and concept evolution, which are problems that are inherent in data streams in general. Since the algorithm is recursive and online, it is suitable for applications that require a real-time response. We validate our proposal with applications to multiple well known data sets in the literature. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:13 / 28
页数:16
相关论文
共 50 条
  • [41] Clustering data streams
    Guha, S
    Mishra, N
    Motwani, R
    O'Callaghan, L
    [J]. 41ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2000, : 359 - 366
  • [42] An effective density-based clustering and dynamic maintenance framework for evolving medical data streams
    Al-Shammari, Ahmed
    Zhou, Rui
    Naseriparsaa, Mehdi
    Liu, Chengfei
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 126 : 176 - 186
  • [43] Dynamic Analytics for Spatial Data with an Incremental Clustering Approach
    Mendes, Fernando
    Santos, Maribel Yasmina
    Moura-Pires, Joao
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 552 - 559
  • [44] Clustering The Driving Features Based On Data Streams
    Kalsoom, Rizwana
    Halim, Zahid
    [J]. 2013 16TH INTERNATIONAL MULTI TOPIC CONFERENCE (INMIC), 2013, : 89 - 94
  • [45] A Method For Evolving Data Streams
    Wankhade, Kapil
    Hasan, Tasneem
    Thool, Ravindra
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1619 - 1622
  • [46] A clustering approach for sampling data streams in sensor networks
    da Silva, Alzennyr
    Chiky, Raja
    Hebrail, Georges
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (01) : 1 - 23
  • [47] A clustering approach for sampling data streams in sensor networks
    Alzennyr da Silva
    Raja Chiky
    Georges Hébrail
    [J]. Knowledge and Information Systems, 2012, 32 : 1 - 23
  • [48] Incremental Clustering Approach for Evolving Trajectory Data Stream
    Shein, Thi Thi
    Puntheeranurak, Sutheera
    [J]. 2018 6TH INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2018,
  • [49] AGILE: A general approach to detect transitions in evolving data streams
    Yang, J
    Wang, W
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 559 - 562
  • [50] Typicality Distribution Function - A New Density-based Data Analytics Tool
    Angelov, Plamen
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,