A Clustering-based Framework for Classifying Data Streams

Cited by: 0
Authors
Yan, Xuyang [1 ]
Homaifar, Abdollah [1 ]
Sarkar, Mrinmoy [1 ]
Girma, Abenezer [1 ]
Tunstel, Edward [2 ]
Affiliations
[1] North Carolina A&T State Univ, Greensboro, NC 27401 USA
[2] Raytheon Technol Res Ctr, E Hartford, CT 06108 USA
Funding
U.S. National Science Foundation;
Keywords
CLASSIFICATION; CLASSIFIERS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The non-stationary nature of data streams strongly challenges traditional machine learning techniques. Although some solutions have been proposed to extend traditional machine learning techniques to handle data streams, these approaches either require an initial label set or rely on specialized design parameters. The overlap among classes and the labeling of data streams constitute other major challenges for classifying data streams. In this paper, we propose a clustering-based data stream classification framework that handles non-stationary data streams without an initial label set. A density-based stream clustering procedure with a dynamic threshold is used to capture novel concepts, and an active label querying strategy is introduced to continuously learn new concepts from the data streams. The sub-cluster structure of each cluster is explored to handle the overlap among classes. Experimental results and quantitative comparison studies show that the proposed method performs statistically better than, or comparably to, existing methods.
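The general idea in the abstract (open a new cluster when a point looks novel, query a label only at that moment, and otherwise reuse the nearest cluster's label) can be illustrated with a minimal sketch. This is not the authors' algorithm: the class name `StreamClusterClassifier`, the `oracle` callback, and the fixed `radius` are all assumptions made for illustration. In particular, the fixed radius stands in for the paper's dynamic threshold, which the abstract does not specify, and the sub-cluster handling of class overlap is not modeled.

```python
import math


class StreamClusterClassifier:
    """Toy clustering-based stream classifier (illustrative assumption only)."""

    def __init__(self, radius, oracle):
        self.radius = radius    # fixed novelty threshold (a simplification)
        self.oracle = oracle    # hypothetical callback that returns a true label
        self.centers = []       # running mean of each cluster
        self.counts = []        # number of points absorbed by each cluster
        self.labels = []        # label queried when the cluster was created
        self.queries = 0        # how many labels have been requested so far

    def _nearest(self, x):
        """Return (index, distance) of the closest center, or (None, inf)."""
        best_i, best_d = None, math.inf
        for i, c in enumerate(self.centers):
            d = math.dist(x, c)
            if d < best_d:
                best_i, best_d = i, d
        return best_i, best_d

    def process(self, x):
        """Consume one stream point and return its predicted label."""
        i, d = self._nearest(x)
        if i is None or d > self.radius:
            # Novel region: open a new cluster and query its label once.
            self.centers.append(list(x))
            self.counts.append(1)
            self.labels.append(self.oracle(x))
            self.queries += 1
            return self.labels[-1]
        # Known region: update the running mean and reuse the cluster label.
        self.counts[i] += 1
        n = self.counts[i]
        self.centers[i] = [c + (v - c) / n for c, v in zip(self.centers[i], x)]
        return self.labels[i]


def oracle(x):
    # Stand-in for a human annotator; the labeling rule is made up.
    return "A" if x[0] < 5 else "B"


clf = StreamClusterClassifier(radius=2.0, oracle=oracle)
stream = [(0.0, 0.0), (0.5, 0.2), (10.0, 10.0), (9.6, 10.3), (0.1, -0.4)]
predictions = [clf.process(x) for x in stream]
```

In this toy stream, only the two points that open a new cluster trigger a label query; the other three are classified from the cluster they fall into, which is the labeling saving that an active querying strategy aims for.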
Pages: 3257-3263
Number of pages: 7