Clustering Heterogeneous Data Streams with Uncertainty over Sliding Window

Authors
Hentech, Houda [1]
Gouider, Mohammed Salah [1]
Farhat, Amine [1]
Affiliations
[1] Univ Tunis, Inst Super Gest Tunis, BESTMOD, Cite Bouchoucha 2000, Le Bardo, Tunisia
Source
Keywords
Data streams; uncertainty; clustering; similarity measure; sliding window model;
DOI
Not available
Chinese Library Classification number
TP31 [Computer software];
Discipline classification codes
081202; 0835;
Abstract
Existing methods for clustering uncertain data streams over sliding windows do not handle categorical attributes, even though uncertain mixed data are ubiquitous. This paper investigates the problem of clustering heterogeneous data streams pervaded by uncertainty over sliding windows and proposes an approach called SWHU-Clustering. A Heterogeneous Uncertain Temporal Cluster Feature (HUTCF) is introduced to monitor the distribution statistics of mixed data points. Based on this structure, the Exponential Histogram of Heterogeneous Uncertain Cluster Feature (EHHUCF) is presented as a collection of HUTCFs. This structure helps to handle in-cluster evolution and to detect temporal changes in the cluster distribution. The approach has several advantages over existing methods: 1) its higher execution efficiency stems from a design that avoids the effects of old data on the final results; 2) k-NN is incorporated into the clustering process in order to reduce the complexity of the algorithm; 3) memory consumption can be managed efficiently by limiting the number of HUTCFs in each EHHUCF. Simulations on real databases show the feasibility of SWHU-Clustering, as well as its effectiveness in comparison with the UMicro algorithm.
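The abstract does not give the definitions of HUTCF or EHHUCF, so the following is only a minimal sketch of the general idea it describes: a cluster keeps an exponential histogram of timestamped summaries of mixed (numeric and categorical) points, expires summaries that fall out of the sliding window, and bounds memory by merging summaries of equal size. All names here (`TemporalFeature`, `WindowedCluster`, `max_per_size`) are hypothetical, and uncertainty and the k-NN step are deliberately left out.

```python
from collections import Counter, deque
from dataclasses import dataclass


@dataclass
class TemporalFeature:
    """Hypothetical summary of mixed-type points (not the paper's HUTCF)."""
    n: int             # number of points summarised
    num_sum: list      # per-attribute sum of the numeric values
    cat_counts: list   # per-attribute Counter of the categorical values
    timestamp: int     # arrival time of the newest summarised point

    def merge(self, newer):
        """Combine two summaries; the result carries the newer timestamp."""
        return TemporalFeature(
            n=self.n + newer.n,
            num_sum=[a + b for a, b in zip(self.num_sum, newer.num_sum)],
            cat_counts=[a + b for a, b in zip(self.cat_counts, newer.cat_counts)],
            timestamp=max(self.timestamp, newer.timestamp),
        )


class WindowedCluster:
    """Assumed exponential histogram of summaries for a single cluster."""

    def __init__(self, window, max_per_size=2):
        self.window = window              # sliding window length in time units
        self.max_per_size = max_per_size  # cap on summaries of equal size
        self.features = deque()           # oldest summary on the left

    def insert(self, numeric, categorical, t):
        single = TemporalFeature(
            1, list(numeric), [Counter([c]) for c in categorical], t
        )
        self.features.append(single)
        self._expire(t)
        self._compact()

    def _expire(self, now):
        # A summary leaves the window once its newest point is too old.
        while self.features and self.features[0].timestamp <= now - self.window:
            self.features.popleft()

    def _compact(self):
        # Summaries of equal size are contiguous, so the two oldest of a
        # given size sit next to each other and can be merged in place.
        feats = list(self.features)
        size = 1
        while True:
            idx = [i for i, f in enumerate(feats) if f.n == size]
            if len(idx) <= self.max_per_size:
                break
            i = idx[0]
            feats[i:i + 2] = [feats[i].merge(feats[i + 1])]
            size *= 2
        self.features = deque(feats)

    def summary(self):
        """Return (point count, numeric means, categorical modes)."""
        feats = list(self.features)
        if not feats:
            return 0, [], []
        total = feats[0]
        for f in feats[1:]:
            total = total.merge(f)
        means = [s / total.n for s in total.num_sum]
        modes = [c.most_common(1)[0][0] for c in total.cat_counts]
        return total.n, means, modes


cluster = WindowedCluster(window=100, max_per_size=2)
for t in range(1, 8):
    cluster.insert([float(t)], ["a" if t % 2 else "b"], t)
print([f.n for f in cluster.features])  # summary sizes, oldest first: [4, 2, 1]
print(cluster.summary())
```

Because a merged summary is dropped only when its newest point expires, a few already-expired points can linger in the oldest summary; this is the usual accuracy-for-memory trade-off of exponential histograms, and the number of summaries grows only logarithmically with the number of points in the window.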
Pages: 162-175
Number of pages: 14