Clustering Heterogeneous Data Streams with Uncertainty over Sliding Window

被引:0
|
作者
Hentech, Houda [1 ]
Gouider, Mohammed Salah [1 ]
Farhat, Amine [1 ]
机构
[1] Univ Tunis, Inst Super Gest Tunis, BESTMOD, Cite Bouchoucha 2000, Le Bardo, Tunisia
来源
关键词
Data streams; uncertainty; clustering; similarity measure; sliding window model;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Existing methods for clustering uncertain data streams over sliding windows do not treat the categorical attributes. However, uncertain mixed data are ubiquitous. This paper investigates the problem of clustering heterogeneous data streams pervaded by uncertainty over sliding windows, so-called SWHU-Clustering. A Heterogeneous Uncertain Temporal Cluster Feature (HUTCF) is introduced to monitor the distribution statistics of mixed data points. Based on this structure, Exponential Histogram of Heterogeneous Uncertain Cluster Feature (EHHUCF) is presented as a collection of HUTCF. This structure may help to handle the in-cluster evolution, and detects the temporal change of the cluster distribution. Our approach has several advantages over existing method: 1) the higher execution efficiency benefits from its good design as it avoids the effects of old data on the final results. 2) We incorporated the k-NN into the clustering process in order to reduce the complexity of the algorithm. 3) Memory consumption can be managed efficiently by limiting the number of HUTCF in each EHHUCF. Simulations on real databases show the feasibility of SWHU-Clustering as well as its effectiveness by comparing it with UMicro algorithm.
引用
收藏
页码:162 / 175
页数:14
相关论文
共 50 条
  • [1] HCLUWIN: AN ALGORITHM FOR CLUSTERING HETEROGENEOUS DATA STREAMS OVER SLIDING WINDOWS
    Ren, Jiadong
    Hu, Changzhen
    Ma, Ruiqing
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (05): : 2171 - 2179
  • [2] Density and sliding window-based clustering over evolving data streams
    Yu, Yanwei
    Zhao, Jindong
    Zhang, Yonggang
    Wen, Changci
    [J]. ICIC Express Letters, Part B: Applications, 2015, 6 (08): : 2275 - 2283
  • [3] Clustering Data Streams over Sliding Windows by DCA
    Ta Minh Thuy
    Le Thi Hoai An
    Boudjeloud-Assala, Lydia
    [J]. ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 65 - 75
  • [4] Extending Sliding-Window Semantics over Data Streams
    Chen, Leisong
    Lin, Guoping
    [J]. ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 2, PROCEEDINGS, 2008, : 110 - +
  • [5] On concurrency control in sliding window queries over data streams
    Golab, Lukasz
    Bijay, Kumar Gaurav
    Ozsu, M. Tamer
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 608 - 626
  • [6] Incremental and Adaptive Clustering Stream Data over Sliding Window
    Dang, Xuan Hong
    Lee, Vincent C. S.
    Ng, Wee Keong
    Ong, Kok Leong
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 660 - +
  • [7] Semantics and Implementation of Continuous Sliding Window Queries over Data Streams
    Kraemer, Juergen
    Seeger, Bernhard
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (01):
  • [8] Mining frequent patterns in an arbitrary sliding window over data streams
    Li, Guohui
    Chen, Hui
    Yang, Bing
    Chen, Gang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 496 - 503
  • [9] Simultaneous sliding window join approach over multiple data streams
    Qian, Jiangbo
    Xu, Hongbing
    Wang, Yongli
    Liu, Xuejun
    Dong, Yisheng
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2005, 42 (10): : 1771 - 1778
  • [10] Mining maximal frequent itemsets in a sliding window over data streams
    Mao, Yimin
    Li, Hong
    Yang, Luming
    Liu, Lixin
    [J]. Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (11): : 1142 - 1148