Finding hierarchical frequent items in data streams

Cited by: 0
Authors
Feng, Wenfeng [1]
Guo, Qiao [1]
Zhang, Zhibin [2]
Affiliations
[1] Beijing Inst Technol, Network Informat Ctr, Beijing 100081, Peoples R China
[2] Henan Univ Technol, Dept Comp Sci & Engn, Zhengzhou 454000, Peoples R China
Keywords
data stream; summary data structure; hierarchical frequent items; approximate algorithms;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
A hierarchical sketch was implemented to summarize the hierarchical structure of stream data. The sketch used an XOR-based pairwise independent family of hash functions on the hierarchical domain to map stream items to a three-dimensional array of counters of size L × D × W, where L was the number of layers in the hierarchy, D was the number of hash functions chosen uniformly at random, and W was the range of the hash functions. Based on the sketch, an algorithm that approximately identified and evaluated the hierarchical frequent items over data streams was implemented. The algorithm has sub-linear time and space costs and is almost exact in a statistical sense.
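The structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and everything beyond the L × D × W counter array and the XOR-based hashing is an assumption made for illustration: items are taken to be fixed-width integers whose ancestors are obtained by dropping low-order bits (as with IP prefixes), each hash is h(x) = Mx XOR b over GF(2) with random M and b, counts are estimated by the minimum over the D rows as in the count-min sketch of reference [2], and a prefix is reported when its estimate reaches a fraction phi of the stream length, without discounting descendants. The names HierarchicalSketch, bits_per_level, and frequent are hypothetical.

```python
import random


class HierarchicalSketch:
    """Illustrative L x D x W counter array over a bit-prefix hierarchy."""

    def __init__(self, levels, depth, width_bits, key_bits, bits_per_level, seed=0):
        if (levels - 1) * bits_per_level >= key_bits:
            raise ValueError("hierarchy is deeper than the key allows")
        rng = random.Random(seed)
        self.levels = levels            # L: layers in the hierarchy
        self.depth = depth              # D: hash functions per layer
        self.width = 1 << width_bits    # W: range of each hash function
        self.key_bits = key_bits
        self.step = bits_per_level      # bits dropped per step up (assumption)
        self.total = 0
        # One (M, b) pair per (layer, row): M is stored as one random
        # W-range word per key bit, b is a random offset.
        self.hashes = [
            [([rng.randrange(self.width) for _ in range(key_bits)],
              rng.randrange(self.width)) for _ in range(depth)]
            for _ in range(levels)
        ]
        self.counts = [[[0] * self.width for _ in range(depth)]
                       for _ in range(levels)]

    def _hash(self, level, row, key):
        # XOR together the words of M selected by the set bits of key, then b.
        words, h = self.hashes[level][row]
        i = 0
        while key:
            if key & 1:
                h ^= words[i]
            key >>= 1
            i += 1
        return h

    def update(self, item, count=1):
        # Add the item's count to its ancestor at every layer, in every row.
        self.total += count
        for level in range(self.levels):
            prefix = item >> (level * self.step)
            for row in range(self.depth):
                self.counts[level][row][self._hash(level, row, prefix)] += count

    def estimate(self, level, prefix):
        # Count-min style estimate: never below the true count.
        return min(self.counts[level][row][self._hash(level, row, prefix)]
                   for row in range(self.depth))

    def frequent(self, phi):
        # Top-down search: only children of frequent prefixes are examined.
        threshold = phi * self.total
        top = self.levels - 1
        candidates = range(1 << (self.key_bits - top * self.step))
        found = {}
        for level in range(top, -1, -1):
            survivors = []
            for p in candidates:
                est = self.estimate(level, p)
                if est >= threshold:
                    found[(level, p)] = est
                    survivors.append(p)
            candidates = [(p << self.step) | c
                          for p in survivors for c in range(1 << self.step)]
        return found
```

Under these assumptions, a caller would feed each stream item to update and then call frequent(phi) to obtain a dictionary mapping (layer, prefix) pairs to estimated counts. Because hash collisions can only inflate a counter, the estimates may exceed the true counts but never fall below them, so truly frequent prefixes are not missed.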
Pages: 5972 / +
Page count: 2
Related papers
4 in total
  • [1] CARTER JL, 1979, J COMPUT SYST SCI, V18, P143, DOI 10.1016/0022-0000(79)90044-8
  • [2] An improved data stream summary: the count-min sketch and its applications
    Cormode, G
    Muthukrishnan, S
    [J]. JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2005, 55 (01): 58-75
  • [3] Cormode G., 2003, ACM Transactions on Database Systems (TODS), P296, DOI 10.1145/1061318.1061325
  • [4] Manku GS., 2002, P 28 INT C VER LARG, P346, DOI 10.1016/B978-155860869-6/50038-X