Frequent Items Mining on Data Stream Using Hash-Table and Heap

被引:3
|
作者
Shan, Zhang [1 ]
Ling, Chen [1 ]
Li, Tu [2 ]
机构
[1] Yang Zhou Univ, Dept Comp Sci, Yangzhou, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Inst Informat Sci & Technol, Nanjing, Jiangsu, Peoples R China
关键词
data mining; data stream; frequent Items; time fading model; hash table; heap; ALGORITHM;
D O I
10.1109/ICICISYS.2009.5357918
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the existing algorithms for mining frequent Items on data stream do not emphasis the importance of the recent data items We present an algorithm to detect the items with frequency counts exceeding a user-specified threshold Our algorithm uses a hash table L and a heap to record the potential frequent Items, and can detect c-approximate frequent data items on data stream using O(ILI+ epsilon(1)) memory space and the processing time for each data item is O(log epsilon(1)) Experimental results on several artificial and real datasets show our algorithm has higher precision, requires less memory and consumes less computation time than other similar methods
引用
收藏
页码:141 / +
页数:2
相关论文
共 50 条
  • [21] Improved data mining algorithms for frequent patterns with composite items
    Wang, Ke
    Liu, James N. K.
    Ma, Wei-Min
    [J]. CHALLENGES IN INFORMATION TECHNOLOGY MANAGEMENT, 2008, : 10 - +
  • [22] Frequent pattern mining on stream data using Hadoop CanTree-GTree
    Kusumakumari, Vanteru
    Sherigar, Deepthi
    Chandran, Roshni
    Patil, Nagamma
    [J]. 7TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2017), 2017, 115 : 266 - 273
  • [23] An Algorithm for Mining Frequent Closed Itemsets in Data Stream
    Dai, Caiyan
    Chen, Ling
    [J]. 2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL I, 2010, : 281 - 284
  • [24] Mining frequent subtree on paging XML data stream
    Department of Compute Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
    不详
    不详
    不详
    [J]. Jisuanji Yanjiu yu Fazhan, 9 (1926-1936):
  • [25] An Efficient Algorithm for Mining Frequent Patterns in Data Stream
    Zhang Guang-lu
    Lei Jing-sheng
    [J]. INTERNATIONAL CONFERENCE OF CHINA COMMUNICATION (ICCC2010), 2010, : 160 - +
  • [26] An Efficient Frequent Pattern Mining Algorithm for Data Stream
    Liu Hualei
    Lin Shukuan
    Qiao Jianzhong
    Yu Ge
    Lu Kaifu
    [J]. INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL 1, PROCEEDINGS, 2008, : 757 - 761
  • [27] An Algorithm for Mining Frequent Closed Itemsets in Data Stream
    Dai, Caiyan
    Chen, Ling
    [J]. INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT C, 2012, 24 : 1722 - 1728
  • [28] FlashGhost: Data Sanitization with Privacy Protection Based on Frequent Colliding Hash Table
    Zhu, Yan
    Yang, Shuai
    Chu, William Cheng-Chung
    Feng, Rongquan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (IEEE SCC 2019), 2019, : 90 - 99
  • [29] Approximating Frequent Items in Asynchronous Data Stream over a Sliding Window
    Ting, Hing-Fung
    Lee, Lap-Kei
    Chan, Ho-Leung
    Lam, Tak-Wah
    [J]. ALGORITHMS, 2011, 4 (03) : 200 - 222
  • [30] Approximating Frequent Items in Asynchronous Data Stream over a Sliding Window
    Chan, Ho-Leung
    Lam, Tak-Wah
    Lee, Lap-Kei
    Ting, Hing-Fung
    [J]. APPROXIMATION AND ONLINE ALGORITHMS, 2010, 5893 : 49 - 61