A More Accurate Space Saving Algorithm for Finding the Frequent Items

被引:0
|
作者
Zhou Jun [1 ]
Chen Ming [1 ]
Xiong Huan [2 ]
机构
[1] PLAUST, Inst Command Automat, Dept Comp Sci, Nanjing, Peoples R China
[2] China Elect Syst Engn Res Inst, Dept Comp Network, Beijing, Peoples R China
关键词
component; data stream; frequent items; LRU; NetFlow; anomaly detection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The frequent items problem is to process a stream as a stream of items and find all items occurring more than a given fraction of the time. It is one of the most heavily studied problems in data stream mining, dating back to the 1980s. Aiming at higher false positive rate of the Space-Saving algorithm, an LRU-based (Least Recently Used, LRU) improved algorithm with low frequency item pre-eliminated is proposed. Accuracy, stability and adaptability of the improved algorithm have been apparently enhanced. Experimental results indicate that the algorithm can not only be used to find the frequent items, and can be used to estimate the frequency of them precisely. The improved algorithm can be used for online processing both high-speed network packet stream and backbone NetFlow stream.
引用
下载
收藏
页数:5
相关论文
共 50 条
  • [31] A Randomized Algorithm for Finding Frequent Elements in Streams Using O(log log N) Space
    Ogata, Masatora
    Yamauchi, Yukiko
    Kijima, Shuji
    Yamashita, Masafumi
    ALGORITHMS AND COMPUTATION, 2011, 7074 : 514 - 523
  • [32] Memory efficient algorithm for mining recent frequent items in a stream
    Kolaczkowski, Piotr
    Rough Sets and Intelligent Systems Paradigms, Proceedings, 2007, 4585 : 485 - 494
  • [33] Finding frequent items in sliding windows with multinomially-distributed item frequencies
    Golab, L
    DeHaan, D
    López-Ortiz, A
    Demaine, ED
    16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2004, : 425 - 426
  • [34] Finding frequent items in sliding windows over data streams using EBF
    Wang, ShuYun
    Xu, HeXiang
    Hu, YunFa
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 682 - +
  • [35] AN IMPROVED ANT COLONY ALGORITHM FOR EFFECTIVE MINING OF FREQUENT ITEMS
    Sundaramoorthy, Suriya
    Shantharajah, S. P.
    JOURNAL OF WEB ENGINEERING, 2014, 13 (3-4): : 263 - 276
  • [36] A simple algorithm for finding frequent elements in streams and bags
    Karp, RM
    Shenker, S
    Papadimitriou, CH
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2003, 28 (01): : 51 - 55
  • [37] An algorithm for finding frequent patterns in a large sparse graph
    Zhou, Xiao Feng
    Gao, Lin
    Dong, An guo
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 290 - 294
  • [38] An Algorithm for Finding Frequent Patterns in Social Media Stream
    Ha, Suwook
    Lee, Yong Mi
    Nam, Kwang Woo
    Ryu, Kenn Ho
    2013 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2013): FUTURE CREATIVE CONVERGENCE TECHNOLOGIES FOR NEW ICT ECOSYSTEMS, 2013, : 618 - 623
  • [39] A Fast Algorithm For Finding Frequent Episodes In Event Streams
    Laxman, Srivatsan
    Sastry, P. S.
    Unnikrishnan, K. P.
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 410 - +
  • [40] Commentary on “Is vitamin D deficiency a more frequent finding in hirsutism or not?”
    Anna Vittoria Mattioli
    Archives of Dermatological Research, 317 (1)