A Parallel Frequent Item Counting Algorithm

被引:2
|
作者
Yang, Xun [1 ,2 ]
Liu, Jun [1 ,2 ]
Zhou, Wenli [1 ,2 ,3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst Architecture & Conve, Beijing, Peoples R China
[2] Beijing Univ Posts & Telecommun, Ctr Data Sci, Beijing, Peoples R China
[3] HAOHAN Data Technol Co Ltd, Beijing, Peoples R China
来源
2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2 | 2016年
关键词
frequent items; parallel algorithms; stream processing;
D O I
10.1109/IHMSC.2016.123
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent items in high-speed streaming data are important to many applications like network monitoring and anomaly detecting. To deal with high arrival rate of streaming data, it is desirable that such systems be capable of supporting high processing throughput with tight guarantees on errors. In this paper, we address the problem of finding frequent and top-k items, and present a parallel version of the Space Saving algorithm in the context of the open source distributed computing system. Based on the theoretical analysis, the errors are restrictively bounded in our algorithm, and our parallel design could achieve high throughput. Taking advantage of the distributed computing resources, our evaluation reveals that such design delivers linear speedup with remarkable scalability.
引用
收藏
页码:225 / 230
页数:6
相关论文
共 50 条
  • [21] Frequent Item Set Mining Algorithm Based on Bit Combination
    Lu, Jun
    Zhao, Renpeng
    Zhou, Kailong
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 72 - 76
  • [22] Algorithm of Frequent Item Sets Mining Based on Index Table
    Zhang Lin
    Yao Nanzhen
    Zhang Jianli
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 1076 - +
  • [23] An Improved Association Rules Algorithm based on Frequent Item Sets
    Jiang, Yaqiong
    Wang, Jun
    CEIS 2011, 2011, 15
  • [24] Design and Implementation of Improved Algorithm for Frequent Item Sets Mining
    Zhang Lin
    Zhang Jianli
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1696 - 1698
  • [25] A Generalized Parallel Algorithm for Frequent Itemset Mining
    Craus, Mitica
    Archip, Alexandru
    PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS , PTS 1-3: NEW ASPECTS OF COMPUTERS, 2008, : 520 - +
  • [26] Algorithm for mining frequent itemsets with item constraint based on partition
    Chen, Hui-Ping
    Zhu, Feng
    Wang, Jian-Dong
    Zhou, Xiao-Qin
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2006, 28 (07): : 1082 - 1086
  • [27] A fast parallel algorithm for frequent itemsets mining
    Souliou, Dora
    Pagourtzis, Aris
    Tsanakas, Panayiotis
    ARTIFICIAL INTELLIGENCE AND INNOVATIONS 2007: FROM THEORY TO APPLICATIONS, 2007, : 213 - +
  • [28] A parallel Apriori algorithm for frequent itemsets mining
    Ye, Yanbin
    Chiang, Chia-Chu
    FOURTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS, PROCEEDINGS, 2006, : 87 - +
  • [29] A Parallel Algorithm for Mining Maximal Frequent Subgraphs
    El Radie, Eihab
    Salem, Saeed
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1965 - 1971
  • [30] A Fast Parallel Algorithm for Discovering Frequent Patterns
    Lin, Kawuu W.
    Luo, Yu-Chin
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 398 - 403