A survey on algorithms for mining frequent itemsets over data streams

被引:0
|
作者
James Cheng
Yiping Ke
Wilfred Ng
机构
[1] The Hong Kong University of Science and Technology,Department of Computer Science and Engineering
[2] HKUST,undefined
来源
关键词
Frequent itemsets; Stream mining; Window models; Approximate algorithms;
D O I
暂无
中图分类号
学科分类号
摘要
The increasing prominence of data streams arising in a wide range of advanced applications such as fraud detection and trend learning has led to the study of online mining of frequent itemsets (FIs). Unlike mining static databases, mining data streams poses many new challenges. In addition to the one-scan nature, the unbounded memory requirement and the high data arrival rate of data streams, the combinatorial explosion of itemsets exacerbates the mining task. The high complexity of the FI mining problem hinders the application of the stream mining techniques. We recognize that a critical review of existing techniques is needed in order to design and develop efficient mining algorithms and data structures that are able to match the processing rate of the mining with the high arrival rate of data streams. Within a unifying set of notations and terminologies, we describe in this paper the efforts and main techniques for mining data streams and present a comprehensive survey of a number of the state-of-the-art algorithms on mining frequent itemsets over data streams. We classify the stream-mining techniques into two categories based on the window model that they adopt in order to provide insights into how and why the techniques are useful. Then, we further analyze the algorithms according to whether they are exact or approximate and, for approximate approaches, whether they are false-positive or false-negative. We also discuss various interesting issues, including the merits and limitations in existing research and substantive areas for future research.
引用
收藏
页码:1 / 27
页数:26
相关论文
共 50 条
  • [41] Frequent Pattern Mining Algorithms for Finding Associated Frequent Patterns for Data Streams: A Survey
    Nasreen, Shamila
    Azam, Muhammad Awais
    Shehzad, Khurram
    Naeem, Usman
    Ghazanfar, Mustansar Ali
    5TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS / THE 4TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE / AFFILIATED WORKSHOPS, 2014, 37 : 109 - +
  • [42] Maintaining Only Frequent Itemsets to Mine Approximate Frequent Itemsets over Online Data Streams
    Wang, Yongyan
    Li, Kun
    Wang, Hongan
    2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, : 381 - 388
  • [43] An efficient approximate approach to mining frequent itemsets over high speed transactional data streams
    Jea, Kuen-Fang
    Li, Chao-Wei
    Chang, Tsui-Ping
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 275 - 280
  • [44] Mining frequent itemsets over data streams with multiple time-sensitive sliding windows
    Jin, Long
    Chai, Duck Jin
    Lee, Yang Koo
    Ryu, Keun Ho
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 486 - +
  • [45] Mining Recent Frequent Itemsets over Data Streams with a Time-Sensitive Sliding Window
    Jin, Long
    Chai, Duck Jin
    Lee, Jun Wook
    Ryu, Keun Ho
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, PROCEEDINGS, 2007, 4537 : 62 - +
  • [46] A Differentially Private Scheme for Top-k Frequent Itemsets Mining Over Data Streams
    Liang W.-J.
    Chen H.
    Zhao S.-Y.
    Li C.-P.
    Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (04): : 741 - 760
  • [47] Efficient mining frequent itemsets algorithms
    Marghny H. Mohamed
    Mohammed M. Darwieesh
    International Journal of Machine Learning and Cybernetics, 2014, 5 : 823 - 833
  • [48] A New Algorithm for Mining Frequent Closed Itemsets from Data Streams
    Mao, Guojun
    Yang, Xialing
    Wu, Xindong
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 154 - +
  • [49] Mining top-K frequent itemsets from data streams
    Wong, Raymond Chi-Wing
    Fu, Ada Wai-Chee
    DATA MINING AND KNOWLEDGE DISCOVERY, 2006, 13 (02) : 193 - 217
  • [50] Mining recent frequent itemsets in data streams by radioactively attenuating strategy
    Jia, LF
    Wang, Z
    Zhou, CG
    Xu, XJ
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 804 - 811