SPPC: a new tree structure for mining erasable patterns in data streams

Authors
Tuong Le
Bay Vo
Philippe Fournier-Viger
Mi Young Lee
Sung Wook Baik
Affiliations
[1] Sejong University, Digital Contents Research Institute
[2] Ton Duc Thang University, Division of Data Science
[3] Ton Duc Thang University, Faculty of Information Technology
[4] Harbin Institute of Technology (Shenzhen), School of Natural Sciences and Humanities
Source
Applied Intelligence | 2019 / Volume 49
Keywords
Data mining; Data streams; Erasable patterns; Sliding window
DOI
Not available
Abstract
Discovering Erasable Patterns (EPs) consists of identifying product parts that will produce only a small profit loss if their production is stopped. This data mining problem has attracted the attention of numerous researchers in recent years because EPs can be used to reduce the profit loss of manufacturers. Although many algorithms have been designed to mine EPs, an important limitation of state-of-the-art EP mining algorithms is that they are batch algorithms, that is, they are designed to be applied to static databases. In real-life applications, however, databases are dynamic, as they are constantly updated by adding or removing products and parts. To stay informed about EPs in real time, traditional EP mining algorithms must be applied to the database over and over again. This is inefficient because those algorithms always start from scratch and do not take advantage of results generated by previous executions. To address this drawback of previous work in handling real-life dynamic data, this paper proposes an efficient algorithm named MSPPC for mining EPs in data streams. It relies on a novel tree structure named the SPPC (Streaming Pre-Post Code) tree, which extends the WPPC tree structure to maintain a compact tree representation of EPs in a data stream. Experimental results show that the designed MSPPC algorithm outperforms the state-of-the-art batch MERIT and dMERIT algorithms when they are run in batch mode using a sliding window. In addition, the proposed algorithm is faster than the state-of-the-art algorithms for mining EPs, namely MERIT, dMERIT+, MEI and EIFDD.
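To make the problem concrete, the following is a minimal sketch of the naive baseline the abstract argues against: every time the sliding window advances, all EPs are recomputed from scratch. It is not the MSPPC algorithm and does not build an SPPC tree. It assumes the definition commonly used in the erasable-pattern literature, which the abstract does not spell out: the gain of a set of parts is the total profit of the products that use at least one of those parts, and the set is erasable if its gain is at most a threshold fraction of the total profit in the window. The names `gain`, `erasable_patterns`, `mine_stream` and the parameters `threshold`, `window_size` and `max_size` are hypothetical.

```python
from collections import deque
from itertools import combinations


def gain(itemset, products):
    """Profit lost if every part in `itemset` is discontinued.

    `products` is a sequence of (parts, profit) pairs, where `parts` is a set.
    A product is lost as soon as it uses at least one discontinued part.
    """
    return sum(profit for parts, profit in products if parts & itemset)


def erasable_patterns(products, threshold, max_size=2):
    """Brute-force search for erasable patterns of up to `max_size` parts.

    Assumed definition: a pattern is erasable when its gain does not exceed
    `threshold` times the total profit of `products`.
    """
    total = sum(profit for _, profit in products)
    limit = threshold * total
    items = sorted(set().union(*(parts for parts, _ in products)))
    result = {}
    for k in range(1, max_size + 1):
        for combo in combinations(items, k):
            g = gain(frozenset(combo), products)
            if g <= limit:
                result[combo] = g
    return result


def mine_stream(stream, window_size, threshold, max_size=2):
    """Re-mine the whole window from scratch each time it slides.

    This repeated batch run is the inefficient approach; an incremental
    method would reuse work from the previous window instead.
    """
    window = deque(maxlen=window_size)  # oldest product drops out automatically
    for product in stream:
        window.append(product)
        if len(window) == window_size:
            yield erasable_patterns(list(window), threshold, max_size)


if __name__ == "__main__":
    # Made-up toy stream of (parts, profit) pairs, for illustration only.
    stream = [
        (frozenset("ab"), 10),
        (frozenset("bc"), 20),
        (frozenset("ac"), 30),
        (frozenset("d"), 40),
    ]
    for eps in mine_stream(stream, window_size=3, threshold=0.5):
        print(eps)
    # {('b',): 30}
    # {('a',): 30, ('b',): 20, ('d',): 40}
```

In the toy run, part `a` is not erasable in the first window (its gain of 40 exceeds the limit of 30) but becomes erasable once the window slides and the first product leaves. Detecting such changes currently costs a full recomputation per window, which is the overhead a streaming structure is meant to avoid.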
Pages: 478 - 495
Number of pages: 17
Related papers
50 in total
  • [11] Mining emerging patterns and classification in data streams
    Alhammady, H
    Ramamohanarao, K
    [J]. 2005 IEEE/WIC/ACM International Conference on Web Intelligence, Proceedings, 2005, : 272 - 275
  • [12] Efficiently mining erasable stream patterns for intelligent systems over uncertain data
    Baek, Yoonji
    Yun, Unil
    Lin, Jerry Chun-Wei
    Yoon, Eunchul
    Fujita, Hamido
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (11) : 1699 - 1734
  • [13] Fast algorithms for mining maximal erasable patterns
    Linh Nguyen
    Giang Nguyen
    Bac Le
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 124 : 50 - 66
  • [14] A novel approach for mining emerging patterns in data streams
    Alhammady, Hamad
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 680 - 683
  • [15] Approximately mining recently representative patterns on data streams
    Koh, Jia-Ling
    Don, Yuan-Bin
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 231 - 243
  • [16] Mining neighbor-based patterns in data streams
    Yang, Di
    Rundensteiner, Elke A.
    Ward, Matthew O.
    [J]. INFORMATION SYSTEMS, 2013, 38 (03) : 331 - 350
  • [17] Mining multidimensional sequential patterns over data streams
    Raissi, Chedy
    Plantevit, Marc
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 263 - 272
  • [18] Discovering Frequent Tree Patterns over Data Streams
    Hsieh, Mark Cheng-Enn
    Wu, Yi-Hung
    Chen, Arbee L. P.
    [J]. PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 629 - +
  • [19] Extremely Fast Decision Tree Mining for Evolving Data Streams
    Bifet, Albert
    Zhang, Jiajin
    Fan, Wei
    He, Cheng
    Zhang, Jianfeng
    Qian, Jianfeng
    Holmes, Geoff
    Pfahringer, Bernhard
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1733 - 1742
  • [20] Incremental Mining of Across-streams Sequential Patterns in Multiple Data Streams
    Yang, Shih-Yang
    Chao, Ching-Ming
    Chen, Po-Zung
    Sun, Chu-Hao
    [J]. JOURNAL OF COMPUTERS, 2011, 6 (03) : 449 - 457