SPPC: a new tree structure for mining erasable patterns in data streams

被引:18
|
作者
Le, Tuong [1 ]
Vo, Bay [2 ,3 ]
Fournier-Viger, Philippe [4 ]
Lee, Mi Young [1 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Digital Contents Res Inst, Seoul, South Korea
[2] Ton Duc Thang Univ, Div Data Sci, Ho Chi Minh City, Vietnam
[3] Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam
[4] Harbin Inst Technol Shenzhen, Sch Nat Sci & Humanities, Shenzhen, GD, Peoples R China
关键词
Data mining; Data streams; Erasable patterns; Sliding window; WINDOW BASED ALGORITHM; SLIDING WINDOW; FREQUENT ITEMSETS; EFFICIENT ALGORITHMS; ASSOCIATION RULES; UTILITY PATTERNS;
D O I
10.1007/s10489-018-1280-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering Erasable Patterns (EPs) consists of identifying product parts that will produce a small profit loss if their production is stopped. It is a data mining problem that has attracted the attention of numerous researchers in recent years due to the possibility of using EPs to reduce profit loss of manufacturers. Though, many algorithms have been designed to mine EPs, an important limitation of state-of-the-art EP mining algorithms is that they are batch algorithms, that is, they are designed to be applied on static databases. But in real-life applications, databases are dynamic, as they are constantly updated by adding or removing products and parts. To be informed about EPs in real-time, traditional EP mining algorithms must be applied over and over again on a database. This is inefficient as those algorithms are always applied from scratch without taking advantage of results generated by previous executions. Considering this important drawback of previous work for handling real-life dynamic data, this paper proposes an efficient algorithm named MSPPC for mining EPs in data streams. It relies on a novel tree structure named SPPC (Streaming Pre-Post Code) tree, which extends the WPPC tree structure for maintaining a compact tree representation of EPs in a data stream. Experimental results show that the designed MSPPC algorithm outperforms the state-of-the-art batch MERIT and dMERIT algorithms when they are run in batch mode using a sliding-window. Besides, the proposed algorithm is also faster than the state-of-the-art algorithms for mining EPs, namely MERIT, dMERIT + , MEI and EIFDD.
引用
收藏
页码:478 / 495
页数:18
相关论文
共 50 条
  • [1] SPPC: a new tree structure for mining erasable patterns in data streams
    Tuong Le
    Bay Vo
    Philippe Fournier-Viger
    Mi Young Lee
    Sung Wook Baik
    [J]. Applied Intelligence, 2019, 49 : 478 - 495
  • [2] Erasable pattern mining based on tree structures with damped window over data streams
    Baek, Yoonji
    Yun, Unil
    Kim, Heonho
    Nam, Hyoju
    Lee, Gangin
    Yoon, Eunchul
    Vo, Bay
    Lin, Jerry Chun-Wei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 94
  • [3] DSTree: A tree structure for the mining of frequent sets from data streams
    Leung, Carson Kai-Sang
    Khan, Quamrul I.
    [J]. ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 928 - +
  • [4] Mining Regular Patterns in Data Streams
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, PROCEEDINGS, 2010, 5981 : 399 - 413
  • [5] The CART decision tree for mining data streams
    Rutkowski, Leszek
    Jaworski, Maciej
    Pietruczuk, Lena
    Duda, Piotr
    [J]. INFORMATION SCIENCES, 2014, 266 : 1 - 15
  • [6] Hyper-structure mining of frequent patterns in uncertain data streams
    HewaNadungodage, Chandima
    Xia, Yuni
    Lee, Jaehwan John
    Tu, Yi-cheng
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (01) : 219 - 244
  • [7] Hyper-structure mining of frequent patterns in uncertain data streams
    Chandima HewaNadungodage
    Yuni Xia
    Jaehwan John Lee
    Yi-cheng Tu
    [J]. Knowledge and Information Systems, 2013, 37 : 219 - 244
  • [8] Prefix tree-based algorithm for mining frequent patterns over data streams
    Yang, Junrui
    Huang, Wei
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2010, 38 (07): : 107 - 110
  • [9] Mining Patterns From Data Streams: An Overview
    Borah, Anindita
    BhabeshNath
    [J]. 2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 371 - 376
  • [10] Mining evolving data streams for frequent patterns
    Laur, Pierre-Alain
    Nock, Richard
    Symphor, Jean-Emile
    Poncelet, Pascal
    [J]. PATTERN RECOGNITION, 2007, 40 (02) : 492 - 503