A Scalable Complex Event Analytical System with Incremental Episode Mining over Data Streams

被引:0
|
作者
Tseng, Jerry C. C. [1 ]
Gu, Jia-Yuan [1 ]
Tseng, Vincent S. [2 ]
Wang, P. F. [3 ]
Chen, Ching-Yu [3 ]
Li, Chu-Feng [3 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[3] Inst Informat Ind, Taipei, Taiwan
关键词
Data Stream; Incremental Mining; Episode Pattern Mining; Lambda Architecture; FREQUENT EPISODES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Episode pattern mining is a very powerful technique to get high-valued information for people to solve real-life cross-disciplinary problems, such as for the analysis of manufacturing, stock markets, weather records and so on. As data grows, the mining process must be re-triggered again and again to obtain the most updated information. However, periodically re-mining the full dataset is not cost-effective, and thus a number of incremental mining approaches arise for the growing data. However, to our best knowledge, there exist few studies targeted on the problem of incremental episode mining. Moreover, streaming data of complex events is more and more popular because digital sensors always collect data around us in this big data age. Now the challenge is not only mining valuable episode patterns of incremental dataset, but also mining episode patterns over data streams of complex events. To address this research problem, we adopt the Lambda Architecture to design a scalable complex event analytical system that could be used to facilitate the incremental episode mining process over complex event sequences of data streams. Apache Spark and Apache Spark Streaming are applied as the development framework of the batch layer and the speed layer, respectively. To take both the efficiency and accuracy into consideration, we develop a series of modules and three algorithms, namely, batch episode mining, delta episode mining and pattern merging. Results from the experimental validation on a real dataset show that the proposed system carries high scalability and delivers excellent performance in terms of efficiency and accuracy.
引用
收藏
页码:648 / 655
页数:8
相关论文
共 50 条
  • [21] Process Mining over Unordered Event Streams
    Awad, Ahmed
    Weidlich, Matthias
    Sakr, Sherif
    2020 2ND INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2020), 2020, : 81 - 88
  • [22] Active Complex Event Processing over Event Streams
    Wang, Di
    Rundensteiner, Elke A.
    Ellison, Richard T., III
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (10): : 634 - 645
  • [23] Incremental Mining of Across-streams Sequential Patterns in Multiple Data Streams
    Yang, Shih-Yang
    Chao, Ching-Ming
    Chen, Po-Zung
    Sun, Chu-Hao
    JOURNAL OF COMPUTERS, 2011, 6 (03) : 449 - 457
  • [24] Detection of complex event over RFID data streams with multi-levels
    Peng, Shanglian
    Li, Zhanhuai
    Li, Qiang
    Chen, Qun
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2011, 39 (07): : 54 - 58
  • [25] Complex Event Processing over Multi-Granularity RFID Data Streams
    Peng, Shanglian
    Li, Zhanhuai
    Chen, Lin
    Nie, Yanming
    Chen, Qun
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 3, 2009, : 235 - +
  • [26] Complex event processing over distributed probabilistic event streams
    Wang, Y. H.
    Cao, K.
    Zhang, X. M.
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2013, 66 (10) : 1808 - 1821
  • [27] Complex Event Processing over Distributed Uncertain Event Streams
    Zhang, XinLong
    Wang, Yongheng
    Zhang, XiaoMing
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SERVICE SYSTEM (CSSS), 2014, 109 : 357 - 361
  • [28] A Novel Complex-Events Analytical System Using Episode Pattern Mining Techniques
    Tseng, Jerry C. C.
    Gu, Jia-Yuan
    Wang, P. F.
    Chen, Ching-Yu
    Tseng, Vincent S.
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 487 - 498
  • [29] Incremental mining of closed sequential patterns in multiple data streams
    Yang S.-Y.
    Chao C.-M.
    Chen P.-Z.
    Sun C.-H.
    Journal of Networks, 2011, 6 (05) : 728 - 735
  • [30] Dynamic incremental SVM learning algorithm for mining, data streams
    Li, Zhong-Wei
    Yang, Jrng
    Zhang, Jian-Pei
    PROCEEDINGS OF THE FIRST INTERNATIONAL SYMPOSIUM ON DATA, PRIVACY, AND E-COMMERCE, 2007, : 35 - +