Weighted frequent sequential pattern mining

被引:13
|
作者
Islam, Md Ashraful [1 ]
Rafi, Mahfuzur Rahman [2 ]
Azad, Al-amin [2 ]
Ovi, Jesan Ahammed [2 ]
机构
[1] Univ Dhaka, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] East West Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
关键词
Data mining; Pattern mining; Sequence mining; Weighted sequence; PREFIXSPAN;
D O I
10.1007/s10489-021-02290-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trillions of bytes of data are generated every day in different forms, and extracting useful information from that massive amount of data is the study of data mining. Sequential pattern mining is a major branch of data mining that deals with mining frequent sequential patterns from sequence databases. Due to items having different importance in real-life scenarios, they cannot be treated uniformly. With today's datasets, the use of weights in sequential pattern mining is much more feasible. In most cases, as in real-life datasets, pushing weights will give a better understanding of the dataset, as it will also measure the importance of an item inside a pattern rather than treating all the items equally. Many techniques have been introduced to mine weighted sequential patterns, but typically these algorithms generate a massive number of candidate patterns and take a long time to execute. This work aims to introduce a new pruning technique and a complete framework that takes much less time and generates a small number of candidate sequences without compromising with completeness. Performance evaluation on real-life datasets shows that our proposed approach can mine weighted patterns substantially faster than other existing approaches.
引用
下载
收藏
页码:254 / 281
页数:28
相关论文
共 50 条
  • [21] WLPMiner: Weighted frequent pattern mining with length-decreasing support constraints
    Yun, U
    Leggett, JJ
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 555 - 567
  • [22] Time-weighted counting for recently frequent pattern mining in data streams
    Kang, Yongsub U.
    Kang, U.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 53 (02) : 391 - 422
  • [23] PrivTS: Differentially private frequent time-constrained sequential pattern mining
    Li, Yanhui
    Wang, Guoren
    Yuan, Ye
    Cao, Xin
    Yuan, Long
    Lin, Xuemin
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2018, 10828 LNCS : 92 - 111
  • [24] PrivTS: Differentially Private Frequent Time-Constrained Sequential Pattern Mining
    Li, Yanhui
    Wang, Guoren
    Yuan, Ye
    Cao, Xin
    Yuan, Long
    Lin, Xuemin
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2018), PT II, 2018, 10828 : 92 - 111
  • [25] Sequential Behavior Pattern Discovery with Frequent Episode Mining and Wireless Sensor Network
    Li, Li
    Li, Xin
    Lu, Zhihan
    Lloret, Jaime
    Song, Houbing
    IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (06) : 205 - 211
  • [26] A Generalized Weighted Closed Sequential Pattern Mining Algorithm with Item Interval
    Lu, Haitao
    Li, Shuo
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 1599 - 1604
  • [27] Reframing in Frequent Pattern Mining
    Ahmed, Chowdhury Farhan
    Samiullah, Md.
    Lachiche, Nicolas
    Kull, Meelis
    Flach, Peter
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 799 - 806
  • [28] Weighted Frequent Subgraph Mining in Weighted Graph Databases
    Shinoda, Masaki
    Ozaki, Tomonobu
    Ohkawa, Takenao
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 58 - +
  • [29] Mining Frequent Weighted Closed Itemsets
    Bay Vo
    Nhu-Y Tran
    Duong-Ha Ngo
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 379 - 390
  • [30] Methods for mining frequent sequential patterns
    Jiang, LH
    Hamilton, HJ
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 486 - 491