Mining weighted frequent sequences in uncertain databases

被引:50
|
作者
Rahman, Md Mahmudur [1 ]
Ahmed, Chowdhury Farhan [1 ,2 ]
Leung, Carson Kai-Sang [3 ]
机构
[1] Univ Dhaka, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Univ Strasbourg, ICube Lab, Strasbourg, France
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Data mining; Pattern mining; Sequence mining; Uncertain database; Weighted sequence; SEQUENTIAL PATTERNS; INTERESTING PATTERNS; ALGORITHM; MAINTENANCE; ITEMSETS; SETS;
D O I
10.1016/j.ins.2018.11.026
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Frequent pattern mining has become very useful and interesting to researchers due to its high applicability. Different real-life databases (e.g., sensor network, medical diagnosis data) are uncertain in their nature. Many algorithms have been developed to mine the frequent uncertain patterns based on expected support values. Nonetheless, those are circumscribed to find the frequent patterns by using some filtering constraints. Moreover, it is challenging to find the actual interesting patterns as different patterns carry different importance. In this work, a new framework is proposed to mine sequences in uncertain databases satisfying both weight and support constraints. Subsequently, an efficient algorithm (uWSequence) is developed to discover the uncertain weighted sequences. In addition, the pruning measures iMaxPr, and expSupport(top) play a vital role to make uWSequence remarkably time-efficient, by filtering out the unfavorable patterns in early stages. The applicability of this proposed framework is shown to solve various problems (e.g., weather forecasting, sensor-based event findings). To our knowledge, ours is the first work on weighted sequences in uncertain databases. Extensive performance analysis confirms the efficiency of the proposed algorithm as well as the superiority over the existing algorithms. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:76 / 100
页数:25
相关论文
共 50 条
  • [1] Weighted frequent itemset mining over uncertain databases
    Jerry Chun-Wei Lin
    Wensheng Gan
    Philippe Fournier-Viger
    Tzung-Pei Hong
    Vincent S. Tseng
    [J]. Applied Intelligence, 2016, 44 : 232 - 250
  • [2] Efficient Mining of Weighted Frequent Itemsets in Uncertain Databases
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION (MLDM 2016), 2016, 9729 : 236 - 250
  • [3] Weighted frequent itemset mining over uncertain databases
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Tseng, Vincent S.
    [J]. APPLIED INTELLIGENCE, 2016, 44 (01) : 232 - 250
  • [4] Efficient weighted probabilistic frequent itemset mining in uncertain databases
    Li, Zhiyang
    Chen, Fengjuan
    Wu, Junfeng
    Liu, Zhaobin
    Liu, Weijiang
    [J]. EXPERT SYSTEMS, 2021, 38 (05)
  • [5] Mining Weighted Frequent Itemsets without Candidate Generation in Uncertain Databases
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Chao, Han-Chieh
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2017, 16 (06) : 1549 - 1579
  • [6] Weighted Frequent Subgraph Mining in Weighted Graph Databases
    Shinoda, Masaki
    Ozaki, Tomonobu
    Ohkawa, Takenao
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 58 - +
  • [7] Mining Frequent Itemsets in Correlated Uncertain Databases
    Yong-Xin Tong
    Lei Chen
    Jieying She
    [J]. Journal of Computer Science and Technology, 2015, 30 : 696 - 712
  • [8] Mining Frequent Itemsets over Uncertain Databases
    Tong, Yongxin
    Chen, Lei
    Cheng, Yurong
    Yu, Philip S.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (11): : 1650 - 1661
  • [9] Probabilistic Frequent Itemset Mining in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    Verhein, Florian
    Zuefle, Andreas
    [J]. KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 119 - 127
  • [10] Mining Frequent Itemsets in Correlated Uncertain Databases
    Tong, Yong-Xin
    Chen, Lei
    She, Jieying
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (04) : 696 - 712