Discovering probabilistically weighted sequential patterns in uncertain databases

被引:2
|
作者
Islam, Md Sahidul [1 ]
Kar, Pankaj Chandra [1 ]
Samiullah, Md [1 ]
Ahmed, Chowdhury Farhan [1 ]
Leung, Carson Kai-Sang [2 ]
机构
[1] Univ Dhaka, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Uncertain sequential pattern; Probabilistic world; Weighted pattern; Expected support; INTERESTING PATTERNS; EFFICIENT ALGORITHMS; FREQUENT; SEQUENCES;
D O I
10.1007/s10489-022-03699-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining useful sequential patterns has been a recent trend in data mining as the real-life applications are mostly sequence oriented. Researchers have developed many algorithms to find frequent sub-sequences from sequential databases to find useful information. The emerging and tremendous development of technology has been increasing the number of applications that deal with uncertainty. Ordinary uncertain pattern mining algorithms deal with expected support or probabilistic frequentness of a pattern, ignoring the importance of individual items. However, in real-life, different items can have different importance. Some approaches consider the weight (importance) of items but fail to capture the interestingness of mined patterns. The objective of the work is to address weighted sequential uncertain pattern mining in Possible World Semantics (PWS) to better capture inherent relations among the items and events with different weights and developing a novel method uWSpan. Our proposed approach contains some pruning techniques to provide faster mining capability and introduces itemset extension for the first time in PWS. We have analyzed the performance of our proposed approach both theoretically and empirically where we found uWSpan efficient, scalable and effective. Our approach outperforms existing approaches most of the time when compared using approved datasets. We also analyzed the applicability, efficiency and effectiveness of our proposed method. Finally, the paper concludes with future research directions and a gist of the outcomes of the research.
引用
收藏
页码:6525 / 6553
页数:29
相关论文
共 50 条
  • [1] Discovering probabilistically weighted sequential patterns in uncertain databases
    Md Sahidul Islam
    Pankaj Chandra Kar
    Md Samiullah
    Chowdhury Farhan Ahmed
    Carson Kai-Sang Leung
    [J]. Applied Intelligence, 2023, 53 : 6525 - 6553
  • [2] Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases
    Zhao, Zhou
    Yan, Da
    Ng, Wilfred
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1171 - 1184
  • [3] Mining weighted sequential patterns in incremental uncertain databases
    Roy, Kashob Kumar
    Moon, Md Hasibul Haque
    Rahman, Md Mahmudur
    Ahmed, Chowdhury Farhan
    Leung, Carson Kai-Sang
    [J]. INFORMATION SCIENCES, 2022, 582 : 865 - 896
  • [4] An Efficient Approach to Discovering Sequential Patterns in Large Databases
    Yen, Show-Jane
    Cho, Chung-Wen
    [J]. LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 685 - 690
  • [5] Discovering time-interval sequential patterns in sequence databases
    Chen, YL
    Chiang, MC
    Ko, MT
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2003, 25 (03) : 343 - 354
  • [6] Fast algorithm to discovering sequential patterns from large databases
    Hu Huirong
    [J]. PROCEEDINGS OF THE 24TH CHINESE CONTROL CONFERENCE, VOLS 1 AND 2, 2005, : 1352 - 1355
  • [7] Mining Weighted a Closed Sequential Patterns in Large Databases
    Ren, Jia-Dong
    Yang, Jing
    Li, Yan
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 640 - 644
  • [8] Mining High-Utility Sequential Patterns in Uncertain Databases
    Lin, Jerry Chun-Wei
    Srivastava, Gautam
    Li, Yuanfa
    Hong, Tzung-Pei
    Wang, Shyue-Liang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5373 - 5380
  • [9] A new approach for discovering fuzzy quantitative sequential patterns in sequence databases
    Chen, Yen-Liang
    Huang, Tony Cheng-Kui
    [J]. FUZZY SETS AND SYSTEMS, 2006, 157 (12) : 1641 - 1661
  • [10] Discovering fuzzy time-interval sequential patterns in sequence databases
    Chen, YL
    Huang, TCK
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (05): : 959 - 972