A fundamental approach to discover closed periodic-frequent patterns in very large temporal databases

Authors
Pamalla, Veena [1]
Rage, Uday Kiran [2, 3]
Penugonda, Ravikumar [2, 4]
Palla, Likhitha [2]
Hayamizu, Yuto [5]
Goda, Kazuo [5]
Toyoda, Masashi [5]
Zettsu, Koji [3]
Sourabh, Shrivastava [2]
Affiliations
[1] JNTU, Dept Comp Sci, Sri Balaji Pg Coll, Ananthapuram 515001, Andhra Pradesh, India
[2] Univ Aizu, Fukushima, Japan
[3] NICT, Tokyo, Japan
[4] RGUKT AP, IIIT Idupulapaya, Vempalli, India
[5] Univ Tokyo, Tokyo, Japan
Keywords
Frequent pattern mining; Periodic-frequent pattern mining; Interesting patterns; Periodic-frequent patterns; Closed periodic-frequent patterns; EFFICIENT ALGORITHM; TRANSACTIONAL DATABASES; MAFIA
DOI
10.1007/s10489-023-04811-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Periodic-frequent pattern mining (PFPM) is a vital knowledge discovery technique that identifies periodically occurring patterns in a temporal database. Although traditional PFPM algorithms have many applications, they often produce a very large set of periodic-frequent patterns (PFPs). Analyzing so many PFPs is time-consuming for users, and generating them makes PFPM algorithms less efficient in both runtime and memory consumption. This paper addresses the problem by proposing a novel model of closed periodic-frequent patterns (CPFPs). CPFPs are less expensive to mine because they form a concise, lossless subset that uniquely describes the entire set of PFPs. We also present an efficient depth-first search algorithm, called Closed Periodic-Frequent Pattern-Miner (CPFP-Miner), to discover these patterns. The algorithm uses the concept of weighted ordering of patterns to reduce the search space, and it applies the concept of current periodicity to prune aperiodic patterns from that space. Extensive experiments on real-world and synthetic databases demonstrate that CPFP-Miner is efficient: it outperforms the state-of-the-art algorithms in runtime, memory consumption, and energy consumption, and it also scales better than they do. Finally, we present two case studies to show the usefulness of the proposed patterns.
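To make the notion of a closed periodic-frequent pattern concrete, the following is a minimal brute-force sketch on a toy temporal database. It is not the CPFP-Miner algorithm and uses neither weighted ordering nor current periodicity. It assumes the commonly used PFP definitions, which this record does not spell out: support is the number of timestamps at which a pattern occurs, periodicity is the largest gap between consecutive occurrences (counting the gap from time 0 to the first occurrence and from the last occurrence to the final timestamp), and a PFP is closed if no proper superset has the same support. The names `mine_pfps`, `closed_only`, `min_sup`, and `max_per` are hypothetical.

```python
from itertools import combinations


def periodicity(timestamps, last_ts):
    """Largest gap between consecutive occurrences, including the gap from
    time 0 to the first occurrence and from the last occurrence to last_ts."""
    points = [0] + sorted(timestamps) + [last_ts]
    return max(b - a for a, b in zip(points, points[1:]))


def mine_pfps(db, min_sup, max_per):
    """Brute-force enumeration (assumed definitions, not CPFP-Miner).
    Returns {pattern: (support, periodicity)} for every periodic-frequent pattern."""
    last_ts = max(db)
    items = sorted(set().union(*db.values()))
    result = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            pattern = frozenset(combo)
            ts = [t for t, transaction in db.items() if pattern <= transaction]
            if len(ts) < min_sup:
                continue
            per = periodicity(ts, last_ts)
            if per <= max_per:
                result[pattern] = (len(ts), per)
    return result


def closed_only(pfps):
    """Keep patterns that have no proper superset with the same support.
    A superset with equal support occurs at the same timestamps, so it is
    itself a PFP and is guaranteed to be present in pfps."""
    return {
        p: v for p, v in pfps.items()
        if not any(p < q and v[0] == pfps[q][0] for q in pfps)
    }


# Toy temporal database: timestamp -> set of items (illustrative data only).
db = {
    1: {"a", "b"},
    2: {"a", "b", "c"},
    3: {"a", "b"},
    4: {"c"},
    5: {"a", "b", "c"},
    6: {"a", "b"},
}

pfps = mine_pfps(db, min_sup=3, max_per=2)
closed = closed_only(pfps)
```

In this toy database, `{a}` and `{b}` always occur together, so both are absorbed by `{a, b}`, which illustrates why the closed set is smaller without losing support information.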
Pages: 27344-27373
Page count: 30