Mining Significant Utility Discriminative Patterns in Quantitative Databases

Cited by: 0
Authors
Tang, Huijun [1, 2]
Wang, Jufeng [1]
Wang, Le [3]
Affiliations
[1] Ningbo Univ Finance & Econ, Fac Finance & Informat, Ningbo 315175, Peoples R China
[2] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315211, Peoples R China
[3] Ningbo Univ Finance & Econ, Fac Digital Technol & Engn, Ningbo 315175, Peoples R China
Keywords
high utility pattern; sampling; quantitative database; COVID-19; efficient algorithms; itemsets
DOI
10.3390/math11040950
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Mining discriminative patterns in quantitative datasets is often framed as returning high utility patterns (HUPs). Traditional methods output patterns whose utility is above a pre-given threshold. User-centered mining, however, needs results returned promptly to strengthen the interaction between the mining system and its users. Pattern sampling can return results with a probability guarantee in a short time, which makes it a candidate technique for mining such discriminative patterns. This paper proposes a novel approach named HUPSampler that samples one potential HUP, drawn with a probability that reflects its utility in the database. HUPSampler first introduces an interval constraint on the length of a HUP and randomly draws an integer k according to the utility proportion; it then obtains HUPs efficiently from a random tree using a pattern-growth approach; finally, it randomly returns a HUP of length k. The experimental study shows that HUPSampler is efficient in terms of memory usage, runtime, and utility distribution. In addition, case studies show that HUPSampler can be used effectively to analyze the COVID-19 epidemic by identifying critical locations.
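The two-stage idea in the abstract (draw a length k by utility share, then draw a pattern of length k) can be illustrated with a minimal brute-force sketch. This is not the HUPSampler implementation: it enumerates every itemset instead of using a random tree and pattern growth, and it assumes the usual high-utility-itemset definition of utility (quantity times unit profit, summed over the transactions that contain the pattern), which the abstract does not spell out. The toy data and the names `TRANSACTIONS`, `UNIT_PROFIT`, `pattern_utilities`, and `sample_pattern` are hypothetical.

```python
import random
from collections import defaultdict
from itertools import combinations

# Hypothetical toy data: each transaction maps item -> purchased quantity.
TRANSACTIONS = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"b": 1, "c": 3},
]
# Hypothetical unit profit per item.
UNIT_PROFIT = {"a": 5, "b": 2, "c": 1}


def pattern_utilities(transactions, unit_profit, min_len, max_len):
    """Brute force: total utility of every itemset with length in [min_len, max_len]."""
    utilities = defaultdict(int)
    for transaction in transactions:
        items = sorted(transaction)
        for k in range(min_len, min(max_len, len(items)) + 1):
            for pattern in combinations(items, k):
                # Assumed utility: quantity * unit profit, summed over the pattern's items.
                utilities[pattern] += sum(
                    transaction[item] * unit_profit[item] for item in pattern
                )
    return dict(utilities)


def sample_pattern(utilities, rng):
    """Draw a length k by its utility share, then a pattern of length k by utility."""
    by_length = defaultdict(list)
    for pattern in utilities:
        by_length[len(pattern)].append(pattern)
    lengths = sorted(by_length)
    length_weights = [sum(utilities[p] for p in by_length[k]) for k in lengths]
    k = rng.choices(lengths, weights=length_weights)[0]
    candidates = by_length[k]
    return rng.choices(candidates, weights=[utilities[p] for p in candidates])[0]


utilities = pattern_utilities(TRANSACTIONS, UNIT_PROFIT, min_len=1, max_len=2)
sampled = sample_pattern(utilities, random.Random(0))
```

In this sketch, the chance of drawing a pattern is its length's utility share multiplied by the pattern's share within that length, which works out to the pattern's utility divided by the total utility of all patterns inside the length interval.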
Pages: 18
Related papers
50 in total
  • [41] Statistically Significant Pattern Mining With Ordinal Utility
    Tran, Thien Q. Q.
    Fukuchi, Kazuto
    Akimoto, Youhei
    Sakuma, Jun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8770 - 8783
  • [42] Mining long high utility itemsets in transaction databases
    Yu, Guangzhu
    Shao, Shihuang
    Sun, Daoqing
    Luo, Bin
    NEW ADVANCES IN SIMULATION, MODELLING AND OPTIMIZATION (SMO '07), 2007, : 326 - +
  • [43] Mining High Utility Itemsets over Uncertain Databases
    Lan, Yuqing
    Wang, Yang
    Wang, Yanni
    Yi, Shengwei
    Yu, Dan
    2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 235 - 238
  • [44] Mining Closed High Utility Itemsets in Uncertain Databases
    Nguyen Bui
    Bay Vo
    Van-Nam Huynh
    Lin, Chun-Wei
    Nguyen, Loan T. T.
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 7 - 14
  • [45] Mining Recent High-Utility Patterns from Temporal Databases with Time-Sensitive Constraint
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2016, 2016, 9829 : 3 - 18
  • [46] Mining Statistically Significant Sequential Patterns
    Low-Kam, Cecile
    Raissi, Chedy
    Kaytoue, Mehdi
    Pei, Jian
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 488 - 497
  • [47] Mining Significant Sequential Contrast Patterns
    Conklin, Darrell
    MATHEMATICS AND COMPUTATION IN MUSIC, MCM 2024, 2024, 14639 : 387 - 392
  • [48] Mining sequential patterns from probabilistic databases
    Muhammad Muzammal
    Rajeev Raman
    Knowledge and Information Systems, 2015, 44 : 325 - 358
  • [49] Mining Sequential Patterns from Probabilistic Databases
    Muzammal, Muhammad
    Raman, Rajeev
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6635 : 210 - 221
  • [50] Parallel mining of frequent patterns in transactional databases
    Fakhrahmad, S. M.
    Fard, G. H. Dastghaibi
    WORLD CONGRESS ON ENGINEERING 2008, VOLS I-II, 2008, : 605 - +