FHM plus : Faster High-Utility Itemset Mining Using Length Upper-Bound Reduction

被引:23
|
作者
Fournier-Viger, Philippe [1 ]
Lin, Jerry Chun-Wei [2 ]
Duong, Quang-Huy [3 ]
Dam, Thu-Lan [3 ,4 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Sch Nat Sci & Humanities, Shenzhen, Peoples R China
[2] Harbin Inst Technol, Shenzhen Grad Sch, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Hunan, Peoples R China
[4] Hanoi Univ Ind, Fac Informat Technol, Hanoi, Vietnam
关键词
Pattern mining; High-utility itemsets; Length constraints;
D O I
10.1007/978-3-319-42007-3_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-utility itemset (HUI) mining is a popular data mining task, consisting of enumerating all groups of items that yield a high profit in a customer transaction database. However, an important issue with traditional HUI mining algorithms is that they tend to find itemsets having many items. But those itemsets are often rare, and thus may be less interesting than smaller itemsets for users. In this paper, we address this issue by presenting a novel algorithm named FHM+ for mining HUIs, while considering length constraints. To discover HUIs efficiently with length constraints, FHM+ introduces the concept of Length Upper-Bound Reduction (LUR), and two novel upper-bounds on the utility of itemsets. An extensive experimental evaluation shows that length constraints are effective at reducing the number of patterns, and the novel upper-bounds can greatly decrease the execution time, and memory usage for HUI mining.
引用
收藏
页码:115 / 127
页数:13
相关论文
共 50 条
  • [1] High-utility and diverse itemset mining
    Amit Verma
    Siddharth Dawar
    Raman Kumar
    Shamkant Navathe
    Vikram Goyal
    Applied Intelligence, 2021, 51 : 4649 - 4663
  • [2] High-utility and diverse itemset mining
    Verma, Amit
    Dawar, Siddharth
    Kumar, Raman
    Navathe, Shamkant
    Goyal, Vikram
    APPLIED INTELLIGENCE, 2021, 51 (07) : 4649 - 4663
  • [3] High-Utility Itemset Mining in Big Dataset
    Wu, Jimmy Ming-Tai
    Lin, Jerry Chun-Wei
    Chen, Chien-Ming
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2019,
  • [4] A survey of incremental high-utility itemset mining
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    Hong, Tzung-Pei
    Fujita, Hamido
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (02)
  • [5] Whitebox Induction of Default Rules Using High-Utility Itemset Mining
    Shakerin, Farhad
    Gupta, Gopal
    PRACTICAL ASPECTS OF DECLARATIVE LANGUAGES (PADL 2020), 2020, 12007 : 168 - 176
  • [6] Ignoring Internal Utilities in High-Utility Itemset Mining
    Oguz, Damla
    SYMMETRY-BASEL, 2022, 14 (11):
  • [7] High-Utility Itemset Mining with Effective Pruning Strategies
    Wu, Jimmy Ming-Tai
    Lin, Jerry Chun-Wei
    Tamrakar, Ashish
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (06)
  • [8] Improved Genetic Algorithm for High-Utility Itemset Mining
    Zhang, Qiang
    Fang, Wei
    Sun, Jun
    Wang, Quan
    IEEE ACCESS, 2019, 7 : 176799 - 176813
  • [9] SMIM Framework to Generalize High-Utility Itemset Mining
    Dawar, Siddharth
    Goyal, Vikram
    Bera, Debajyoti
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT II, 2022, 13088 : 3 - 15
  • [10] A residual utility-based concept for high-utility itemset mining
    Sra, Pushp
    Chand, Satish
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023,