FHM plus : Faster High-Utility Itemset Mining Using Length Upper-Bound Reduction

被引:23
|
作者
Fournier-Viger, Philippe [1 ]
Lin, Jerry Chun-Wei [2 ]
Duong, Quang-Huy [3 ]
Dam, Thu-Lan [3 ,4 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Sch Nat Sci & Humanities, Shenzhen, Peoples R China
[2] Harbin Inst Technol, Shenzhen Grad Sch, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[3] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Hunan, Peoples R China
[4] Hanoi Univ Ind, Fac Informat Technol, Hanoi, Vietnam
关键词
Pattern mining; High-utility itemsets; Length constraints;
D O I
10.1007/978-3-319-42007-3_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-utility itemset (HUI) mining is a popular data mining task, consisting of enumerating all groups of items that yield a high profit in a customer transaction database. However, an important issue with traditional HUI mining algorithms is that they tend to find itemsets having many items. But those itemsets are often rare, and thus may be less interesting than smaller itemsets for users. In this paper, we address this issue by presenting a novel algorithm named FHM+ for mining HUIs, while considering length constraints. To discover HUIs efficiently with length constraints, FHM+ introduces the concept of Length Upper-Bound Reduction (LUR), and two novel upper-bounds on the utility of itemsets. An extensive experimental evaluation shows that length constraints are effective at reducing the number of patterns, and the novel upper-bounds can greatly decrease the execution time, and memory usage for HUI mining.
引用
收藏
页码:115 / 127
页数:13
相关论文
共 50 条
  • [31] EFIM: a fast and memory efficient algorithm for high-utility itemset mining
    Souleymane Zida
    Philippe Fournier-Viger
    Jerry Chun-Wei Lin
    Cheng-Wei Wu
    Vincent S. Tseng
    Knowledge and Information Systems, 2017, 51 : 595 - 625
  • [32] Efficient evolutionary computation model of closed high-utility itemset mining
    Jerry Chun-Wei Lin
    Youcef Djenouri
    Gautam Srivastava
    Philippe Fourier-Viger
    Applied Intelligence, 2022, 52 : 10604 - 10616
  • [33] Efficient High-Utility Itemset Mining Over Variety of Databases: A Survey
    Suvarna, U.
    Srinivas, Y.
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 803 - 816
  • [34] Efficient high-utility occupancy itemset mining algorithm on massive data
    He, Jingxuan
    Han, Xixian
    Wang, Jinbao
    Zhang, Kaiqi
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 210
  • [35] Efficient High-utility Itemset Mining Based on a Novel Data Structure
    Shen, Wei
    Zhang, Chao
    Fang, Wei
    Zhang, Xin
    Than, Zhi-Hui
    Lin, Jerry Chun-Wei
    2021 IEEE INTERNATIONAL SMART CITIES CONFERENCE (ISC2), 2021,
  • [36] Cross-Level High-Utility Itemset Mining Using Multi-core Processing
    Tung, N. T.
    Nguyen, Loan T. T.
    Nguyen, Trinh D. D.
    Kozierkiewicz, Adrianna
    COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 12876 : 467 - 479
  • [37] CLS-Miner: efficient and effective closed high-utility itemset mining
    Thu-Lan Dam
    Li, Kenli
    Fournier-Viger, Philippe
    Quang-Huy Duong
    FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (02) : 357 - 381
  • [38] Effective sanitization approaches to protect sensitive knowledge in high-utility itemset mining
    Liu, Xuan
    Wen, Shiting
    Zuo, Wanli
    APPLIED INTELLIGENCE, 2020, 50 (01) : 169 - 191
  • [39] CLS-Miner: efficient and effective closed high-utility itemset mining
    Thu-Lan Dam
    Kenli Li
    Philippe Fournier-Viger
    Quang-Huy Duong
    Frontiers of Computer Science, 2019, 13 : 357 - 381
  • [40] Effective sanitization approaches to protect sensitive knowledge in high-utility itemset mining
    Xuan Liu
    Shiting Wen
    Wanli Zuo
    Applied Intelligence, 2020, 50 : 169 - 191