Efficient mining of high utility pattern with considering of rarity and length

被引:22
|
作者
Kim, Donggyu [1 ]
Yun, Unil [1 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Data mining; Utility pattern mining; Multiple minimum support; FREQUENT PATTERNS; ASSOCIATION RULES; ITEMSETS; ALGORITHM;
D O I
10.1007/s10489-015-0750-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Techniques for mining rare patterns have been researched in the association rule mining area because traditional frequent pattern mining methods have to generate a large amount of unnecessary patterns in order to find rare patterns from large databases. One such technique, the multiple minimum support threshold framework was devised to extract rare patterns by using a different minimum item support threshold for each item in a database. Nevertheless, this framework cannot sufficiently reflect environments of the real world. The reason is that it does not consider weights of items, such as market prices of products and fatality rates of diseases, in its mining process. Therefore, an algorithm has been proposed to mine rare patterns with utilities exceeding a user-specified minimum utility by considering rarity and utility information of items. However, since this algorithm employs the concept of traditional high utility pattern mining, patterns' lengths are not considered for determining utilities of the patterns. If the length of a pattern is sufficiently long, the pattern is more likely to have enough utility to become a high utility pattern regardless of item utilities within the pattern. Therefore, the algorithm cannot guarantee that all items in amined pattern have high utilities. In this paper, we propose a novel algorithm that effectively reduces such dependency of patterns on their lengths by considering their lengths in the mining process in order to mine more meaningful rare patterns compared to patterns mined by previous algorithms. Experimental results demonstrate that our algorithm extracts a lesser number of more meaningful patterns and consumes less computational resources compared to state-of-the-art algorithms.
引用
收藏
页码:152 / 173
页数:22
相关论文
共 50 条
  • [1] Efficient mining of high utility pattern with considering of rarity and length
    Donggyu Kim
    Unil Yun
    [J]. Applied Intelligence, 2016, 45 : 152 - 173
  • [2] Efficient Algorithm for Mining High Utility Pattern Considering Length Constraints
    Singh, Kuldeep
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2019, 15 (03) : 1 - 27
  • [3] An Efficient Algorithm for High Utility Sequential Pattern Mining
    Wang, Jun-Zhe
    Yang, Zong-Hua
    Huang, Jiun-Long
    [J]. FRONTIER AND INNOVATION IN FUTURE COMPUTING AND COMMUNICATIONS, 2014, 301 : 49 - 56
  • [4] An Efficient Candidate Pruning Technique for High Utility Pattern Mining
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Lee, Young-Koo
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 749 - 756
  • [5] Efficient Chain Structure for High-Utility Sequential Pattern Mining
    Lin, Jerry Chun-Wei
    Li, Yuanfa
    Fournier-Viger, Philippe
    Djenouri, Youcef
    Zhang, Ji
    [J]. IEEE ACCESS, 2020, 8 : 40714 - 40722
  • [6] Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases
    Ahmed, Chowdhury Farhan
    Tanbeer, Syed Khairuzzaman
    Jeong, Byeong-Soo
    Lee, Young-Koo
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (12) : 1708 - 1721
  • [7] Scalable and Efficient Approach for High Temporal Fuzzy Utility Pattern Mining
    Ryu, Taewoong
    Kim, Heonho
    Lee, Chanhee
    Kim, Heonmo
    Vo, Bay
    Lin, Jerry Chun-Wei
    Pedrycz, Witold
    Yun, Unil
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (12) : 7672 - 7685
  • [8] EHNL: An efficient algorithm for mining high utility itemsets with negative utility value and length constraints
    Singh, Kuldeep
    Kumar, Ajay
    Singh, Shashank Sheshar
    Shakya, Harish Kumar
    Biswas, Bhaskar
    [J]. INFORMATION SCIENCES, 2019, 484 : 44 - 70
  • [9] Efficient approach for incremental high utility pattern mining with indexed list structure
    Yun, Unil
    Nam, Hyoju
    Lee, Gangin
    Yoon, Eunchul
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 95 : 221 - 239
  • [10] HUPM: Efficient High Utility Pattern Mining Algorithm for E-Business
    Bala, Myneni Madhu
    Dandamudi, Rohit
    [J]. PROCEEDINGS OF THE 2018 IEEE 8TH INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC 2018), 2018, : 191 - 195