An evolutionary algorithm for mining rare association rules: a Big Data approach

被引:0
|
作者
Padillo, F. [1 ]
Luna, J. M. [2 ]
Ventura, S. [1 ]
机构
[1] Univ Cordoba, Dept Comp Sci & Numer Anal, Rabanales Campus, Cordoba, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen, Spain
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Association rule mining is one of the most well-known techniques to discover interesting relations between items in data. To date, this task has been mainly focused on the discovery of frequent relationships. However, it is often interesting to focus on those that do not occur frequently. Rare association rule mining is an alluring field aiming at describing rare cases or unexpected behavior. This field is really useful over Big Data where abnormal endeavor are more curious than common behavior. In this sense, our aim is to propose a new evolutionary algorithm based on grammars to obtain rare association rules on Big Data. The novelty of our work is that it is eminently designed to be parallel, enabling its use over emerging technologies as Spark and Flink. Furthermore, while other algorithms focus on maximizing a couple of quality measure ignoring the rest, our fitness function has been precisely designed to obtain a trade-off while maximizing a set of well-known quality measures. The experimental study includes more than 70 datasets revealing alluring results in efficiency when more than 300 million of instances and file sizes up to 250 GBytes are considered, and proving that it is able to run efficiently in huge volumes of data.
引用
收藏
页码:2007 / 2014
页数:8
相关论文
共 50 条
  • [21] Approach of organization data based on mining of association rules
    Kong, Lingfu
    Wang, Han
    Lian, Qiusheng
    Jisuanji Gongcheng/Computer Engineering, 2006, 32 (21): : 12 - 14
  • [22] A new approach for mining association rules in data warehouses
    Ribeiro, MX
    Vieira, MTP
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2004, 3055 : 98 - 110
  • [23] Association rules mining algorithm
    Bhowmik, R
    Proceedings of the ISCA 20th International Conference on Computers and Their Applications, 2005, : 86 - 90
  • [24] Online education big data mining method based on association rules
    Zhang N.
    International Journal of Information and Communication Technology, 2024, 24 (03) : 262 - 272
  • [25] Mining association rules on Big Data through MapReduce genetic programming
    Padillo, F.
    Luna, J. M.
    Herrera, F.
    Ventura, S.
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2018, 25 (01) : 31 - 48
  • [26] Hadoop based Mining of Distributed Association Rules from Big Data
    Bouraoui, Marwa
    Bouzouita, Ines
    Touzi, Amel Grissa
    2017 18TH INTERNATIONAL CONFERENCE ON SCIENCES AND TECHNIQUES OF AUTOMATIC CONTROL AND COMPUTER ENGINEERING (STA), 2017, : 185 - 190
  • [27] A dual-objective evolutionary algorithm for rules extraction in data mining
    Tan, K. C.
    Yu, Q.
    Ang, J. H.
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2006, 34 (02) : 273 - 294
  • [28] A Dual-Objective Evolutionary Algorithm for Rules Extraction in Data Mining
    K. C. Tan
    Q. Yu
    J. H. Ang
    Computational Optimization and Applications, 2006, 34 (2) : 273 - 294
  • [29] Study of an improved Apriori algorithm for data mining of association rules
    Zhang, Xueting
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 1211 - 1218
  • [30] A fast algorithm for mining association rules in medical image data
    Olukunle, A
    Ehikioya, S
    IEEE CCEC 2002: CANADIAN CONFERENCE ON ELECTRCIAL AND COMPUTER ENGINEERING, VOLS 1-3, CONFERENCE PROCEEDINGS, 2002, : 1181 - 1187