An evolutionary algorithm for mining rare association rules: a Big Data approach

Citations: 0
Authors
Padillo, F. [1 ]
Luna, J. M. [2 ]
Ventura, S. [1 ]
Affiliations
[1] Univ Cordoba, Dept Comp Sci & Numer Anal, Rabanales Campus, Cordoba, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen, Spain
Keywords
DOI
Not available
Chinese Library Classification
TP39 [Computer applications];
Discipline classification codes
081203 ; 0835 ;
Abstract
Association rule mining is one of the best-known techniques for discovering interesting relations between items in data. To date, this task has focused mainly on the discovery of frequent relationships. However, it is often interesting to focus on those that do not occur frequently. Rare association rule mining is an appealing field that aims to describe rare cases or unexpected behavior. It is especially useful on Big Data, where abnormal behavior is often of more interest than common behavior. In this sense, our aim is to propose a new evolutionary algorithm based on grammars to obtain rare association rules on Big Data. The novelty of our work is that it is designed from the outset to be parallel, enabling its use on emerging technologies such as Spark and Flink. Furthermore, while other algorithms focus on maximizing a couple of quality measures and ignore the rest, our fitness function has been designed to obtain a trade-off while maximizing a set of well-known quality measures. The experimental study includes more than 70 datasets and reveals promising efficiency results when more than 300 million instances and file sizes of up to 250 GB are considered, showing that the algorithm is able to run efficiently on huge volumes of data.
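As a rough illustration of what a "rare" association rule means in terms of standard quality measures, the following minimal Python sketch computes support, confidence, and lift for a rule and flags it as rare when its support is low but its confidence is high. This is not the authors' algorithm or fitness function, which the abstract does not specify; the function names, the toy transactions, and the thresholds `max_support` and `min_confidence` are all assumptions chosen only for illustration.

```python
from typing import Dict, FrozenSet, List

Transaction = FrozenSet[str]


def support(itemset: FrozenSet[str], transactions: List[Transaction]) -> float:
    """Fraction of transactions that contain every item of `itemset`."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def rule_measures(
    antecedent: FrozenSet[str],
    consequent: FrozenSet[str],
    transactions: List[Transaction],
) -> Dict[str, float]:
    """Support, confidence, and lift of the rule antecedent -> consequent."""
    sup_a = support(antecedent, transactions)
    sup_c = support(consequent, transactions)
    sup_rule = support(antecedent | consequent, transactions)
    confidence = sup_rule / sup_a if sup_a > 0 else 0.0
    lift = confidence / sup_c if sup_c > 0 else 0.0
    return {"support": sup_rule, "confidence": confidence, "lift": lift}


def is_rare_rule(
    measures: Dict[str, float],
    max_support: float = 0.2,      # hypothetical upper bound on support
    min_confidence: float = 0.8,   # hypothetical lower bound on confidence
) -> bool:
    """A rule is treated as rare here if it is infrequent but reliable."""
    return (
        0.0 < measures["support"] <= max_support
        and measures["confidence"] >= min_confidence
    )


# Toy data (invented for this sketch): ten market-basket transactions.
transactions: List[Transaction] = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "milk"}),
    frozenset({"bread", "milk"}),
    frozenset({"bread", "milk"}),
    frozenset({"bread", "milk"}),
    frozenset({"bread", "milk", "champagne"}),
    frozenset({"bread"}),
    frozenset({"bread"}),
    frozenset({"milk"}),
    frozenset({"caviar", "champagne"}),
]

rare = rule_measures(frozenset({"caviar"}), frozenset({"champagne"}), transactions)
frequent = rule_measures(frozenset({"bread"}), frozenset({"milk"}), transactions)

print("caviar -> champagne:", rare, "rare?", is_rare_rule(rare))
print("bread -> milk:", frequent, "rare?", is_rare_rule(frequent))
```

In this toy data the rule caviar -> champagne occurs in only one transaction out of ten yet always holds when its antecedent appears, so it passes the rarity check, whereas bread -> milk is too frequent to qualify. A fitness function that trades off several such measures, as the abstract describes, would combine them in some way that this sketch does not attempt to reproduce.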
Pages: 2007-2014
Page count: 8
Related papers
50 in total
  • [1] Mining Algorithm for Association Rules in Big Data Based on Hadoop
    Fu, Chunhua
    Wang, Xiaojing
    Zhang, Lijun
    Qiao, Liying
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [2] Mining association rules in big data with NGEP
    Yunliang Chen
    Fangyuan Li
    Junqing Fan
    Cluster Computing, 2015, 18 : 577 - 585
  • [3] Mining association rules in big data with NGEP
    Chen, Yunliang
    Li, Fangyuan
    Fan, Junqing
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 577 - 585
  • [4] An Improved Evolutionary Approach for Association Rules Mining
    Djenouri, Youcef
    Bendjoudi, Ahcene
    Nouali-Taboudjemat, Nadia
    Habbas, Zineb
    BIO-INSPIRED COMPUTING - THEORIES AND APPLICATIONS, BIC-TA 2014, 2014, 472 : 93 - 97
  • [5] An improved evolutionary approach for association rules mining
    Djenouri, Youcef
    Bendjoudi, Ahcene
    Nouali-Taboudjemat, Nadia
    Habbas, Zineb
    Communications in Computer and Information Science, 2014, 472 : 93 - 97
  • [6] Research on Hierarchical Mining Algorithm of Spatial Big Data Set Association Rules
    Wang, Yue
    Song, Wei
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT II, 2019, 302 : 200 - 208
  • [7] The Improved Research of Association Rules Mining Algorithm in High-Dimensional Big Data
    Du, Lingling
    NEW INDUSTRIALIZATION AND URBANIZATION DEVELOPMENT ANNUAL CONFERENCE: THE INTERNATIONAL FORUM ON NEW INDUSTRIALIZATION DEVELOPMENT IN BIG-DATA ERA, 2015, : 239 - 244
  • [8] Evolutionary approach for mining association rules on dynamic databases
    Shenoy, PD
    Srinivasa, KG
    Venugopal, KR
    Patnaik, LM
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2003, 2637 : 325 - 336
  • [9] Improvements in data mining association rules algorithm
    Li, Dai
    International Journal of Database Theory and Application, 2015, 8 (02): : 1 - 10
  • [10] Efficient algorithm for the extraction of association rules in data mining
    Mitra, Pinaki
    Chaudhuri, Chitrita
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 2, 2006, 3981 : 1 - 10