Combination of Two Evolutionary Methods for Mining Association Rules in Large and Dense Databases

被引:3
|
作者
Gonzales, Eloy [1 ]
Taboada, Karla [1 ]
Mabu, Shingo [1 ]
Shimada, Kaoru [2 ]
Hirasawa, Kotaro [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, 2-7 Hibikino, Kitakyushu, Fukuoka 8080135, Japan
[2] Waseda Univ, Informat Prod & Syst Res Ctr, Wakamatsu Ku, Kitakyushu, Fukuoka 8080135, Japan
关键词
evolutionary computation; Genetic Network Programming (GNP); data mining; association rules; Genetic Algorithms (GA);
D O I
10.20965/jaciii.2009.p0561
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Among several methods of extracting association rules that have been reported, a new evolutionary method named Genetic Network Programming (GNP) has also shown its effectiveness for small databases in the sense that they have a relatively small number of attributes. However, this conventional GNP method is not be able to deal with large databases with a huge number of attributes, because its search space becomes very large, causing bad performance at running time. The aim of this paper is to propose a new method to extract association rules from large and dense databases with a huge amount of attributes through the combination of conventional GNP based mining method and a specially designed genetic algorithm (GA). Each of these evolutionary methods works in its own processing level and they are highly synchronized to act as one system. Our strategy consists in the division of a large and dense database into many small databases. These small databases are considered as individuals and form a population. Then the conventional GNP based mining method is applied to extract association rules for each of these individuals. Finally, the population is evolved through several generations using GA with special genetic operators considering the acquired information. Two complementary processing levels are defined: Global Level and Local Level, each with its own independent tasks and processes. In the Global Level mainly GA process is carried out, whereas in the Local Level, conventional GNP based mining method is carried out in parallel and they generate their own local pools of association rules. Several special genetic operations for GA in the Global Level are proposed and the performance of each of them and their combination is shown and compared. In our simulations, the conventional GNP based mining method and our proposed method are compared using a real world large and dense database with a huge amount of attributes. The results show that extending the conventional GNP based mining method using GA allows to extract association rules from large and dense databases directly and more efficiently than the conventional GNP method.
引用
收藏
页码:561 / 572
页数:12
相关论文
共 50 条
  • [41] An Efficient Approach for Mining Positive and Negative Association Rules from Large Transactional Databases
    Kishor, Peddi
    Porika, Sammulal
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 1, 2016, : 85 - 89
  • [42] An efficient algorithm for mining quantitative association rules to raise reliance of data in large databases
    Lee, HJ
    Park, WH
    Park, DS
    DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 672 - 681
  • [43] Mining interesting association rules from customer databases and transaction databases
    Tsai, PSM
    Chen, CM
    INFORMATION SYSTEMS, 2004, 29 (08) : 685 - 696
  • [44] Multipass Algorithms for Mining Association Rules in Text Databases
    John D. Holt
    Soon M. Chung
    Knowledge and Information Systems, 2001, 3 (2) : 168 - 183
  • [45] Mining association rules of quantitative movement pattern in databases
    Yuan, XJ
    Kang, YN
    Wang, XR
    Yu, CJ
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C32 - C37
  • [46] Secure Mining of Association Rules in Horizontally Distributed Databases
    Tassa, Tamir
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (04) : 970 - 983
  • [47] An Efficient Framework for Mining Association Rules in the Distributed Databases
    Goyal, Lalit Mohan
    Beg, M. M. Sufyan
    Ahmad, Tanvir
    COMPUTER JOURNAL, 2018, 61 (05): : 645 - 657
  • [48] Mining association rules with improved semantics in medical databases
    Delgado, M
    Sánchez, D
    Martín-Bautista, MJ
    Vila, MA
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2001, 21 (1-3) : 241 - 245
  • [49] Mining association rules in non-transactional databases
    Lee, Ho-Jong
    Lim, Seung-Hwan
    Oh, Hyun-Kyo
    Cho, Jinsoo
    Kim, Sang-Wook
    Cha, Jaehyuk
    Lee, Junghoon
    Kim, Hanil
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (11B): : 5055 - 5069
  • [50] Parallel mining of association rules from text databases
    Holt, John D.
    Chung, Soon M.
    JOURNAL OF SUPERCOMPUTING, 2007, 39 (03): : 273 - 299