Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications

Authors
Sunita Sarawagi
Shiby Thomas
Rakesh Agrawal
Affiliations
[1] IBM Almaden Research Center
[2] IBM Almaden Research Center
[3] IBM Almaden Research Center
Keywords
mining system architecture; association rule mining; database mining; mining algorithms in SQL
DOI
Not available
Abstract
Data mining on large data warehouses is becoming increasingly important. In support of this trend, we consider a spectrum of architectural alternatives for coupling mining with database systems. These alternatives include: loose-coupling through a SQL cursor interface; encapsulation of a mining algorithm in a stored procedure; caching the data to a file system on the fly and mining; tight-coupling using primarily user-defined functions; and SQL implementations for processing in the DBMS. We comprehensively study the option of expressing the mining algorithm in the form of SQL queries, using association rule mining as a case in point. We consider four options in SQL-92 and six options in SQL enhanced with object-relational extensions (SQL-OR). Our evaluation of the different architectural alternatives shows that, from a performance perspective, the Cache option is superior, although the performance of the SQL-OR option is within a factor of two. Both the Cache and the SQL-OR approaches incur a higher storage penalty than the loose-coupling approach, which performance-wise is a factor of 3 to 4 worse than Cache. The SQL-92 implementations were too slow to qualify as a competitive option. We also compare these alternatives on the basis of qualitative factors like automatic parallelization, development ease, portability and interoperability. As a byproduct of this study, we identify some primitives for native support in database systems for decision-support applications.
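To make the idea of expressing a mining step as a SQL query concrete, the following is a minimal sketch of support counting for 2-itemsets with a plain self-join and GROUP BY. It is an illustration only, not one of the SQL-92 formulations evaluated in the paper, which the abstract does not spell out. The table name `sales`, the columns `tid` and `item`, the function `frequent_pairs`, and the use of SQLite through Python's `sqlite3` module are all assumptions made for this example.

```python
import sqlite3


def frequent_pairs(transactions, min_support):
    """Return (item_a, item_b, support) for item pairs found in at least
    min_support transactions, counted entirely inside the database."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical layout: one row per (transaction id, item).
    conn.execute("CREATE TABLE sales (tid INTEGER, item TEXT)")
    rows = [
        (tid, item)
        for tid, items in enumerate(transactions)
        for item in set(items)  # de-duplicate items within a transaction
    ]
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)

    # Self-join on the transaction id generates every item pair per
    # transaction; a.item < b.item keeps each pair once, in sorted order.
    query = """
        SELECT a.item, b.item, COUNT(*) AS support
        FROM sales a JOIN sales b
          ON a.tid = b.tid AND a.item < b.item
        GROUP BY a.item, b.item
        HAVING COUNT(*) >= ?
        ORDER BY a.item, b.item
    """
    result = conn.execute(query, (min_support,)).fetchall()
    conn.close()
    return result


baskets = [
    ["bread", "milk"],
    ["bread", "beer", "milk"],
    ["beer", "milk"],
    ["bread", "milk", "eggs"],
]
print(frequent_pairs(baskets, 2))
# prints [('beer', 'milk', 2), ('bread', 'milk', 3)]
```

The join produces every co-occurring pair before any filtering, so the intermediate result grows quickly with transaction length. That cost is one plausible reason a pure SQL formulation could lag behind the other alternatives, though this sketch makes no claim about the paper's measurements.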
Pages: 89-125
Page count: 36