Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications

Authors
Sunita Sarawagi
Shiby Thomas
Rakesh Agrawal
Affiliations
[1] IBM Almaden Research Center
[2] IBM Almaden Research Center
[3] IBM Almaden Research Center
Source
Data Mining and Knowledge Discovery, 2000, 4 (2-3)
Keywords
mining system architecture; association rule mining; database mining; mining algorithms in SQL
DOI
Not available
Abstract
Data mining on large data warehouses is becoming increasingly important. In support of this trend, we consider a spectrum of architectural alternatives for coupling mining with database systems. These alternatives include: loose coupling through a SQL cursor interface; encapsulation of a mining algorithm in a stored procedure; caching the data to a file system on the fly and mining it there; tight coupling using primarily user-defined functions; and SQL implementations for processing in the DBMS. We comprehensively study the option of expressing the mining algorithm in the form of SQL queries, using association rule mining as a case in point. We consider four options in SQL-92 and six options in SQL enhanced with object-relational extensions (SQL-OR). Our evaluation of the different architectural alternatives shows that, from a performance perspective, the Cache option is superior, although the performance of the SQL-OR option is within a factor of two. Both the Cache and the SQL-OR approaches incur a higher storage penalty than the loose-coupling approach, which performance-wise is a factor of 3 to 4 worse than Cache. The SQL-92 implementations were too slow to qualify as a competitive option. We also compare these alternatives on the basis of qualitative factors such as automatic parallelization, development ease, portability, and interoperability. As a byproduct of this study, we identify some primitives for native support in database systems for decision-support applications.
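To make the idea of expressing a mining step as a SQL query concrete, the following is a minimal sketch of support counting for 2-itemsets using a self-join with GROUP BY and HAVING. It is not one of the SQL-92 or SQL-OR formulations evaluated in the paper. The table name `sales`, its columns `tid` and `item`, the function name `frequent_pairs`, and the sample baskets are all hypothetical, and SQLite (through Python's `sqlite3` module) stands in for the DBMS purely for illustration.

```python
import sqlite3


def frequent_pairs(transactions, min_support):
    """Return (item1, item2, support) for item pairs meeting min_support.

    Illustrative only: the schema and query are assumptions, not the
    paper's queries.
    """
    conn = sqlite3.connect(":memory:")
    # One row per (transaction id, item), a common relational layout
    # for market-basket data.
    conn.execute("CREATE TABLE sales (tid INTEGER, item TEXT)")
    rows = [
        (tid, item)
        for tid, items in enumerate(transactions)
        for item in set(items)  # drop duplicate items within a basket
    ]
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)

    # Self-join on tid pairs up items bought together; a.item < b.item
    # keeps each unordered pair once. HAVING applies the support threshold.
    query = """
        SELECT a.item AS item1, b.item AS item2, COUNT(*) AS support
        FROM sales a JOIN sales b
          ON a.tid = b.tid AND a.item < b.item
        GROUP BY a.item, b.item
        HAVING COUNT(*) >= ?
        ORDER BY a.item, b.item
    """
    result = conn.execute(query, (min_support,)).fetchall()
    conn.close()
    return result


baskets = [
    ["bread", "milk"],
    ["bread", "beer", "milk"],
    ["beer", "milk"],
    ["bread", "milk", "eggs"],
]
pairs = frequent_pairs(baskets, min_support=2)
print(pairs)  # [('beer', 'milk', 2), ('bread', 'milk', 3)]
```

Larger itemsets would need further joins or candidate tables per pass, which is where the choice among the architectural alternatives described in the abstract starts to matter.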
Pages: 89 - 125
Page count: 36
Related papers
  • [1] Integrating association rule mining with relational database systems: Alternatives and implications
    Sarawagi, S
    Thomas, S
    Agrawal, R
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2000, 4 (2-3) : 89 - 125
  • [2] AN IMPROVED ALGORITHM FOR MINING ASSOCIATION RULE IN RELATIONAL DATABASE
    Wang, Pei
    An, Chunhong
    Wang, Lei
    [J]. PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2014 : 247 - 252
  • [3] Integrating frequent itemsets mining with relational database
    Qiu Yong
    [J]. ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL II, 2007 : 543 - 546
  • [4] Integrating constraint and relational database systems
    Cai, MC
    [J]. CONSTRAINT DATABASES, PROCEEDINGS, 2004, 3074 : 173 - 180
  • [5] Integrating XML and relational database systems
    Kappel, G
    Kapsammer, E
    Retschitzegger, W
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2004, 7 (04) : 343 - 384
  • [6] Integrating XML and Relational Database Systems
    Gerti Kappel
    Elisabeth Kapsammer
    Werner Retschitzegger
    [J]. World Wide Web, 2004, 7 : 343 - 384
  • [7] Association Rule Mining on Fragmented Database
    Hamzaoui, Amel
    Malluhi, Qutaibah
    Clifton, Chris
    Riley, Ryan
    [J]. DATA PRIVACY MANAGEMENT, AUTONOMOUS SPONTANEOUS SECURITY, AND SECURITY ASSURANCE, 2015, 8872 : 335 - 342
  • [8] A layered optimizer for mining association rules over relational database management systems
    Dudgikar, M
    Chakravarthy, S
    Liuzzi, R
    Wong, L
    [J]. IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003 : 422 - 427
  • [9] Implementing Multi-relational Mining with Relational Database Systems
    Inuzuka, Nobuhiro
    Makino, Toshiyuki
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT II, PROCEEDINGS, 2009, 5712 : 672 - 680
  • [10] A novel concurrent relational association rule mining approach
    Czibula, Gabriela
    Czibula, Istvan Gergely
    Miholca, Diana-Lucia
    Crivei, Liana Maria
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 125 : 142 - 156