Integrating clustering data mining into the multidimensional modeling of Data Warehouses with UML profiles

被引:0
|
作者
Zubcoff, Jose [1 ]
Pardillo, Jesus [2 ]
Trujillo, Juan [2 ]
机构
[1] Univ Alicante, Dept Sea Sci & Appl Biol, Alicante, Spain
[2] Univ of Alicante, Dept Software & Comp Syst, Alicante, Spain
关键词
conceptual modeling; multidimensional modeling; UML extension; KDD; Data Warehouse; data mining; clustering;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering can be considered the most important unsupervised learning technique finding similar behaviors (clusters) on large collections of data. Data warehouses (DWs) can help users to analyze stored data, because they contain preprocessed data for analysis purposes. Furthermore, the multidimensional (MD) model of DWs, intuitively represents the system underneath. However, most of the clustering data mining are applied at a low-level of abstraction to complex unstructured data. While there are several approaches for clustering on DWs, there is still not a conceptual model for clustering that facilitates modeling with this technique on the multidimensional (MD) model of a DW. Here, we propose (i) a conceptual model for clustering that helps focusing on the data-mining process at the adequate abstraction level and (ii) an extension of the unified modeling language (UML) by means of the UML profiling mechanism allowing us to design clustering data-mining models on top of the MD model of a DW. This will allow us to avoid the duplication of the time-consuming preprocessing stage and simplify the clustering design on top of DWs improving the discovery of knowledge.
引用
收藏
页码:199 / +
页数:2
相关论文
共 50 条
  • [1] A UML profile for multidimensional modeling in data warehouses
    Lujan-Mora, Sergio
    Trujillo, Juan
    Song, Il-Yeol
    [J]. DATA & KNOWLEDGE ENGINEERING, 2006, 59 (03) : 725 - 769
  • [2] Multidimensional data modeling for data warehouses
    Harbin Inst of Technology, Harbin, China
    [J]. Ruan Jian Xue Bao/Journal of Software, 2000, 11 (07): : 908 - 917
  • [3] A UML 2.0 profile to design Association Rule mining models in the multidimensional conceptual modeling of data warehouses
    Zubcoff, Jose
    Trujillo, Juan
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 63 (01) : 44 - 62
  • [4] Conceptual Multidimensional Modeling for Data Warehouses: A Survey
    Gosain, Anjana
    Singh, Jaspreeti
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 305 - 316
  • [5] Modeling Information Quality Risk for Data Mining in Data Warehouses
    Su, Ying
    Peng, Jie
    Jin, Zhanming
    [J]. HUMAN AND ECOLOGICAL RISK ASSESSMENT, 2009, 15 (02): : 332 - 350
  • [6] Conceptual modeling for classification mining in data warehouses
    Zubcoff, Jose
    Trujillo, Juan
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 566 - 575
  • [7] Extending the UML for designing association rule mining models for data warehouses
    Zubcoff, JJ
    Trujillo, J
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2005, 3589 : 11 - 21
  • [8] A UML based approach for modeling ETL processes in data warehouses
    Trujillo, J
    Luján-Mora, S
    [J]. CONCEPTUAL MODELING - ER 2003, PROCEEDINGS, 2003, 2813 : 307 - 320
  • [9] A UML profile for the conceptual modelling of data-mining with time-series in data warehouses
    Zubcoff, Jose
    Pardillo, Jesus
    Trujillo, Juan
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2009, 51 (06) : 977 - 992
  • [10] Mining multidimensional data using clustering techniques
    Pagani, Marco
    Bordogna, Gloria
    Valle, Massimiliano
    [J]. DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 382 - +