Integration inconsistencies removal in data mining

被引:0
|
作者
Stuller, J [1 ]
机构
[1] Acad Sci Czech Republ, Inst Comp Sci, Prague 18207 8, Czech Republic
关键词
data mining; data warehousing; databases integration; inconsistency; integrity constraint;
D O I
10.1117/12.381743
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The technological progress in the areas of the hardware, specially in the field of the (secondary) memories where the ever increasing capacities are paradoxically in the last several years available at ever decreasing prices and smaller physical sizes, and the software, continuously more and more user friendly, efficient and cheaper, together with the general expansion of the computers to almost all human activities, make it easier to realize the integration of many already existing databases. Unfortunately the process of databases integration can be accompanied by many various difficulties and problems. One of them is surely the possible occurrence of the inconsistencies appearing in this process of the integration. As we will see these inconsistencies can occur at various levels and they can be of different types. At the next stage some users go even further and try to get more from the accumulated data through data mining techniques. A data warehouse can be considered as a suitable technology for this purpose. Having in mind the data mining view of a data warehouse, one needs to know the sources of possible inconsistencies when building such a data warehouse in order to eliminate them as much as possible. In the paper we will define several existence conditions under which can occur different types of the inconsistencies in a warehouse and we will propose a classification of these inconsistencies based on the their sources. We will also propose a methodology and a procedure both of which aim at the elimination of these inconsistencies.
引用
收藏
页码:281 / 291
页数:11
相关论文
共 50 条
  • [1] A Solution of Data Inconsistencies in Data Integration — Designed for Pervasive Computing Environment
    Xin Wang
    Lin-Peng Huang
    Yi Zhang
    Xiao-Hui Xu
    Jun-Qing Chen
    [J]. Journal of Computer Science and Technology, 2010, 25 : 499 - 508
  • [2] A Solution of Data Inconsistencies in Data Integration - Designed for Pervasive Computing Environment
    Wang, Xin
    Huang, Lin-Peng
    Zhang, Yi
    Xu, Xiao-Hui
    Chen, Jun-Qing
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (03) : 499 - 508
  • [3] A Solution of Data Inconsistencies in Data Integration——Designed for Pervasive Computing Environment
    王欣
    黄林鹏
    章义
    徐小辉
    陈俊清
    [J]. Journal of Computer Science & Technology, 2010, 25 (03) : 499 - 508
  • [4] Fusionplex: resolution of data inconsistencies in the integration of heterogeneous information sources
    Motro, Amihai
    Anokhin, Philipp
    [J]. INFORMATION FUSION, 2006, 7 (02) : 176 - 196
  • [5] Integration of text and data mining
    Drewes, B
    [J]. DATA MINING III, 2002, 6 : 289 - 298
  • [6] Data mining and integration for environmental data archives
    Hluchy, L.
    Habala, O.
    Ciglan, M.
    Iran, V.
    Simo, B.
    [J]. RUSSIAN JOURNAL OF EARTH SCIENCES, 2009, 11 (02):
  • [7] Integration and Automation of Data Preparation and Data Mining
    Narayanan, Shrikanth
    Jaiswal, Ayush
    Chiang, Yao-Yi
    Geng, Yanhui
    Knoblock, Craig A.
    Szekely, Pedro
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 1076 - 1085
  • [8] Integration of Data Mining with Game Theory
    Wang, Yi
    [J]. Knowledge Enterprise: Intelligent Strategies in Product Design, Manufacturing, and Management, 2006, 207 : 275 - 280
  • [9] A Distributed Architecture for Data Mining and Integration
    Atkinson, Malcolm P.
    van Hemert, Jano I.
    Han, Liangxiu
    Hume, Ally
    Liew, Chee Sun
    [J]. DADC 2009: SECOND INTERNATIONAL WORKSHOP ON DATA AWARE DISTRIBUTED COMPUTING, 2009, : 11 - 20
  • [10] Advanced data integration and data mining for enviromental scenarios
    Hluchy, Ladislav
    Krammer, Peter
    Habala, Ondrej
    Seleng, Martin
    Tran, Viet
    [J]. 12TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2010), 2011, : 400 - 406