Integration inconsistencies removal in data mining

被引:0
|
作者
Stuller, J [1 ]
机构
[1] Acad Sci Czech Republ, Inst Comp Sci, Prague 18207 8, Czech Republic
关键词
data mining; data warehousing; databases integration; inconsistency; integrity constraint;
D O I
10.1117/12.381743
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The technological progress in the areas of the hardware, specially in the field of the (secondary) memories where the ever increasing capacities are paradoxically in the last several years available at ever decreasing prices and smaller physical sizes, and the software, continuously more and more user friendly, efficient and cheaper, together with the general expansion of the computers to almost all human activities, make it easier to realize the integration of many already existing databases. Unfortunately the process of databases integration can be accompanied by many various difficulties and problems. One of them is surely the possible occurrence of the inconsistencies appearing in this process of the integration. As we will see these inconsistencies can occur at various levels and they can be of different types. At the next stage some users go even further and try to get more from the accumulated data through data mining techniques. A data warehouse can be considered as a suitable technology for this purpose. Having in mind the data mining view of a data warehouse, one needs to know the sources of possible inconsistencies when building such a data warehouse in order to eliminate them as much as possible. In the paper we will define several existence conditions under which can occur different types of the inconsistencies in a warehouse and we will propose a classification of these inconsistencies based on the their sources. We will also propose a methodology and a procedure both of which aim at the elimination of these inconsistencies.
引用
收藏
页码:281 / 291
页数:11
相关论文
共 50 条
  • [21] Data mining and decision support: Integration and collaboration
    Bawden, D
    [J]. JOURNAL OF DOCUMENTATION, 2005, 61 (03) : 443 - 445
  • [22] An overview of data mining in heterogeneous schema integration
    Dao, S
    Perry, B
    [J]. WESCON - 96, CONFERENCE PROCEEDINGS, 1996, : 478 - 483
  • [23] INTEGRATION OF GEOPHYSICAL DATA INTO MINING EXPLORATION PROGRAMS
    WRIGHT, PM
    [J]. ECONOMIC GEOLOGY, 1975, 70 (01) : 253 - 253
  • [24] Distributed web integration with multiagent data mining
    Niimi, A
    Noji, H
    Konishi, O
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2005, 3683 : 513 - 519
  • [25] Inconsistencies in Big Data
    Zhang, Du
    [J]. PROCEEDINGS OF THE 2013 12TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI CC 2013), 2013, : 61 - 67
  • [26] An integration of data mining and data warehousing for hierarchical multimedia information retrieval
    You, J
    Dillon, T
    Liu, J
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 373 - 376
  • [27] Data Mining and Integration for Predicting Significant Meteorological Phenomena
    Bartok, Juraj
    Habala, Ondrej
    Bednar, Peter
    Gazak, Martin
    Hluchy, Ladislav
    [J]. ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 37 - 46
  • [28] A mutually beneficial integration of data mining and information extraction
    Nahm, UY
    Mooney, RJ
    [J]. SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 627 - 632
  • [29] Research on the Integration of Academic Resources Based on Data Mining
    Yang, Bo
    Zhang, Lina
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON MANAGEMENT, EDUCATION, INFORMATION AND CONTROL (MEICI 2017), 2017, 156 : 696 - 699
  • [30] Mining and integration of pathway diagrams from imaging data
    Kozhenkov, Sergey
    Baitaluk, Michael
    [J]. BIOINFORMATICS, 2012, 28 (05) : 739 - 742