Knowledge Based Data Cleaning for Data Warehouse Quality

被引:0
|
作者
Bradji, Louardi [1 ,2 ]
Boufaida, Mahmoud [2 ]
机构
[1] Univ Tebessa, Tebessa 12002, Algeria
[2] Mentouri Univ Constantine, LIRE lab, Constantine 25017, Algeria
关键词
Data Cleaning; Data Quality; Data Warehouse; Knowledge;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes an approach for improvement the quality of data warehouse and operational databases with using knowledge. The benefit of this approach is three-folds. First, the incorporation of knowledge into data cleaning is successful to meet the user's demands and then the data cleaning can be expanded and modified. The knowledge that can be extracted automatically or manually is stored in repository in order to be used and validated among an appropriate process. Second, the propagation of cleaned data to their original sources in order to validate them by the user so the data cleaning can give valid values but incorrect. In addition, the mutual coherence of data is ensured. Third, the user interaction with data cleaning process is taken account in order to control it. The proposed approach is based in the idea that the quality of data will be assured at the sources and the target of data.
引用
收藏
页码:373 / +
页数:3
相关论文
共 50 条
  • [21] Stakeholder perceptions of data quality in a data warehouse environment
    Giannoccaro, A
    Shanks, G
    Darke, P
    [J]. AUSTRALIAN COMPUTER JOURNAL, 1999, 31 (04): : 110 - 117
  • [22] Data currency quality satisfaction in the design of a data warehouse
    Theodoratos, D
    Bouzeghoub, M
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2001, 10 (03) : 299 - 326
  • [23] Medical data mining: Knowledge discovery in a clinical data warehouse
    Prather, JC
    Lobach, DF
    Goodwin, LK
    Hales, JW
    Hage, ML
    Hammond, WE
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1997, : 101 - 105
  • [24] Building XML Data Warehouse with Data Reconstruction by Knowledge Graph
    Jiang, Yuyi
    Shao, Zhiqing
    Guo, Yi
    Zhang, Huanhuan
    Sun, Liping
    [J]. PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, : 314 - 320
  • [25] The BioKET biodiversity data warehouse: Data and knowledge integration and extraction
    [J]. Inthasone, Somsack, 1600, Springer Verlag (8819):
  • [26] The BioKET Biodiversity Data Warehouse: Data and Knowledge Integration and Extraction
    Inthasone, Somsack
    Pasquier, Nicolas
    Tettamanzi, Andrea G. B.
    Pereira, Celia da Costa
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIII, 2014, 8819 : 131 - 142
  • [27] Data Warehouse Design For Knowledge Discovery From Healthcare Data
    Ahmed, Aftab
    Zafar, Kashif
    Siddiqui, Abdul Basit
    Abdullah, Umair
    [J]. WORLD CONGRESS ON ENGINEERING - WCE 2013, VOL III, 2013, : 1589 - +
  • [28] Optimising data quality of a data warehouse using data purgation process
    Gupta, Neha
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2023, 15 (01) : 102 - 131
  • [29] Data Warehouse for Quality Management Systems
    慕春棣
    戴剑彬
    [J]. Tsinghua Science and Technology, 1998, (03) : 83 - 86
  • [30] Quality-Based Framework for Requirement Analysis in Data Warehouse
    Munawar
    Salim, Naomie
    Ibrahim, Roliana
    [J]. 2014 INTERNATIONAL CONFERENCE OF ADVANCED INFORMATICS: CONCEPT, THEORY AND APPLICATION (ICAICTA), 2014, : 152 - 158