Knowledge Based Data Cleaning for Data Warehouse Quality

Cited by: 0
Authors
Bradji, Louardi [1, 2]
Boufaida, Mahmoud [2]
Affiliations
[1] Univ Tebessa, Tebessa 12002, Algeria
[2] Mentouri Univ Constantine, LIRE lab, Constantine 25017, Algeria
Source
DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS, PT 2 | 2011 / Vol. 189
Keywords
Data Cleaning; Data Quality; Data Warehouse; Knowledge;
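DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract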
This paper describes an approach for improving the quality of data warehouses and operational databases by using knowledge. The benefit of this approach is threefold. First, incorporating knowledge into data cleaning helps meet the user's demands and allows the data cleaning to be expanded and modified. The knowledge, which can be extracted automatically or manually, is stored in a repository so that it can be used and validated through an appropriate process. Second, cleaned data are propagated back to their original sources so that the user can validate them, since data cleaning can produce values that are valid but incorrect. In addition, the mutual coherence of the data is ensured. Third, user interaction with the data cleaning process is taken into account in order to control it. The proposed approach is based on the idea that data quality should be assured at both the sources and the target of the data.
Pages: 373 / +
Page count: 3