Big data quality framework: a holistic approach to continuous quality management

Authors
Ikbal Taleb
Mohamed Adel Serhani
Chafik Bouhaddioui
Rachida Dssouli
Affiliations
[1] Zayed University, College of Technological Innovation
[2] UAE University, College of Information Technology
[3] UAE University, Department of Statistics, College of Business and Economics
[4] Concordia Institute for Information Systems Engineering
[5] Concordia University
Source
Journal of Big Data, 2021, 8 (01)
Keywords
Big data quality; Data quality profile; Quality assessment; Quality metrics and scores; Pre-processing
DOI
Not available
Abstract
Big Data is an essential research area for governments, institutions, and private agencies that rely on it to support their analytics decisions. The term covers how data is collected, processed, and analyzed to generate value-added, data-driven insights and decisions. Degradation in data quality can have unpredictable consequences, because confidence in the data and in its source is lost. In the Big Data context, data characteristics such as volume, multiple heterogeneous data sources, and fast data generation increase the risk of quality degradation and call for efficient mechanisms to check data worthiness. However, ensuring Big Data Quality (BDQ) is a very costly and time-consuming process, since it requires excessive computing resources. Maintaining quality throughout the Big Data lifecycle requires quality profiling and verification before any processing decision is made. This paper proposes a BDQ Management Framework that enhances pre-processing activities while strengthening data control. The framework uses a new concept called the Big Data Quality Profile, which captures the quality outline, requirements, attributes, dimensions, scores, and rules. Using the framework's Big Data profiling and sampling components, a faster and more efficient data quality estimation is performed before and after an intermediate pre-processing phase. The exploratory profiling component plays the initial role in quality profiling: it uses a set of predefined quality metrics to evaluate important data quality dimensions. It generates quality rules by applying various pre-processing activities and their related functions. These rules mainly target the Data Quality Profile and result in quality scores for the selected quality attributes.
The paper discusses the framework implementation and dataflow management across the various quality management processes, and concludes with ongoing work on framework evaluation and deployment to support quality evaluation decisions.
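To make the idea of sampling-based quality estimation concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes records are plain dictionaries and scores a single dimension, completeness, defined here as the fraction of non-missing values. The function names (`completeness`, `sample_records`, `quality_profile`), the sampling fraction, and the metric definition are all hypothetical choices made for illustration; the abstract does not specify them.

```python
import random


def completeness(records, attribute):
    """Fraction of records with a present value (not None, not empty string).

    Assumed metric definition for illustration; the paper's own metrics
    are not given in the abstract.
    """
    if not records:
        return 0.0
    present = sum(1 for r in records if r.get(attribute) not in (None, ""))
    return present / len(records)


def sample_records(records, fraction, seed=0):
    """Draw a simple random sample without replacement (seeded for repeatability)."""
    rng = random.Random(seed)
    k = max(1, round(len(records) * fraction))
    return rng.sample(records, min(k, len(records)))


def quality_profile(records, attributes, fraction=0.1, seed=0):
    """Estimate a completeness score per attribute from a sample of the data."""
    sample = sample_records(records, fraction, seed)
    return {a: completeness(sample, a) for a in attributes}


# Hypothetical data set: every fourth record is missing its "age" value.
data = [{"id": i, "age": None if i % 4 == 0 else 30} for i in range(1000)]
profile = quality_profile(data, ["id", "age"], fraction=0.2)
print(profile)
```

Scoring a sample rather than the full data set trades some accuracy for speed, which is the motivation the abstract gives for the profiling and sampling components. The same scores could be recomputed after a pre-processing step to compare quality before and after.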
Related papers
  • [1] Big data quality framework: a holistic approach to continuous quality management
    Taleb, Ikbal
    Serhani, Mohamed Adel
    Bouhaddioui, Chafik
    Dssouli, Rachida
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [2] A Big Data Approach for Memory Quality Management
    Yeo, Yvonne
    Xue, Feng
    Low, Wen Wei
    Yoon, Jung H.
    Gold, Steve
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016 : 2448 - 2452
  • [3] A Holistic Framework for Big Scientific Data Management
    Kantere, Verena
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014 : 220 - 226
  • [4] Quality Management in Big Data
    Ge, Mouzhi
    Dohnal, Vlastislav
    [J]. INFORMATICS-BASEL, 2018, 5 (02)
  • [5] A big data analytics approach to quality, reliability and risk management
    Mazzuto, Giovanni
    Ciarapica, Filippo Emanuele
    [J]. INTERNATIONAL JOURNAL OF QUALITY & RELIABILITY MANAGEMENT, 2019, 36 (01) : 2 - 6
  • [6] THE BIG PICTURE - TOTAL QUALITY MANAGEMENT AND CONTINUOUS QUALITY IMPROVEMENT
    KIRK, R
    [J]. JOURNAL OF NURSING ADMINISTRATION, 1992, 22 (04) : 24 - 31
  • [7] Total quality management in education a holistic approach
    de Beer, WHJ
    Fowler, M
    Camerius, JW
    Egle, F
    [J]. COMPLEX DEMANDS ON TEACHING REQUIRE INNOVATION: CASE METHOD & OTHER TECHNIQUES, 2000 : 313 - 323
  • [8] Drinking water quality management: a holistic approach
    Rizak, S
    Cunliffe, D
    Sinclair, M
    Vulcano, R
    Howard, J
    Hrudey, S
    Callan, P
    [J]. WATER SCIENCE AND TECHNOLOGY, 2003, 47 (09) : 31 - 36
  • [9] Data Quality Management for Big Data Applications
    Khaleel, Majida Yaseen
    Hamad, Murtadha M.
    [J]. 12TH INTERNATIONAL CONFERENCE ON THE DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2019), 2019 : 357 - 362
  • [10] A Framework for Ensuring the Quality of a Big Data Service
    Ding, Junhua
    Zhang, Dongmei
    Hui, Xin-Hua
    [J]. PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2016), 2016 : 82 - 89