From Data Quality to Big Data Quality

被引:67
|
作者
Batini, Carlo [1 ]
Rula, Anisa [1 ]
Scannapieco, Monica [2 ]
Viscusi, Gianluigi [3 ]
机构
[1] Univ Milano Bicocca, Dept Informat Syst & Commun DISCo, Milan, Italy
[2] Italian Natl Inst Stat Istat, Rome, Italy
[3] Ecole Polytech Fed Lausanne, CDM MTEI CSI, Lausanne, Switzerland
关键词
Data Quality; Big Data; Linked Open Data; Maps; Semi-Structured Texts; SENSOR;
D O I
10.4018/JDM.2015010103
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates the evolution of data quality issues from traditional structured data managed in relational databases to Big Data. In particular, the paper examines the nature of the relationship between Data Quality and several research coordinates that are relevant in Big Data, such as the variety of data types, data sources and application domains, focusing on maps, semi-structured texts, linked open data, sensor & sensor networks and official statistics. Consequently a set of structural characteristics is identified and a systematization of the a posteriori correlation between them and quality dimensions is provided. Finally, Big Data quality issues are considered in a conceptual framework suitable to map the evolution of the quality paradigm according to three core coordinates that are significant in the context of the Big Data phenomenon: the data type considered, the source of data, and the application domain. Thus, the framework allows ascertaining the relevant changes in data quality emerging with the Big Data phenomenon, through an integrative and theoretical literature review.
引用
收藏
页码:60 / 82
页数:23
相关论文
共 50 条
  • [31] Quality improvement in the era of big data
    Hassett, Michael
    [J]. ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2021, 17 : 70 - 70
  • [32] Quality of Big Data in health care
    Sukumar, Sreenivas R.
    Natarajan, Ramachandran
    Ferrell, Regina K.
    [J]. INTERNATIONAL JOURNAL OF HEALTH CARE QUALITY ASSURANCE, 2015, 28 (06) : 621 - +
  • [33] Quality Evaluation for Documental Big Data
    Fugini, Mariagrazia
    Finocchi, Jacopo
    [J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS), VOL 1, 2020, : 132 - 139
  • [34] Big Data Quality - Whose problem is it?
    Sadiq, Shazia
    Papotti, Paolo
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1446 - 1447
  • [35] Quality Issues with Big data Analytics
    Sangeeta
    Sharma, Kapil
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3589 - 3591
  • [36] Big Data and Quality: A Literature Review
    Lakshen, Guma Abdulkhader
    Vranes, Sanja
    Janev, Valentina
    [J]. 2016 24TH TELECOMMUNICATIONS FORUM (TELFOR), 2016, : 802 - 805
  • [37] Big Data in Food Safety and Quality
    Strawn, Laura K.
    Brown, Eric W.
    David, Jairus R. D.
    Den Bakker, Henk C.
    Vangay, Pajau
    Yiannas, Frank
    Wiedmann, Martin
    [J]. FOOD TECHNOLOGY, 2015, 69 (02) : 42 - +
  • [38] Quality Analytics in a Big Data Supply Chain Commodity Data Analytics for Quality Engineering
    Tan, Julian S. K.
    Ang, Ai Kiar
    Lu, Liu
    Gan, Sheena W. Q.
    Corral, Marilyn G.
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 3455 - 3463
  • [39] Quality of Information for Quality of Life: Healthcare Big Data Analytics
    Dantanarayana, G. G. T.
    Sahama, Tony
    Wikramanayake, G. N.
    [J]. 2015 Fifteenth International Conference on Advances in ICT for Emerging Regions (ICTer), 2015, : 281 - 281
  • [40] Enhancing Data Quality by Cleaning Inconsistent Big RDF Data
    Benbernou, Salima
    Ouziri, Mourad
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 74 - 79