From Data Quality to Big Data Quality

Cited by: 67
Authors
Batini, Carlo [1]
Rula, Anisa [1]
Scannapieco, Monica [2]
Viscusi, Gianluigi [3]
Affiliations
[1] Univ Milano Bicocca, Dept Informat Syst & Commun DISCo, Milan, Italy
[2] Italian Natl Inst Stat Istat, Rome, Italy
[3] Ecole Polytech Fed Lausanne, CDM MTEI CSI, Lausanne, Switzerland
Keywords
Data Quality; Big Data; Linked Open Data; Maps; Semi-Structured Texts; Sensor
DOI
10.4018/JDM.2015010103
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This article investigates the evolution of data quality issues from traditional structured data managed in relational databases to Big Data. In particular, the paper examines the nature of the relationship between Data Quality and several research coordinates that are relevant in Big Data, such as the variety of data types, data sources, and application domains, focusing on maps, semi-structured texts, linked open data, sensors and sensor networks, and official statistics. Consequently, a set of structural characteristics is identified, and a systematization of the a posteriori correlation between them and quality dimensions is provided. Finally, Big Data quality issues are considered in a conceptual framework suitable for mapping the evolution of the quality paradigm according to three core coordinates that are significant in the context of the Big Data phenomenon: the data type considered, the source of data, and the application domain. Thus, through an integrative and theoretical literature review, the framework makes it possible to ascertain the relevant changes in data quality emerging with the Big Data phenomenon.
Pages: 60-82
Number of pages: 23