BIG DATA, BIG DATA QUALITY PROBLEM

被引:0
|
作者
Becker, David [1 ]
McMullen, Bill [1 ]
King, Trish Dunn [1 ]
机构
[1] Mitre Corp, Dayton, OH 45431 USA
关键词
Big Data; Data Quality; Returns to Scale;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A USAF sponsored MITRE research team undertook four separate, domain-specific case studies about Big Data applications. Those case studies were initial investigations into the question of whether or not data quality issues encountered in Big Data collections are substantially different in cause, manifestation, or detection than those data quality issues encountered in more traditionally sized data collections. The study addresses several factors affecting Big Data Quality at multiple levels, including collection, processing, and storage. Though not unexpected, the key findings of this study reinforce that the primary factors affecting Big Data reside in the limitations and complexities involved with handling Big Data while maintaining its integrity. These concerns are of a higher magnitude than the provenance of the data, the processing, and the tools used to prepare, manipulate, and store the data. Data quality is extremely important for all data analytics problems. From the study's findings, the " truth about Big Data" is there are no fundamentally new DQ issues in Big Data analytics projects. Some DQ issues exhibit return-s-to-scale effects, and become more or less pronounced in Big Data analytics, though. Big Data Quality varies from one type of Big Data to another and from one Big Data technology to another.
引用
收藏
页码:2644 / 2653
页数:10
相关论文
共 50 条
  • [1] Big Data Quality - Whose problem is it?
    Sadiq, Shazia
    Papotti, Paolo
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1446 - 1447
  • [2] Big problem, big data, big solution?
    不详
    [J]. CHEMISTRY & INDUSTRY, 2015, 79 (09) : 59 - 59
  • [3] Big Data. Big Problem?
    Harrop, Clare
    Dallman, Aaron R.
    Boyd, Brian A.
    [J]. AUTISM RESEARCH, 2021, 14 (02) : 238 - 239
  • [4] Big Data or Big (Privacy) Problem?
    Scotti, Veronica
    [J]. IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2017, 20 (05) : 23 - 26
  • [5] MEDICAL BIG DATA AND BIG DATA QUALITY PROBLEMS
    Hoffman, Sharona
    [J]. CONNECTICUT INSURANCE LAW JOURNAL, 2014, 21 (01): : 289 - 316
  • [6] From Data Quality to Big Data Quality
    Batini, Carlo
    Rula, Anisa
    Scannapieco, Monica
    Viscusi, Gianluigi
    [J]. JOURNAL OF DATABASE MANAGEMENT, 2015, 26 (01) : 60 - 82
  • [7] Big Data and Data Quality Dimensions
    Rambli, Yanty Rahayu
    Shahibi, Mohd Sazili
    Ibrahim, Zaharudin
    Ismail, Mohd Nasir
    [J]. INNOVATION MANAGEMENT AND EDUCATION EXCELLENCE THROUGH VISION 2020, VOLS I -XI, 2018, : 6959 - 6964
  • [8] Data Quality Issues in Big Data
    Rao, Dhana
    Gudivada, Venkat N.
    Raghavan, Vijay V.
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2654 - 2660
  • [9] Is Big Data a Transient Problem?
    Lin, Jimmy
    [J]. IEEE INTERNET COMPUTING, 2015, 19 (05) : 86 - 90
  • [10] Big data (Big data)
    Miguel Castagnino, Juan
    [J]. ACTA BIOQUIMICA CLINICA LATINOAMERICANA, 2018, 52 (03): : 279 - 280