Big Data Cleaning

被引:0
|
作者
Tang, Nan [1 ]
机构
[1] Qatar Comp Res Inst, Doha, Qatar
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data cleaning is, in fact, a lively subject that has played an important part in the history of data management and data analytics, and it still is undergoing rapid development. Moreover, data cleaning is considered as a main challenge in the era of big data, due to the increasing volume, velocity and variety of data in many applications. This paper aims to provide an overview of recent work in different aspects of data cleaning: error detection methods, data repairing algorithms, and a generalized data cleaning system. It also includes some discussion about our efforts of data cleaning methods from the perspective of big data, in terms of volume, velocity and variety.
引用
收藏
页码:13 / 24
页数:12
相关论文
共 50 条
  • [1] Big RDF Data Cleaning
    Tang, Nan
    2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2015, : 77 - 79
  • [2] Research on the Technology of Data Cleaning in Big Data
    Feng, Fu-jun
    Yao, Jun-ping
    Li, Xiao-jun
    2018 2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELING AND SIMULATION (AMMS 2018), 2018, 305 : 176 - 181
  • [3] CleanCloud: Cleaning Big Data on Cloud
    Wang, Hongzhi
    Ding, Xiaoou
    Chen, Xiangying
    Li, Jianzhong
    Gao, Hong
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2543 - 2546
  • [4] Data Cleaning Mechanism for Big Data and Cloud Computing
    Rahul, Kumar
    Banyal, R. K.
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 195 - 198
  • [5] Data Cleaning Technique for Security Big Data Ecosystem
    Martinez-Mosquera, Diana
    Lujan-Mora, Sergio
    IOTBDS: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTERNET OF THINGS, BIG DATA AND SECURITY, 2017, : 380 - 385
  • [6] Big Data Cleaning Algorithms in Cloud Computing
    Feng, Zhang
    Hui-Feng, Xue
    Dong-Sheng, Xu
    Yong-Heng, Zhang
    Fei, You
    INTERNATIONAL JOURNAL OF ONLINE ENGINEERING, 2013, 9 (03) : 77 - 81
  • [7] Cleanix: a Parallel Big Data Cleaning System
    Wang, Hongzhi
    Li, Mingda
    Bu, Yingyi
    Li, Jianzhong
    Gao, Hong
    Zhang, Jiacheng
    SIGMOD RECORD, 2015, 44 (04) : 35 - 40
  • [8] Enhancing Data Quality by Cleaning Inconsistent Big RDF Data
    Benbernou, Salima
    Ouziri, Mourad
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 74 - 79
  • [9] Exploring and cleaning big data with random sample data blocks
    Salloum, Salman
    Huang, Joshua Zhexue
    He, Yulin
    JOURNAL OF BIG DATA, 2019, 6 (01)
  • [10] Exploring and cleaning big data with random sample data blocks
    Salman Salloum
    Joshua Zhexue Huang
    Yulin He
    Journal of Big Data, 6