Research on automatic cleaning algorithm of multi-dimensional network redundant data based on big data

被引:0
|
作者
Jie Fang
机构
[1] Hefei Normal University,School of Computer Science and Technology
[2] Southeast University,School of Cyber Science and Engineering
来源
Evolutionary Intelligence | 2022年 / 15卷
关键词
Network redundant data; Big data; Multi-dimensional; Cleaning algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
In order to realize the research on network redundant data cleaning based on big data, this paper designs a set of redundant data cleaning framework according to the data processing flow before data analysis. According to the spatial correlation of redundant data, a method of data cleaning is designed. In the data cleaning method, appropriate cleaning algorithms are designed for abnormal data and missing data respectively, in which mathematical probability design is applied to abnormal data to delete the data with obvious deviation from the normal data value. The spatial model and algorithm are designed by applying spatial correlation to the missing data to fill the missing data value after the redundant data is cleaned by other steps in the method. The accuracy of the model is compared with that of the common data prediction algorithm, and the accuracy between the algorithm and the redundant data set is verified.
引用
收藏
页码:2609 / 2617
页数:8
相关论文
共 50 条
  • [41] Research on financial network big data processing technology based on fireworks algorithm
    Luo, Tao
    [J]. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2019, 2019 (1)
  • [42] Research on financial network big data processing technology based on fireworks algorithm
    Tao Luo
    [J]. EURASIP Journal on Wireless Communications and Networking, 2019
  • [43] Automatic clustering of multi-dimensional data (ACMD) applied to hyperspectral images
    Shulman, DS
    Roth, I
    Rotman, SR
    [J]. ELECTRO-OPTICAL AND INFRARED SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2004, 5612 : 117 - 125
  • [44] A data-agnostic approach to automatic testing of multi-dimensional databases
    Marin, Marius
    [J]. 2014 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST), 2014, : 133 - 142
  • [45] Research on Data Cleaning Method Based on SNM Algorithm
    Zhang, Ningning
    Guo, Aizhang
    Sun, Tao
    [J]. 2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2017, : 2639 - 2643
  • [46] Correlation network analysis for multi-dimensional data in stocks market
    Kazemilari, Mansooreh
    Djauhari, Maman Abdurachman
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2015, 429 : 62 - 75
  • [47] Similarity Search Problem Research on Multi-dimensional Data Sets
    Shi, Yong
    Graham, Brian
    [J]. PROCEEDINGS OF THE 2013 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, 2013, : 573 - 577
  • [48] Research of Decision Support Technologies in Multi-Dimensional Data Environment
    Qiu Ze-Guo
    [J]. GREEN POWER, MATERIALS AND MANUFACTURING TECHNOLOGY AND APPLICATIONS II, 2012, 214 : 601 - 604
  • [49] Workload-Based Ordering of Multi-Dimensional Data
    Yang, Shengxun
    He, Zhen
    Chen, Yi-Ping Phoebe
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (03) : 831 - 844
  • [50] Clustering-based histograms for multi-dimensional data
    Furfaro, F
    Mazzeo, GM
    Sirangelo, C
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2005, 3589 : 478 - 487