Research on automatic cleaning algorithm of multi-dimensional network redundant data based on big data

被引:0
|
作者
Jie Fang
机构
[1] Hefei Normal University,School of Computer Science and Technology
[2] Southeast University,School of Cyber Science and Engineering
来源
Evolutionary Intelligence | 2022年 / 15卷
关键词
Network redundant data; Big data; Multi-dimensional; Cleaning algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
In order to realize the research on network redundant data cleaning based on big data, this paper designs a set of redundant data cleaning framework according to the data processing flow before data analysis. According to the spatial correlation of redundant data, a method of data cleaning is designed. In the data cleaning method, appropriate cleaning algorithms are designed for abnormal data and missing data respectively, in which mathematical probability design is applied to abnormal data to delete the data with obvious deviation from the normal data value. The spatial model and algorithm are designed by applying spatial correlation to the missing data to fill the missing data value after the redundant data is cleaned by other steps in the method. The accuracy of the model is compared with that of the common data prediction algorithm, and the accuracy between the algorithm and the redundant data set is verified.
引用
收藏
页码:2609 / 2617
页数:8
相关论文
共 50 条
  • [1] Research on automatic cleaning algorithm of multi-dimensional network redundant data based on big data
    Fang, Jie
    [J]. EVOLUTIONARY INTELLIGENCE, 2022, 15 (04) : 2609 - 2617
  • [2] Automatic Generation of Communications for Redundant Multi-dimensional Data Parallel Redistributions
    Ancourt, Corinne
    Petrisor, Teodora
    Irigoin, Francois
    Lenormand, Eric
    [J]. 2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 800 - 811
  • [3] The multi-dimensional power big data mining based on improved grey clustering algorithm
    Li, Hui
    Lu, Guangqian
    [J]. WEB INTELLIGENCE, 2023, 21 (02) : 203 - 210
  • [4] Research on multi-dimensional data compression algorithm for cluster-based routing in wireless sensor network
    Shenyang Institute of Computing Technology, Chinese Academy of Sciences, Shenyang 110171, China
    不详
    不详
    [J]. Tien Tzu Hsueh Pao, 2009, 5 (1109-1114): : 1109 - 1114
  • [5] BBoxDB - A Scalable Data Store for Multi-Dimensional Big Data
    Nidzwetzki, Jan Kristof
    Gueting, Ralf Hartmut
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1867 - 1870
  • [6] Research on the Massive Redundant Data Mining Algorithm based on Kernel Clustering and Data Cleaning Technology
    Mao, YaoFeng
    [J]. 2015 2ND INTERNATIONAL SYMPOSIUM ON ENGINEERING TECHNOLOGY, EDUCATION AND MANAGEMENT (ISETEM 2015), 2015, : 112 - 117
  • [7] A novel regression mining algorithm based on multi-dimensional data
    Tang, Zhihang
    Li, Rongjun
    [J]. Journal of Computational Information Systems, 2010, 6 (05): : 1459 - 1465
  • [8] Research on Multi-Dimensional Dynamic Clustering Method of Big Data Alliance Users
    Wang, Xiaoxiao
    Zhai, Lili
    Hu, Yanling
    Zhang, Shuchen
    [J]. PROCEEDINGS OF NINETEENTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, 2020, : 25 - 26
  • [9] Collection and analysis of multi-dimensional network data for opportunistic networking research
    Hossmann, Theus
    Nomikos, George
    Spyropoulos, Thrasyvoulos
    Legendre, Franck
    [J]. COMPUTER COMMUNICATIONS, 2012, 35 (13) : 1613 - 1625
  • [10] Research on the Mining of Basic Talents based on Multi-dimensional Data
    Xu, Bin
    Xu, Zipeng
    Zhang, Lu
    Zhang, Zhaowei
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7586 - 7589