Design of Data Standardization Cleaning System Under Multi-source Data Access

被引:1
|
作者
Li, Bo [1 ]
Zhao, Ruifeng [2 ]
Chen, Fengchao [3 ]
Zhang, Bo [1 ]
Zhou, Lide [3 ]
He, Yipeng [3 ]
Lu, Chengbo [3 ]
机构
[1] South China Univ Technol, Sch Elect Power, Guangzhou 510640, Guangdong, Peoples R China
[2] Power Dispatching & Control Ctr Guangdong Grid Co, Guangzhou 510600, Guangdong, Peoples R China
[3] Guangdong Power Grid Corp, Dongguan Power Supply Bur, Dongguan 523008, Guangdong, Peoples R China
关键词
Multi source data; Multi task optimization; Data cleaning; Massive data;
D O I
10.1007/978-3-030-99581-2_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the poor initial clustering ability of data, the processing time of data standardization cleaning system for multi-source data is increasing. To solve this problem, the data standardization cleaning system with multi-source data access is designed. According to the characteristics of multi-source data, the preliminary clustering module is set to complete the data preprocessing. The data similarity is calculated to determine whether the data to be processed or not need to be cleaned. The traditional system data cleaning technology is used to process the data to complete the multi-source data cleaning. So far, the design of data standardization cleaning system under multi-source data access has been completed. The experimental results show that the speed of data missing value processing, the effect of data screening and data standardization processing are better, and the comprehensive performance of system data cleaning is better. Therefore, this system is more suitable for multi-source data processing.
引用
收藏
页码:59 / 67
页数:9
相关论文
共 50 条
  • [41] A DYNAMIC CLOUD BAYES NETWORK-BASED CLEANING METHOD OF MULTI-SOURCE UNSTRUCTURED DATA
    Yin Chao
    Liao Xinian
    Li Xiaobin
    PROCEEDINGS OF ASME 2022 17TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE, MSEC2022, VOL 2, 2022,
  • [42] THREE-DIMENSIONAL INTEGRATED SYSTEM FOR MULTI-SOURCE HETEROGENEOUS DATA
    Ding, Ling
    Li, Hongyi
    Hu, Changmiao
    Liu, Wenlong
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 4336 - 4339
  • [43] Measuring accessibility of bus system based on multi-source traffic data
    Zuo, Yufan
    Liu, Zhiyuan
    Fu, Xiao
    GEO-SPATIAL INFORMATION SCIENCE, 2020, 23 (03) : 248 - 257
  • [44] A Multi-source Heterogeneous Data Storage and Retrieval System for Intelligent Manufacturing
    Kong, Yaning
    Li, Dongmei
    Li, Chunshan
    Chu, Dianhui
    Yao, Zekun
    2021 IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE 2021), 2021, : 82 - 87
  • [45] Editorial deep multi-source data analysis
    Zhang, Shichao
    Xie, Qing
    Guo, Yanrong
    PATTERN RECOGNITION LETTERS, 2021, 151 : 1 - 2
  • [46] Multi-source Data Hiding in Neural Networks
    Yang, Ziyun
    Wang, Zichi
    Zhang, Xinpeng
    Tang, Zhenjun
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [47] Multi-Source Data Fusion Study in Scientometrics
    Xu, Hai-Yun
    Wang, Chao
    Pang, Hong-shen
    Ru, Li-jie
    Fang, Shu
    QUALITATIVE & QUANTITATIVE METHODS IN LIBRARIES, 2016, : 611 - 626
  • [48] A Multi-source data Face Recognition Algorithm
    Ye Jihua
    Xia Guomiao
    Hu Dan
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 1015 - 1018
  • [49] A TRNG Exploiting Multi-Source Physical Data
    Gaglio, Vincenzo
    De Paola, Alessandra
    Ortolani, Marco
    Lo Re, Giuseppe
    Q2SWINET 2010: PROCEEDINGS OF THE SIXTH ACM SYMPOSIUM ON QOS AND SECURITY FOR WIRELESS AND MOBILE NETWORKS, 2010, : 82 - 89
  • [50] Study on Traffic Multi-Source Data Fusion
    Liu, Suping
    Zhang, Dongbo
    Li, Jialin
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2019, 13 (02) : 63 - 75