Design of Data Standardization Cleaning System Under Multi-source Data Access

Cited by: 1
Authors
Li, Bo [1]
Zhao, Ruifeng [2]
Chen, Fengchao [3]
Zhang, Bo [1]
Zhou, Lide [3]
He, Yipeng [3]
Lu, Chengbo [3]
Affiliations
[1] South China Univ Technol, Sch Elect Power, Guangzhou 510640, Guangdong, Peoples R China
[2] Power Dispatching & Control Ctr Guangdong Grid Co, Guangzhou 510600, Guangdong, Peoples R China
[3] Guangdong Power Grid Corp, Dongguan Power Supply Bur, Dongguan 523008, Guangdong, Peoples R China
Keywords
Multi source data; Multi task optimization; Data cleaning; Massive data
DOI
10.1007/978-3-030-99581-2_7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Because the initial clustering of the data is poor, the processing time of data standardization cleaning systems for multi-source data keeps increasing. To address this problem, a data standardization cleaning system for multi-source data access is designed. Based on the characteristics of multi-source data, a preliminary clustering module is set up to preprocess the data. Data similarity is then calculated to determine whether the data to be processed need cleaning. Conventional system data cleaning techniques are applied to the data to complete the multi-source data cleaning. This completes the design of the data standardization cleaning system under multi-source data access. The experimental results show that the system performs better in the speed of missing-value processing, in data screening, and in data standardization, and that its overall data cleaning performance is better. The system is therefore better suited to multi-source data processing.
Pages: 59-67
Number of pages: 9
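The similarity check described in the abstract can be illustrated with a minimal sketch. The abstract does not name the similarity measure, the clustering method, or any threshold, so everything here is an assumption chosen for illustration: records are numeric feature vectors that already belong to one preliminary cluster, similarity is cosine similarity to the cluster centroid, and the hypothetical function `flag_for_cleaning` marks a record as needing cleaning when its similarity falls below a hypothetical `threshold`.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def centroid(records: Sequence[Sequence[float]]) -> List[float]:
    """Component-wise mean of the records in one preliminary cluster."""
    n = len(records)
    return [sum(column) / n for column in zip(*records)]


def flag_for_cleaning(
    records: Sequence[Sequence[float]], threshold: float = 0.8
) -> List[bool]:
    """Return True for each record whose similarity to the centroid is below threshold.

    Assumption: low similarity to the cluster centroid means the record needs cleaning.
    The threshold value is illustrative, not taken from the paper.
    """
    center = centroid(records)
    return [cosine_similarity(record, center) < threshold for record in records]


if __name__ == "__main__":
    # Hypothetical records from one cluster; the last one points in a different direction.
    cluster = [[1.0, 2.0], [2.0, 4.1], [1.5, 3.0], [5.0, -1.0]]
    print(flag_for_cleaning(cluster, threshold=0.8))
```

In this sketch only records flagged `True` would be passed on to the cleaning stage, which is one way the similarity step could reduce the amount of data the cleaning stage has to process.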