Design of Data Standardization Cleaning System Under Multi-source Data Access

被引:1
|
作者
Li, Bo [1 ]
Zhao, Ruifeng [2 ]
Chen, Fengchao [3 ]
Zhang, Bo [1 ]
Zhou, Lide [3 ]
He, Yipeng [3 ]
Lu, Chengbo [3 ]
机构
[1] South China Univ Technol, Sch Elect Power, Guangzhou 510640, Guangdong, Peoples R China
[2] Power Dispatching & Control Ctr Guangdong Grid Co, Guangzhou 510600, Guangdong, Peoples R China
[3] Guangdong Power Grid Corp, Dongguan Power Supply Bur, Dongguan 523008, Guangdong, Peoples R China
关键词
Multi source data; Multi task optimization; Data cleaning; Massive data;
D O I
10.1007/978-3-030-99581-2_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the poor initial clustering ability of data, the processing time of data standardization cleaning system for multi-source data is increasing. To solve this problem, the data standardization cleaning system with multi-source data access is designed. According to the characteristics of multi-source data, the preliminary clustering module is set to complete the data preprocessing. The data similarity is calculated to determine whether the data to be processed or not need to be cleaned. The traditional system data cleaning technology is used to process the data to complete the multi-source data cleaning. So far, the design of data standardization cleaning system under multi-source data access has been completed. The experimental results show that the speed of data missing value processing, the effect of data screening and data standardization processing are better, and the comprehensive performance of system data cleaning is better. Therefore, this system is more suitable for multi-source data processing.
引用
收藏
页码:59 / 67
页数:9
相关论文
共 50 条
  • [21] Multi-source Heterogeneous Data Fusion
    Zhang, Lili
    Xie, Yuxiang
    Luan Xidao
    Zhang, Xin
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD), 2018, : 47 - 51
  • [22] Bayesian analysis of multi-source data
    Bhat, PC
    Prosper, HB
    Snyder, SS
    PHYSICS LETTERS B, 1997, 407 (01) : 73 - 78
  • [23] Learning from multi-source data
    Fromont, E
    Cordier, MO
    Quiniou, R
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS, 2004, 3202 : 503 - 505
  • [24] A framework for multi-source data fusion
    Yager, RR
    INFORMATION SCIENCES, 2004, 163 (1-3) : 175 - 200
  • [25] Research on Data Fusion Method of Multi-source Complex System
    Cai, Yuxiang
    JOURNAL OF WEB ENGINEERING, 2021, 20 (05): : 1553 - 1571
  • [26] Intelligent Visualization System for Big Multi-source Medical Data Based on Data Lake
    Ren, Peng
    Mao, Ziyun
    Li, Shuaibo
    Xiao, Yang
    Ke, Yating
    Yao, Lanyu
    Lan, Hao
    Li, Xin
    Sheng, Ming
    Zhang, Yong
    WEB INFORMATION SYSTEMS AND APPLICATIONS (WISA 2021), 2021, 12999 : 706 - 717
  • [27] Evaluating Rural Access Index across China with Multi-source Open Data
    Liu Y.
    Li W.
    Zhang X.
    Zhang Y.
    Li R.
    Zhou Q.
    Journal of Geo-Information Science, 2023, 25 (04) : 783 - 793
  • [28] Improving access to multimedia using multi-source hierarchical meta-data
    Martin, TP
    Shen, Y
    ADAPTIVE MULTIMEDIA RETRIEVAL: USER, CONTEXT, AND FEEDBACK, 2006, 3877 : 266 - 278
  • [29] The Design of a New Upper Limb Rehabilitation Robot System Based on Multi-Source Data Fusion
    Meng, Fancheng
    Dai, Yaping
    Jin, Ying
    Wang, Yanni
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 3840 - 3845
  • [30] Customized Bus Line Design Model Based on Multi-Source Data
    Chen, Xi
    Wang, Yinhai
    Ma, Xiaolei
    INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2018: TRAFFIC AND FREIGHT OPERATIONS AND RAIL AND PUBLIC TRANSIT, 2018, : 218 - 228