Data Preparation for Data Mining in Chemical Plants using Big Data

被引:0
|
作者
Borrison, Reuben [1 ]
Kloepper, Benjamin [1 ]
Mullen, Jennifer [1 ]
机构
[1] ABB Corp, Res Ctr, Ladenburg, Germany
关键词
Data quality; Soft sensors; Big data; SOFT SENSORS; MISSING DATA; MODEL; PLS;
D O I
10.1109/indin41052.2019.8972078
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data preparation for data mining in industrial applications is a key success factor which requires considerable repeated efforts. Although the required activities need to be repeated in very similar fashion across many projects, details of their implementation differ and require both application understanding and experience. As a result, data preparation is done by data mining experts with a strong domain background and a good understanding of the characteristics of the data to be analyzed. Experts with these profiles usually have an engineering background and no strong expertise in distributed programming or big data technology. Unfortunately, the amount of data can be so large that distributed algorithms are required to allow for inspection of results and iteration of preparation steps. This contribution introduces an interactive data preparation workflow for signal data from chemical plants enabling domain experts without background in distributed computing and extensive programming experience to leverage the power of big data technologies.
引用
收藏
页码:1185 / 1191
页数:7
相关论文
共 50 条
  • [1] Mining "Big Data" using Big Data Services
    Reips, Ulf-Dietrich
    Matzat, Uwe
    [J]. INTERNATIONAL JOURNAL OF INTERNET SCIENCE, 2014, 9 (01) : 1 - 8
  • [2] Data Mining with Big Data
    Wu, Xindong
    Zhu, Xingquan
    Wu, Gong-Qing
    Ding, Wei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 97 - 107
  • [3] Data Mining with Big Data
    Sowmya, R.
    Suneetha, K. R.
    [J]. PROCEEDINGS OF 2017 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO 2017), 2017, : 246 - 250
  • [4] Reusable Big Data System for Industrial Data Mining - A Case Study on Anomaly Detection in Chemical Plants
    Borrison, Reuben
    Kloepper, Benjamin
    Chioua, Moncef
    Dix, Marcel
    Sprick, Barbara
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 611 - 622
  • [5] Analysis of agriculture data using data mining techniques: application of big data
    Majumdar J.
    Naraseeyappa S.
    Ankalaki S.
    [J]. Journal of Big Data, 4 (1)
  • [6] Data field for mining big data
    Wang, Shuliang
    Li, Ying
    Wang, Dakui
    [J]. GEO-SPATIAL INFORMATION SCIENCE, 2016, 19 (02) : 106 - 118
  • [7] A Big Data Framework for Mining Sensor Data Using Hadoop
    El-Shafeiy, Engy A.
    El-Desouky, Ali I.
    [J]. STUDIES IN INFORMATICS AND CONTROL, 2017, 26 (03): : 365 - 376
  • [8] Big Data Analytics Using Data Mining Techniques: A Survey
    Mittal, Shweta
    Sangwan, Om Prakash
    [J]. ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2018, PT I, 2019, 955 : 264 - 273
  • [9] Framework for Data Mining of Big Data Using Probabilistic Grammars
    Algwaiz, Aljoharah
    Ammar, Reda
    Rajasekaran, Sanguthevar
    [J]. PROCEEDINGS 2015 FIFTH INTERNATIONAL CONFERENCE ON E-LEARNING (ECONF 2015), 2015, : 241 - 246
  • [10] A review of data mining using big data in health informatics
    Herland M.
    Khoshgoftaar T.M.
    Wald R.
    [J]. Journal Of Big Data, 1 (1)