Data Preparation for Data Mining in Chemical Plants using Big Data

Authors
Borrison, Reuben [1 ]
Kloepper, Benjamin [1 ]
Mullen, Jennifer [1 ]
Affiliations
[1] ABB Corp, Res Ctr, Ladenburg, Germany
Keywords
Data quality; Soft sensors; Big data; Missing data; Model; PLS
DOI
10.1109/indin41052.2019.8972078
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Data preparation for data mining in industrial applications is a key success factor that requires considerable, repeated effort. Although the required activities are repeated in a very similar fashion across many projects, the details of their implementation differ and require both application understanding and experience. As a result, data preparation is done by data mining experts with a strong domain background and a good understanding of the characteristics of the data to be analyzed. Experts with these profiles usually have an engineering background and no strong expertise in distributed programming or big data technology. Unfortunately, the amount of data can be so large that distributed algorithms are required to allow for inspection of results and iteration of preparation steps. This contribution introduces an interactive data preparation workflow for signal data from chemical plants, enabling domain experts without a background in distributed computing or extensive programming experience to leverage the power of big data technologies.
Pages: 1185-1191
Page count: 7