Modeling Self-Service Machine-Learning Agents for Distributed Stream Processing

被引:0
|
作者
Zehnder, Philipp [1 ]
Riemer, Dominik [1 ]
机构
[1] FZI Res Ctr Informat Technol, Karlsruhe, Germany
关键词
self-service stream processing; machine learning; distributed systems;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many emerging application areas such as the Internet of Things rely on the usage of Machine Learning (ML) techniques that operate on continuous streaming data in order to automatically generate added business value. However, the integration of ML components into stream processing programs is still a challenging task that requires both technical expertise (e.g., selection of algorithms and parameters) and domain knowledge. Therefore, this paper introduces a novel development methodology and tool support involving multiple roles and tasks that aims to enable domain experts to leverage from ML algorithms within self-service stream processing pipelines. At its core, we introduce the concept of a reusable Machine Learning Pipeline Agent (MLPA). Such an MLPA can be defined by using a developed Domain-Specific Language (DSL) consisting of reusable building blocks. Afterwards, MLPAs can be used by domain experts using a graphical user interface. Our approach is based on distributed, scalable ML algorithms that are automatically deployed and contributes newly developed algorithms such as a scalable version for neural networks operating on data streams. In our evaluation, we show that the developed concepts and algorithms do scale in a distributed setting with an increasing number of nodes. Therefore, our solution bridges the gap between domain experts and technical experts and eases the access for non-technical users to advanced streaming ML analytics.
引用
收藏
页码:2203 / 2212
页数:10
相关论文
共 50 条
  • [1] Self-Service Data Science in Healthcare with Automated Machine Learning
    Ooms, Richard
    Spruit, Marco
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [2] Self-service analytics and the processing of hydrocarbons
    Siang, Lim C.
    Elnawawi, Shams
    Steele, Darren
    [J]. DIGITAL CHEMICAL ENGINEERING, 2022, 3
  • [3] Interactive Visual Self-service Data Classification Approach to Democratize Machine Learning
    Wagle, Sridevi Narayana
    Kovalerchuk, Boris
    [J]. 2020 24TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV 2020), 2020, : 280 - 285
  • [4] Self-service workflows for recommendation systems using online machine learning services
    Ng, Bryan
    [J]. 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS' 2018), 2018,
  • [5] Automated self-service modeling: predictive analytics as a service
    Don Kridel
    Daniel Dolk
    [J]. Information Systems and e-Business Management, 2013, 11 : 119 - 140
  • [6] Automated self-service modeling: predictive analytics as a service
    Kridel, Don
    Dolk, Daniel
    [J]. INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2013, 11 (01) : 119 - 140
  • [7] Modeling the habit of self-service technology usage
    Wang, Cheng
    Harris, Jennifer
    Patterson, Paul G.
    [J]. AUSTRALIAN JOURNAL OF MANAGEMENT, 2017, 42 (03) : 462 - 481
  • [8] Interpretable Machine Learning for Self-Service High-Risk Decision-Making
    Recaido, Charles
    Kovalerchuk, Boris
    [J]. 2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 322 - 329
  • [9] A machine-learning based load prediction approach for distributed service-oriented applications
    Wang, Jun
    Ren, Yi
    Zheng, Di
    Wu, Quan-Yuan
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 1, PROCEEDINGS, 2007, 4487 : 462 - +
  • [10] Machine-learning methods for stream water temperature prediction
    Feigl, Moritz
    Lebiedzinski, Katharina
    Herrnegger, Mathew
    Schulz, Karsten
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2021, 25 (05) : 2951 - 2977