Supporting the Design of Machine Learning Workflows with a Recommendation System

Cited by: 13
Authors
Jannach, Dietmar [1]
Jugovac, Michael [1]
Lerche, Lukas [1]
Affiliations
[1] TU Dortmund, Dept Comp Sci, Dortmund, Germany
Keywords
Data analysis workflows; RapidMiner; visual process modeling
DOI
10.1145/2852082
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Machine learning and data analytics tasks in practice require several consecutive processing steps. RapidMiner is a widely used software tool for the development and execution of such analytics workflows. Unlike many other algorithm toolkits, it comprises a visual editor that allows the user to design processes on a conceptual level. This conceptual and visual approach helps the user to abstract from the technical details during the development phase and to retain a focus on the core modeling task. The large set of preimplemented data analysis and machine learning operations available in the tool, as well as their logical dependencies, can, however, be overwhelming in particular for novice users. In this work, we present an add-on to the RapidMiner framework that supports the user during the modeling phase by recommending additional operations to insert into the currently developed machine learning workflow. First, we propose different recommendation techniques and evaluate them in an offline setting using a pool of several thousand existing workflows. Second, we present the results of a laboratory study, which show that our tool helps users to significantly increase the efficiency of the modeling process. Finally, we report on analyses using data that were collected during the real-world deployment of the plug-in component and compare the results of the live deployment of the tool with the results obtained through an offline analysis and a replay simulation.
Pages: 35
Related papers
50 in total
  • [1] Diagnosis Recommendation Using Machine Learning Scientific Workflows
    Ahmed, Ishtiaq
    Lu, Shiyong
    Bai, Changxin
    Bhuyan, Fahima Amin
    [J]. 2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 82 - 90
  • [2] Machine Learning Approach for the Design of an Assessment Outcomes Recommendation System
    Al-Zahra, Fatime
    Mounir, Shaimaa
    Dalbah, Lamees
    Abu Zitar, Raed
    [J]. 2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 80 - 86
  • [3] MACHINE LEARNING BASED RECOMMENDATION SYSTEM
    Ganguli, Subhankar
    Thakur, Sanjeev
    [J]. PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, 2020, : 660 - 664
  • [4] Provenance- and machine learning-based recommendation of parameter values in scientific workflows
    Junior D.S.
    Pacitti E.
    Paes A.
    de Oliveira D.
    [J]. PeerJ Computer Science, 2021, 7 : 1 - 46
  • [5] Self-service workflows for recommendation systems using online machine learning services
    Ng, Bryan
    [J]. 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS' 2018), 2018,
  • [6] Provenance- and machine learning-based recommendation of parameter values in scientific workflows
    Junior, Daniel Silva
    Pacitti, Esther
    Paes, Aline
    de Oliveira, Daniel
    [J]. PEERJ COMPUTER SCIENCE, 2021, 7
  • [7] Machine learning driven course recommendation system
    Lazarevic, Sara
    Zuvela, Tamara
    Djordjevic, Sofija
    Sladojevic, Srdjan
    Arsenovic, Marko
    [J]. 2022 21ST INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2022,
  • [8] Personalized medical recommendation system with machine learning
    Basma M. Hassan
    Shahd Mohamed Elagamy
    [J]. Neural Computing and Applications, 2025, 37 (9) : 6431 - 6447
  • [9] Machine Learning Based Recommendation System: A Review
    Sharda, Shreya
    Josan, Gurpreet S.
    [J]. INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (02) : 134 - 144
  • [10] Integrating a Machine Learning System Into Clinical Workflows: Qualitative Study
    Sandhu, Sahil
    Lin, Anthony L.
    Brajer, Nathan
    Sperling, Jessica
    Ratliff, William
    Bedoya, Armando D.
    Balu, Suresh
    O'Brien, Cara
    Sendak, Mark P.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (11)