Self-Service Data Science in Healthcare with Automated Machine Learning

被引:5
|
作者
Ooms, Richard [1 ]
Spruit, Marco [1 ]
机构
[1] Univ Utrecht, Dept Informat & Comp Sci, NL-3512 JE Utrecht, Netherlands
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 09期
关键词
automated machine learning; applied data science; self-service data science; healthcare analytics; SYSTEMS;
D O I
10.3390/app10092992
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
(1) Background: This work investigates whether and how researcher-physicians can be supported in their knowledge discovery process by employing Automated Machine Learning (AutoML). (2) Methods: We take a design science research approach and select the Tree-based Pipeline Optimization Tool (TPOT) as the AutoML method based on a benchmark test and requirements from researcher-physicians. We then integrate TPOT into two artefacts: a web application and a notebook. We evaluate these artefacts with researcher-physicians to examine which approach suits researcher-physicians best. Both artefacts have a similar workflow, but different user interfaces because of a conflict in requirements. (3) Results: Artefact A, a web application, was perceived as better for uploading a dataset and comparing results. Artefact B, a Jupyter notebook, was perceived as better regarding the workflow and being in control of model construction. (4) Conclusions: Thus, a hybrid artefact would be best for researcher-physicians. However, both artefacts missed model explainability and an explanation of variable importance for their created models. Hence, deployment of AutoML technologies in healthcare remains currently limited to the exploratory data analysis phase.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Self-service Data Science for Healthcare Professionals: A Data Preparation Approach
    Spruit, Marco
    Dedding, Thomas
    Vijlbrief, Daniel
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 5: HEALTHINF, 2020, : 724 - 734
  • [2] Interactive Visual Self-service Data Classification Approach to Democratize Machine Learning
    Wagle, Sridevi Narayana
    Kovalerchuk, Boris
    [J]. 2020 24TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV 2020), 2020, : 280 - 285
  • [3] Automated self-service modeling: predictive analytics as a service
    Don Kridel
    Daniel Dolk
    [J]. Information Systems and e-Business Management, 2013, 11 : 119 - 140
  • [4] Automated self-service modeling: predictive analytics as a service
    Kridel, Don
    Dolk, Daniel
    [J]. INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2013, 11 (01) : 119 - 140
  • [5] Support for Self-service Automated Parking Systems
    Baranovski, Igor
    Stankovski, Stevan
    Ostojic, Gordana
    Horvat, Sabolo
    [J]. 2020 19TH INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2020,
  • [6] @An automated self-service multi-cloud HPC platform applied to the simulation of cardiac valve disease with machine learning
    Gentzsch, Wolfgang
    Gruber, Daniel
    Dabiri, Yaghoub
    Guccione, Julius M.
    Kassab, Ghassan S.
    [J]. 2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 744 - 746
  • [7] Self-service workflows for recommendation systems using online machine learning services
    Ng, Bryan
    [J]. 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS' 2018), 2018,
  • [8] Modeling Self-Service Machine-Learning Agents for Distributed Stream Processing
    Zehnder, Philipp
    Riemer, Dominik
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2203 - 2212
  • [9] Comprehensive efficiency analysis of logistics industry based on machine learning and self-service data envelopment analysis model
    Man, Huang
    Jie, Lian
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (04) : 6913 - 6924
  • [10] Interpretable Machine Learning for Self-Service High-Risk Decision-Making
    Recaido, Charles
    Kovalerchuk, Boris
    [J]. 2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 322 - 329