DiffML: End-to-end Differentiable ML Pipelines

被引:3
|
作者
Hilprecht, Benjamin [1 ]
Hammacher, Christian [2 ]
Reis, Eduardo [1 ]
Abdelaal, Mohamed
Binnig, Carsten [1 ]
机构
[1] Tech Univ Darmstadt, Darmstadt, Germany
[2] Software AG, Mainz, Germany
关键词
data engineering; differentiable ML pipelines; data cleaning;
D O I
10.1145/3595360.3595857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present our vision of differentiable ML pipelines called DiffML that truly allows to automate the construction of ML pipelines in an end-to-end fashion. DiffML allows to jointly train not just the ML model itself but also the entire pipeline including data engineering steps, e.g., data cleaning, data augmentation, etc. Our core idea is to formulate all steps in a differentiable way such that the entire pipeline can be trained using backpropagation. However, this is a non-trivial problem and opens up many new research questions. To show the feasibility of this direction, we demonstrate initial ideas and a general principle of how typical data engineering steps can be formulated as differentiable programs and jointly learned with the ML model. Moreover, we discuss a research roadmap and core challenges that have to be systematically tackled to enable fully differentiable ML pipelines.
引用
下载
收藏
页数:7
相关论文
共 50 条
  • [1] End-to-End Differentiable Proving
    Rocktaschel, Tim
    Riedel, Sebastian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [2] End-to-End Differentiable Physics for Learning and Control
    Belbute-Peres, Filipe de A.
    Smith, Kevin A.
    Allen, Kelsey R.
    Tenenbaum, Joshua B.
    Kolter, J. Zico
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] End-to-End Differentiable Learning of Protein Structure
    AlQuraishi, Mohammed
    CELL SYSTEMS, 2019, 8 (04) : 292 - +
  • [4] Differentiable MPC for End-to-end Planning and Control
    Amos, Brandon
    Rodriguez, Ivan Dario Jimenez
    Sacks, Jacob
    Boots, Byron
    Kolter, J. Zico
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [5] End-to-End Differentiable Adversarial Imitation Learning
    Baram, Nir
    Anschel, Oron
    Caspi, Itai
    Mannor, Shie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [6] End-to-End Differentiable GANs for Text Generation
    Kumar, Sachin
    Tsvetkov, Yulia
    NEURIPS WORKSHOPS, 2020, 2020, 137 : 118 - 128
  • [7] Differentiable Product Quantization for End-to-End Embedding Compression
    Chen, Ting
    Li, Lala
    Sun, Yizhou
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [8] Differentiable Product Quantization for End-to-End Embedding Compression
    Chen, Ting
    Li, Lala
    Sun, Yizhou
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [9] End-to-End Trustworthy ML for Multidomain Operations
    Karim, Hassan
    Rawat, Danda B.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [10] DRUM: End-To-End Differentiable Rule Mining On Knowledge Graphs
    Sadeghian, Ali
    Armandpour, Mohammadreza
    Ding, Patrick
    Wang, Daisy Zhe
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32