Data Management in Machine Learning Systems

被引:0
|
作者
Boehm, Matthias [1 ]
Kumar, Arun [2 ]
Yang, Jun [3 ]
机构
[1] Graz University of Technology, Austria
[2] University of California, San Diego, United States
[3] Duke University, United States
来源
Synthesis Lectures on Data Management | 2019年 / 11卷 / 01期
关键词
Information management;
D O I
10.2200/S00895ED1V01Y201901DTM057
中图分类号
学科分类号
摘要
Large-scale data analytics using machine learning (ML) underpins many modern data-driven applications. ML systems provide means of specifying and executing these ML workloads in an efficient and scalable manner. Data management is at the heart of many ML systems due to data-driven application characteristics, data-centric workload characteristics, and system architectures inspired by classical data management techniques. In this book, we follow this data-centric view of ML systems and aim to provide a comprehensive overview of data management in ML systems for the end-to-end data science or ML lifecycle. We review multiple interconnected lines of work: (1) ML support in database (DB) systems, (2) DB-inspired ML systems, and (3) ML lifecycle systems. Covered topics include: in-database analytics via query generation and user-defined functions, factorized and statistical-relational learning; optimizing compilers for ML workloads; execution strategies and hardware accelerators; data access methods such as compression, partitioning and indexing; resource elasticity and cloud markets; as well as systems for data preparation for ML, model selection, model management, model debugging, and model serving. Given the rapidly evolving field, we strive for a balance between an up-to-date survey of ML systems, an overview of the underlying concepts and techniques, as well as pointers to open research questions. Hence, this book might serve as a starting point for both systems researchers and developers. © 2019 by Morgan & Claypool.
引用
收藏
页码:1 / 173
相关论文
共 50 条
  • [31] Machine Learning and Data Mining Applications in Power Systems
    Leonowicz, Zbigniew
    Jasinski, Michal
    ENERGIES, 2022, 15 (05)
  • [32] Streaming Machine Learning Algorithms with Big Data Systems
    Abeykoon, Vibhatha
    Kamburugamuve, Supun
    Govindrarajan, Kannan
    Wickramasinghe, Pulasthi
    Widanage, Chathura
    Perera, Niranda
    Uyar, Ahmet
    Gunduz, Gurhan
    Akkas, Selahattin
    Von Laszewski, Gregor
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 5661 - 5666
  • [33] Educational Data Mining with Learning Management Systems
    Espigares Pinazo, Manuel Jesus
    Garcia Perez, Rafael
    REVISTA ELECTRONICA DE LEEME, 2011, (27): : 1 - 16
  • [34] Implikationen von Machine Learning auf das Datenmanagement in UnternehmenImplications of Machine Learning on Data Management in Companies
    René Kessler
    Jorge Marx Gómez
    HMD Praxis der Wirtschaftsinformatik, 2020, 57 (1) : 89 - 105
  • [35] Learning nodes: machine learning-based energy and data management strategy
    Kim, Yunmin
    Lee, Tae-Jin
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2021, 2021 (01)
  • [36] Learning nodes: machine learning-based energy and data management strategy
    Yunmin Kim
    Tae-Jin Lee
    EURASIP Journal on Wireless Communications and Networking, 2021
  • [37] Data mining and machine learning in the context of disaster and crisis management
    Zagorecki, Adam T.
    Johnson, David E. A.
    Ristvej, Jozef
    INTERNATIONAL JOURNAL OF EMERGENCY MANAGEMENT, 2013, 9 (04) : 351 - 365
  • [38] Big Data and Machine Learning Driven Handover Management and Forecasting
    Vy, Le Luong
    Tung, Li-Ping
    Lin, Bao-Shuh Paul
    2017 IEEE CONFERENCE ON STANDARDS FOR COMMUNICATIONS AND NETWORKING (CSCN), 2017, : 214 - 219
  • [39] Research data management using FAIR data repository with integrated machine learning
    Tkachenko, Valery
    Sattarov, Boris
    Korotcov, Alexandru
    Zakharov, Rick
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [40] Recent Developments in Machine Learning for Energy Systems Reliability Management
    Duchesne, Laurine
    Karangelos, Efthymios
    Wehenkel, Louis
    PROCEEDINGS OF THE IEEE, 2020, 108 (09) : 1656 - 1676