Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions

被引:82
|
作者
Lwakatare, Lucy Ellen [1 ]
Raj, Aiswarya [1 ]
Crnkovic, Ivica [1 ]
Bosch, Jan [1 ]
Olsson, Helena Holmstrom [2 ]
机构
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, Horselgagen 11, S-41296 Gothenburg, Sweden
[2] Malmo Univ, Dept Comp Sci & Media Technol, Nordenskioldsgatan 1, S-21119 Malmo, Sweden
关键词
Machine learning systems; Software engineering; Industrial settings; Challenges; Solutions; SLR;
D O I
10.1016/j.infsof.2020.106368
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background : Developing and maintaining large scale machine learning (ML) based software systems in an in-dustrial setting is challenging. There are no well-established development guidelines, but the literature contains reports on how companies develop and maintain deployed ML-based software systems. Objective : This study aims to survey the literature related to development and maintenance of large scale ML -based systems in industrial settings in order to provide a synthesis of the challenges that practitioners face. In addition, we identify solutions used to address some of these challenges. Method : A systematic literature review was conducted and we identified 72 papers related to development and maintenance of large scale ML-based software systems in industrial settings. The selected articles were qualita-tively analyzed by extracting challenges and solutions. The challenges and solutions were thematically synthe-sized into four quality attributes: adaptability, scalability, safety and privacy. The analysis was done in relation to ML workflow, i.e. data acquisition, training, evaluation, and deployment. Results : We identified a total of 23 challenges and 8 solutions related to development and maintenance of large scale ML-based software systems in industrial settings including six different domains. Challenges were most often reported in relation to adaptability and scalability. Safety and privacy challenges had the least reported solutions. Conclusion : The development and maintenance on large-scale ML-based systems in industrial settings introduce new challenges specific for ML, and for the known challenges characteristic for these types of systems, require new methods in overcoming the challenges. The identified challenges highlight important concerns in ML system development practice and the lack of solutions point to directions for future research.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Large-scale real-world radio signal recognition with deep learning
    Ya TU
    Yun LIN
    Haoran ZHA
    Ju ZHANG
    Yu WANG
    Guan GUI
    Shiwen MAO
    Chinese Journal of Aeronautics, 2022, (09) : 35 - 48
  • [2] Large-scale real-world radio signal recognition with deep learning
    Ya TU
    Yun LIN
    Haoran ZHA
    Ju ZHANG
    Yu WANG
    Guan GUI
    Shiwen MAO
    Chinese Journal of Aeronautics, 2022, 35 (09) : 35 - 48
  • [3] Large-scale real-world radio signal recognition with deep learning
    Tu, Ya
    Lin, Yun
    Zha, Haoran
    Zhang, Ju
    Wang, Yu
    Gui, Guan
    Mao, Shiwen
    CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (09) : 35 - 48
  • [4] Recommender Systems for Large-Scale Social Networks: A review of challenges and solutions
    Eirinaki, Magdalini
    Gao, Jerry
    Varlamis, Iraklis
    Tserpes, Konstantinos
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 78 : 413 - 418
  • [5] CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos
    Han, Seungju
    Hessel, Jack
    Dziri, Nouha
    Choi, Yejin
    Yu, Youngjae
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15452 - 15463
  • [6] Large-scale clique cover of real-world networks
    Conte, Alessio
    Grossi, Roberto
    Marino, Andrea
    INFORMATION AND COMPUTATION, 2020, 270
  • [7] The Proof Is in the Putting Large-Scale Demonstrations of Renewables Integration Showcase Real-World Solutions
    Lorenzo, Miguel
    Burdalo, Unai
    Veguillas, Roberto
    Birke, Anders
    Despouys, Olivier
    Byriel, Inger Pihl
    Druet, Christophe
    Abart, Andreas
    Bissel, Gareth
    Sebastian-Viana, Maria
    IEEE POWER & ENERGY MAGAZINE, 2015, 13 (01): : 75 - 83
  • [8] ScalarFlow: A Large-Scale Volumetric Data Set of Real-world Scalar Transport Flows for Computer Animation and Machine Learning
    Eckert, Marie-Lena
    Um, Kiwon
    Thuerey, Nils
    ACM TRANSACTIONS ON GRAPHICS, 2019, 38 (06):
  • [9] Challenges and Practices of Large Scale Visual Intelligence in the Real-World
    Hua, Xian-Sheng
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 364 - 364
  • [10] Introducing a process framework for implementing models of large-scale real-world systems in software
    Andreou, Andreas S.
    Software Process Improvement and Practice, 2004, 9 (03): : 133 - 155