Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions

Cited by: 82
Authors
Lwakatare, Lucy Ellen [1]
Raj, Aiswarya [1]
Crnkovic, Ivica [1]
Bosch, Jan [1]
Olsson, Helena Holmstrom [2]
Affiliations
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, Horselgangen 11, S-41296 Gothenburg, Sweden
[2] Malmo Univ, Dept Comp Sci & Media Technol, Nordenskioldsgatan 1, S-21119 Malmo, Sweden
Keywords
Machine learning systems; Software engineering; Industrial settings; Challenges; Solutions; SLR
DOI
10.1016/j.infsof.2020.106368
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Background: Developing and maintaining large-scale machine learning (ML) based software systems in an industrial setting is challenging. There are no well-established development guidelines, but the literature contains reports on how companies develop and maintain deployed ML-based software systems.
Objective: This study aims to survey the literature related to the development and maintenance of large-scale ML-based systems in industrial settings in order to provide a synthesis of the challenges that practitioners face. In addition, we identify solutions used to address some of these challenges.
Method: A systematic literature review was conducted, and we identified 72 papers related to the development and maintenance of large-scale ML-based software systems in industrial settings. The selected articles were qualitatively analyzed by extracting challenges and solutions. The challenges and solutions were thematically synthesized into four quality attributes: adaptability, scalability, safety and privacy. The analysis was done in relation to the ML workflow, i.e. data acquisition, training, evaluation, and deployment.
Results: We identified a total of 23 challenges and 8 solutions related to the development and maintenance of large-scale ML-based software systems in industrial settings, including six different domains. Challenges were most often reported in relation to adaptability and scalability. Safety and privacy challenges had the fewest reported solutions.
Conclusion: The development and maintenance of large-scale ML-based systems in industrial settings introduce new challenges specific to ML and, for the known challenges characteristic of these types of systems, require new methods to overcome them. The identified challenges highlight important concerns in ML system development practice, and the lack of solutions points to directions for future research.
Pages: 17
Related papers
50 in total
  • [31] The psychosis analysis in real-world on a cohort of large-scale patients with schizophrenia
    Wenyan Tan
    Haicheng Lin
    Baoxin Lei
    Aihua Ou
    Zehui He
    Ning Yang
    Fujun Jia
    Heng Weng
    Tianyong Hao
    BMC Medical Informatics and Decision Making, 20
  • [32] RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception
    Hao, Ruiyang
    Fan, Siqi
    Dai, Yingru
    Zhang, Zhenlin
    Li, Chenxi
    Wang, Yuntian
    Yu, Haibao
    Yang, Wenxian
    Yuan, Jirui
    Nie, Zaiqing
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 22347 - 22357
  • [33] Real-world large-scale study on adaptive notification scheduling on smartphones
    Okoshi, Tadashi
    Tsubouchi, Kota
    Tokuda, Hideyuki
    PERVASIVE AND MOBILE COMPUTING, 2018, 50 : 1 - 24
  • [34] Association Between Myopia and Pupil Diameter in Preschoolers: Evidence from a Machine Learning Approach Based on a Real-World Large-Scale Dataset
    Xu, Shengsong
    Li, Linling
    Han, Wenjing
    Zhu, Yingting
    Hu, Yin
    Li, Zhidong
    Ruan, Zhenbang
    Zhou, Zhuandi
    Zhuo, Yehong
    Fu, Min
    Yang, Xiao
    OPHTHALMOLOGY AND THERAPY, 2024, 13 (07) : 2009 - 2022
  • [35] Learning Mechanisms in Digital Control of Large-Scale Industrial Systems
    Tsyganov, V. V.
    2018 GLOBAL SMART INDUSTRY CONFERENCE (GLOSIC), 2018,
  • [36] Real-world large-scale terrain model reconstruction and real-time rendering
    Li, Rui
    28TH INTERNATIONAL CONFERENCE ON WEB3D TECHNOLOGY, WEB3D 2023, 2023,
  • [37] Deep Learning based Channel Extrapolation for Large-Scale Antenna Systems: Opportunities, Challenges and Solutions
    Zhang, Shun
    Liu, Yushan
    Gao, Feifei
    Xing, Chengwen
    An, Jianping
    Dobre, Octavia A.
    IEEE WIRELESS COMMUNICATIONS, 2021, 28 (06) : 160 - 167
  • [38] Deep Learning Based Facial Age Phenotyping in a Large-Scale, Real-World Cancer Patient Population
    Lee, G.
    Haugg, F.
    Bontempi, D.
    He, J.
    Zalay, O.
    Bitterman, D. S.
    Catalano, P. J.
    Prudente, V.
    Pai, S.
    Guthier, C. V.
    Kann, B. H.
    Aerts, H.
    Mak, R. H.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2024, 120 (02) : S170 - S170
  • [39] Deep Learning for Large-Scale Real-World ACARS and ADS-B Radio Signal Classification
    Chen, Shichuan
    Zheng, Shilian
    Yang, Lifeng
    Yang, Xiaoniu
    IEEE ACCESS, 2019, 7 : 89256 - 89264
  • [40] Continual Learning for Real-World Autonomous Systems: Algorithms, Challenges and Frameworks
    Shaheen, Khadija
    Hanif, Muhammad Abdullah
    Hasan, Osman
    Shafique, Muhammad
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)