Navigating Challenges and Technical Debt in Large Language Models Deployment

Cited by: 0
Authors
Menshawy, Ahmed [1]
Nawaz, Zeeshan [1]
Fahmy, Mahmoud [1]
Affiliations
[1] Mastercard, AI Engn, Dublin, Ireland
Keywords
Large Language Models (LLMs); LLMs Deployment; Technical Debt in AI; LLM Model Compression and Pruning; High-Throughput LLM Processing; LLM Deployment Challenges; Scalability Challenges in LLMs Deployment
DOI
10.1145/3642970.3655840
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large Language Models (LLMs) have become an essential tool in advancing artificial intelligence and machine learning, enabling outstanding capabilities in natural language processing and understanding. However, deploying LLMs efficiently in production environments reveals a complex landscape of challenges and technical debt. In this paper, we highlight the distinct forms of challenges and technical debt associated with LLM deployment, including those related to memory management, parallelism strategies, model compression, and attention optimization. These challenges call for custom deployment approaches and sophisticated engineering solutions that are not readily available in general-purpose machine learning libraries or inference engines.
Pages: 192-199
Page count: 8