Navigating Challenges and Technical Debt in Large Language Models Deployment

被引:0
|
作者
Menshawy, Ahmed [1 ]
Nawaz, Zeeshan [1 ]
Fahmy, Mahmoud [1 ]
机构
[1] Mastercard, AI Engn, Dublin, Ireland
关键词
Large Language Models (LLMs); LLMs Deployment; Technical Debt in AI; LLM Model Compression and Pruning; High-Throughput LLM Processing; LLM Deployment Challenges; Scalability Challenges in LLMs Deployment;
D O I
10.1145/3642970.3655840
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Language Models (LLMs) have become an essential tool in advancing artificial intelligence and machine learning, enabling outstanding capabilities in natural language processing, and understanding. However, the efficient deployment of LLMs in production environments reveals a complex landscape of challenges and technical debt. In this paper, we aim to highlight unique forms of challenges and technical debt associated with the deployment of LLMs, including those related to memory management, parallelism strategies, model compression, and attention optimization. These challenges emphasize the necessity of custom approaches to deploying LLMs, demanding customization and sophisticated engineering solutions not readily available in broad-use machine learning libraries or inference engines.
引用
收藏
页码:192 / 199
页数:8
相关论文
共 50 条
  • [11] Large language models in psychiatry: Opportunities and challenges
    Volkmer, Sebastian
    Meyer-Lindenberg, Andreas
    Schwarz, Emanuel
    [J]. PSYCHIATRY RESEARCH, 2024, 339
  • [12] Ethical and Theological Challenges of Large Language Models
    Strahornik, Vojko
    [J]. BOGOSLOVNI VESTNIK-THEOLOGICAL QUARTERLY-EPHEMERIDES THEOLOGICAE, 2023, 83 (04): : 839 - 852
  • [13] MULTILINGUAL JAILBREAK CHALLENGES IN LARGE LANGUAGE MODELS
    Deng, Yue
    Zhang, Wenxuan
    Pan, Sinno Jialin
    Bing, Lidong
    [J]. arXiv, 2023,
  • [14] Deployment and Comparison of Large Language Models Based on Virtual Cluster
    Li, Kai
    Cao, Rongqiang
    Wan, Meng
    Wang, Xiaoguang
    Wang, Zongguo
    Wang, Jue
    Wang, Yangang
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 359 - 365
  • [15] Navigating the legal landscape: large language models and the hesitancy of legal professionals
    Ogunde, Fife
    [J]. INTERNATIONAL JOURNAL OF THE LEGAL PROFESSION, 2024,
  • [16] Navigating the Evolution: The Rising Tide of Large Language Models for AI and Education
    Clark, Peter
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, PT I, 2024, 2150 : XXXI - XXXIV
  • [17] Navigating the security landscape of large language models in enterprise information systems
    Gupta, Brij B.
    Gaurav, Akshat
    Arya, Varsha
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2024, 18 (04)
  • [18] Navigating the Evolution: The Rising Tide of Large Language Models for AI and Education
    Clark, Peter
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, 2024, 2151 : XXXI - XXXIV
  • [19] ChatGPT and large language models in academia: opportunities and challenges
    Jesse G. Meyer
    Ryan J. Urbanowicz
    Patrick C. N. Martin
    Karen O’Connor
    Ruowang Li
    Pei-Chen Peng
    Tiffani J. Bright
    Nicholas Tatonetti
    Kyoung Jae Won
    Graciela Gonzalez-Hernandez
    Jason H. Moore
    [J]. BioData Mining, 16
  • [20] Relationalizing Tables with Large Language Models: The Promise and Challenges
    Huang, Zezhou
    Wu, Eugene
    [J]. 2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 305 - 309