Navigating Challenges and Technical Debt in Large Language Models Deployment

被引：0

作者：

Menshawy, Ahmed ^{[1
]}

Nawaz, Zeeshan ^{[1
]}

Fahmy, Mahmoud ^{[1
]}

机构：

[1] Mastercard, AI Engn, Dublin, Ireland

来源：

PROCEEDINGS OF THE 2024 4TH WORKSHOP ON MACHINE LEARNING AND SYSTEMS, EUROMLSYS 2024 | 2024年

关键词：

Large Language Models (LLMs); LLMs Deployment; Technical Debt in AI; LLM Model Compression and Pruning; High-Throughput LLM Processing; LLM Deployment Challenges; Scalability Challenges in LLMs Deployment;

D O I：

10.1145/3642970.3655840

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Large Language Models (LLMs) have become an essential tool in advancing artificial intelligence and machine learning, enabling outstanding capabilities in natural language processing, and understanding. However, the efficient deployment of LLMs in production environments reveals a complex landscape of challenges and technical debt. In this paper, we aim to highlight unique forms of challenges and technical debt associated with the deployment of LLMs, including those related to memory management, parallelism strategies, model compression, and attention optimization. These challenges emphasize the necessity of custom approaches to deploying LLMs, demanding customization and sophisticated engineering solutions not readily available in broad-use machine learning libraries or inference engines.

引用

页码：192 / 199

页数：8

共 50 条

[11] Large language models in psychiatry: Opportunities and challenges
Volkmer, Sebastian
Meyer-Lindenberg, Andreas
Schwarz, Emanuel
[J]. PSYCHIATRY RESEARCH, 2024, 339
[12] Ethical and Theological Challenges of Large Language Models
Strahornik, Vojko
[J]. BOGOSLOVNI VESTNIK-THEOLOGICAL QUARTERLY-EPHEMERIDES THEOLOGICAE, 2023, 83 (04): : 839 - 852
[13] MULTILINGUAL JAILBREAK CHALLENGES IN LARGE LANGUAGE MODELS
Deng, Yue
Zhang, Wenxuan
Pan, Sinno Jialin
Bing, Lidong
[J]. arXiv, 2023,
[14] Deployment and Comparison of Large Language Models Based on Virtual Cluster
Li, Kai
Cao, Rongqiang
Wan, Meng
Wang, Xiaoguang
Wang, Zongguo
Wang, Jue
Wang, Yangang
[J]. ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 359 - 365
[15] Navigating the legal landscape: large language models and the hesitancy of legal professionals
Ogunde, Fife
[J]. INTERNATIONAL JOURNAL OF THE LEGAL PROFESSION, 2024,
[16] Navigating the Evolution: The Rising Tide of Large Language Models for AI and Education
Clark, Peter
[J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, PT I, 2024, 2150 : XXXI - XXXIV
[17] Navigating the security landscape of large language models in enterprise information systems
Gupta, Brij B.
Gaurav, Akshat
Arya, Varsha
[J]. ENTERPRISE INFORMATION SYSTEMS, 2024, 18 (04)
[18] Navigating the Evolution: The Rising Tide of Large Language Models for AI and Education
Clark, Peter
[J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, 2024, 2151 : XXXI - XXXIV
[19] ChatGPT and large language models in academia: opportunities and challenges
Jesse G. Meyer
Ryan J. Urbanowicz
Patrick C. N. Martin
Karen O’Connor
Ruowang Li
Pei-Chen Peng
Tiffani J. Bright
Nicholas Tatonetti
Kyoung Jae Won
Graciela Gonzalez-Hernandez
Jason H. Moore
[J]. BioData Mining, 16
[20] Relationalizing Tables with Large Language Models: The Promise and Challenges
Huang, Zezhou
Wu, Eugene
[J]. 2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 305 - 309

← 1 2 3 4 5 →