A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges

被引：19

作者：

Raiaan, Mohaimenul Azam Khan ^{[1
]}

Mukta, Md. Saddam Hossain ^{[2
]}

Fatema, Kaniz ^{[3
]}

Fahad, Nur Mohammad ^{[1
]}

Sakib, Sadman ^{[1
]}

Mim, Most Marufatul Jannat ^{[1
]}

Ahmad, Jubaer ^{[1
]}

Ali, Mohammed Eunus ^{[4
]}

Azam, Sami ^{[3
]}

机构：

[1] United Int Univ, Dept Comp Sci & Engn, Dhaka 1212, Bangladesh

[2] Lappeenranta Lahti Univ Technol, LUT Sch Engn Sci, Lappeenranta 53850, Finland

[3] Charles Darwin Univ, Fac Sci & Technol, Casuarina, NT 0909, Australia

[4] Bangladesh Univ Engn & Technol BUET, Dept CSE, Dhaka 1000, Bangladesh

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Cognition; Artificial intelligence; Transformers; Training; Taxonomy; Task analysis; Surveys; Natural language processing; Question answering (information retrieval); Information analysis; Linguistics; Large language models (LLM); natural language processing (NLP); artificial intelligence; transformer; pre-trained models; taxonomy; application; GPT-4; BIAS;

D O I：

10.1109/ACCESS.2024.3365742

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Large Language Models (LLMs) recently demonstrated extraordinary capability in various natural language processing (NLP) tasks including language translation, text generation, question answering, etc. Moreover, LLMs are new and essential part of computerized language processing, having the ability to understand complex verbal patterns and generate coherent and appropriate replies in a given context. Though this success of LLMs has prompted a substantial increase in research contributions, rapid growth has made it difficult to understand the overall impact of these improvements. Since a plethora of research on LLMs have been appeared within a short time, it is quite impossible to track all of these and get an overview of the current state of research in this area. Consequently, the research community would benefit from a short but thorough review of the recent changes in this area. This article thoroughly overviews LLMs, including their history, architectures, transformers, resources, training methods, applications, impacts, challenges, etc. This paper begins by discussing the fundamental concepts of LLMs with its traditional pipeline of the LLMs training phase. Then the paper provides an overview of the existing works, the history of LLMs, their evolution over time, the architecture of transformers in LLMs, the different resources of LLMs, and the different training methods that have been used to train them. The paper also demonstrates the datasets utilized in the studies. After that, the paper discusses the wide range of applications of LLMs, including biomedical and healthcare, education, social, business, and agriculture. The study also illustrates how LLMs create an impact on society and shape the future of AI and how they can be used to solve real-world problems. Finally, the paper also explores open issues and challenges to deploy LLMs in real-world scenario. Our review paper aims to help practitioners, researchers, and experts thoroughly understand the evolution of LLMs, pre-trained architectures, applications, challenges, and future goals.

引用

页码：26839 / 26874

页数：36

共 50 条

[41] UAV Communications with Machine Learning: Challenges, Applications and Open Issues
Sana Ben Aissa
Asma Ben Letaifa
[J]. Arabian Journal for Science and Engineering, 2022, 47 : 1559 - 1579
[42] UAV Communications with Machine Learning: Challenges, Applications and Open Issues
Ben Aissa, Sana
Ben Letaifa, Asma
[J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (02) : 1559 - 1579
[43] Large language models: a primer and gastroenterology applications
Shahab, Omer
El Kurdi, Bara
Shaukat, Aasma
Nadkarni, Girish
Soroush, Ali
[J]. THERAPEUTIC ADVANCES IN GASTROENTEROLOGY, 2024, 17
[44] Looking to Future Applications of Large Language Models
Liu, Xichong
Rubin, Samuel J. S.
Rogalla, Stephan
[J]. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2023, 118 (12): : 2306 - 2306
[45] Limitations of large language models in medical applications
Deng, Jiawen
Zubair, Areeba
Park, Ye-Jean
[J]. POSTGRADUATE MEDICAL JOURNAL, 2023, 99 (1178) : 1298 - 1299
[46] Julia language in machine learning: Algorithms, applications, and open issues
Gao, Kaifeng
Mei, Gang
Piccialli, Francesco
Cuomo, Salvatore
Tu, Jingzhi
Huo, Zenan
[J]. COMPUTER SCIENCE REVIEW, 2020, 37
[47] From Large Language Models to Large Multimodal Models: A Literature Review
Huang, Dawei
Yan, Chuan
Li, Qing
Peng, Xiaojiang
[J]. APPLIED SCIENCES-BASEL, 2024, 14 (12):
[48] ChatGPT and large language models in academia: opportunities and challenges
Jesse G. Meyer
Ryan J. Urbanowicz
Patrick C. N. Martin
Karen O’Connor
Ruowang Li
Pei-Chen Peng
Tiffani J. Bright
Nicholas Tatonetti
Kyoung Jae Won
Graciela Gonzalez-Hernandez
Jason H. Moore
[J]. BioData Mining, 16
[49] Relationalizing Tables with Large Language Models: The Promise and Challenges
Huang, Zezhou
Wu, Eugene
[J]. 2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 305 - 309
[50] The Social Opportunities and Challenges in the Era of Large Language Models
Huimin, Chen
Zhiyuan, Liu
Maosong, Sun
[J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (05): : 1094 - 1103

← 1 2 3 4 5 →