Large language models for code completion: A systematic literature review

Cited by: 0

Authors:

Husein, Rasha Ahmad [1]

Aburajouh, Hala [1]

Catal, Cagatay [1]

Affiliations:

[1] Qatar Univ, Dept Comp Sci & Engn, Doha, Qatar

Source:

COMPUTER STANDARDS & INTERFACES | 2025 / Vol. 92

Keywords:

Code completion; Large language models; Deep learning; Transformers

DOI:

10.1016/j.csi.2024.103917

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Subject classification code:

0812

Abstract:

Code completion is a fundamental aspect of modern software development that improves developers' coding processes. Integrating code completion tools into an Integrated Development Environment (IDE) or code editor boosts productivity by reducing errors, speeding up code writing, and lowering cognitive load. This is achieved by predicting subsequent tokens, such as keywords, variable names, types, function names, and operators. Different techniques can achieve code completion, and recent research has focused on Deep Learning methods, particularly Large Language Models (LLMs) utilizing Transformer algorithms. While several research papers have focused on the use of LLMs for code completion, these studies are fragmented, and there is no systematic overview of the topic. Therefore, we performed a Systematic Literature Review (SLR) to investigate how LLMs have been applied to code completion so far. We formulated several research questions to address how LLMs have been integrated into code completion-related tasks and to assess their efficacy in this context. To this end, we retrieved 244 papers from scientific databases using auto-search and specific keywords, and finally selected 23 primary studies for in-depth analysis based on an SLR methodology. This SLR categorizes the granularity levels of code completion achieved by LLMs in IDEs, and explores the issues in current code completion systems, how LLMs address these challenges, and the pre-training and fine-tuning methods employed. Additionally, this study identifies open research problems and outlines future research directions. Our analysis reveals that LLMs significantly enhance code completion performance across several programming languages and contexts, and that their capability to predict relevant code snippets based on context and partial input substantially boosts developer productivity.


Pages: 15
