VHDL-Eval: A Framework for Evaluating Large Language Models in VHDL Code Generation

被引：0

作者：

Vijayaraghavan, Prashanth ^{[1
]}

Shi, Luyao ^{[1
]}

Ambrogio, Stefano ^{[1
]}

Mackin, Charles ^{[1
]}

Nitsure, Apoorva ^{[1
]}

Beymer, David ^{[1
]}

Degan, Ehsan ^{[1
]}

机构：

[1] IBM Res, San Jose, CA 95120 USA

来源：

2024 IEEE LLM AIDED DESIGN WORKSHOP, LAD 2024 | 2024年

关键词：

LLMs; large language models; VHDL Code generation; VHDL Evaluation; hardware design automation; Hardware Description Languages; HDL; PEFT; ICL;

D O I：

10.1109/LAD62341.2024.10691836

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the unprecedented advancements in Large Language Models (LLMs), their application domains have expanded to include code generation tasks across various programming languages. While significant progress has been made in enhancing LLMs for popular programming languages, there exists a notable gap in comprehensive evaluation frameworks tailored for Hardware Description Languages (HDLs), particularly VHDL. This paper addresses this gap by introducing a comprehensive evaluation framework designed specifically for assessing LLM performance in VHDL code generation task. We construct a dataset for evaluating LLMs on VHDL code generation task. This dataset is constructed by translating a collection of Verilog evaluation problems to VHDL and aggregating publicly available VHDL problems, resulting in a total of 202 problems. To assess the functional correctness of the generated VHDL code, we utilize a curated set of self-verifying testbenches specifically designed for those aggregated VHDL problem set. We conduct an initial evaluation of different LLMs and their variants, including zero-shot code generation, in-context learning (ICL), and Parameter-efficient fine-tuning (P EFT) methods. Our findings underscore the considerable challenges faced by existing LLMs in VHDL code generation, revealing significant scope for improvement. This study emphasizes the necessity of supervised fine-tuning code generation models specifically for VHDL, offering potential benefits to V HDL designers seeking efficient code generation solutions.

引用

页数：6

共 50 条

[21] Architecture refinements by code refactoring of behavioral VHDL-AMS models
Zeng, Kaiping
Huss, Sorin A.
2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 1475 - +
[22] Generating VHDL Source Code from UML Models of Embedded Systems
Moreira, Tomas G.
Wehrmeister, Marco A.
Pereira, Carlos E.
Petin, Jean-Francois
Levrat, Eric
DISTRIBUTED, PARALLEL AND BIOLOGICALLY INSPIRED SYSTEMS, 2010, 329 : 125 - +
[23] Presynthesis Test Generation using VHDL Behavioral Fault Models
Hayne, Ronald J.
IEEE SOUTHEASTCON 2011: BUILDING GLOBAL ENGINEERS, 2011, : 264 - 267
[24] Simulation of the Behavioral Models of Components Optoelectronics in Language VHDL-AMS
Baouche, Fatima Zohra
Hobar, Farida
Moussaoui, M.
Rachida, Saoudi
AFRICAN REVIEW OF PHYSICS, 2008, 2 : 113 - 115
[25] Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
Riddell, Martin
Ni, Ansong
Cohan, Arman
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 14116 - 14137
[26] Combining Software and Hardware Test Generation Methods to Verify VHDL Models
Jusas, Vacius
Neverdauskas, Tomas
INFORMATION TECHNOLOGY AND CONTROL, 2013, 42 (04): : 362 - 368
[27] Enhancing Automatic Generation of VHDL Descriptions from UML/MARTE Models
Leite, Marcela
Vasconcellos, Cristiano D.
Wehrmeister, Marco Aurelio
2014 12TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2014, : 152 - +
[28] Evaluating Large Language Models on Controlled Generation Tasks
Sun, Jiao
Tian, Yufei
Zhou, Wangchunshu
Xu, Nan
Hu, Qian
Gupta, Rahul
Wieting, John
Peng, Nanyun
Ma, Xuezhe
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3155 - 3168
[29] Automatic generation of VHDL code for self-timed circuits from simulink specifications
Tranchero, Maurizio
Reyneri, Leonardo M.
2007 14TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-4, 2007, : 287 - 290
[30] Expectation vs. Experience: Evaluating the Usability of Code Generation Tools Powered by Large Language Models
Vaithilingam, Priyan
Zhang, Tianyi
Glassman, Elena L.
EXTENDED ABSTRACTS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2022, 2022,

← 1 2 3 4 5 →