VHDL-Eval: A Framework for Evaluating Large Language Models in VHDL Code Generation

Cited by: 0
Authors
Vijayaraghavan, Prashanth [1 ]
Shi, Luyao [1 ]
Ambrogio, Stefano [1 ]
Mackin, Charles [1 ]
Nitsure, Apoorva [1 ]
Beymer, David [1 ]
Degan, Ehsan [1 ]
Affiliations
[1] IBM Res, San Jose, CA 95120 USA
Keywords
LLMs; large language models; VHDL Code generation; VHDL Evaluation; hardware design automation; Hardware Description Languages; HDL; PEFT; ICL;
DOI
10.1109/LAD62341.2024.10691836
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the unprecedented advancements in Large Language Models (LLMs), their application domains have expanded to include code generation tasks across various programming languages. While significant progress has been made in enhancing LLMs for popular programming languages, there exists a notable gap in comprehensive evaluation frameworks tailored for Hardware Description Languages (HDLs), particularly VHDL. This paper addresses this gap by introducing a comprehensive evaluation framework designed specifically for assessing LLM performance on the VHDL code generation task. We construct a dataset for evaluating LLMs on this task by translating a collection of Verilog evaluation problems to VHDL and aggregating publicly available VHDL problems, resulting in a total of 202 problems. To assess the functional correctness of the generated VHDL code, we utilize a curated set of self-verifying testbenches specifically designed for this aggregated VHDL problem set. We conduct an initial evaluation of different LLMs and their variants, including zero-shot code generation, in-context learning (ICL), and parameter-efficient fine-tuning (PEFT) methods. Our findings underscore the considerable challenges faced by existing LLMs in VHDL code generation, revealing significant scope for improvement. This study emphasizes the necessity of supervised fine-tuning of code generation models specifically for VHDL, offering potential benefits to VHDL designers seeking efficient code generation solutions.
Pages: 6
Related papers
50 in total
  • [21] Architecture refinements by code refactoring of behavioral VHDL-AMS models
    Zeng, Kaiping
    Huss, Sorin A.
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 1475 - +
  • [22] Generating VHDL Source Code from UML Models of Embedded Systems
    Moreira, Tomas G.
    Wehrmeister, Marco A.
    Pereira, Carlos E.
    Petin, Jean-Francois
    Levrat, Eric
    DISTRIBUTED, PARALLEL AND BIOLOGICALLY INSPIRED SYSTEMS, 2010, 329 : 125 - +
  • [23] Presynthesis Test Generation using VHDL Behavioral Fault Models
    Hayne, Ronald J.
    IEEE SOUTHEASTCON 2011: BUILDING GLOBAL ENGINEERS, 2011, : 264 - 267
  • [24] Simulation of the Behavioral Models of Components Optoelectronics in Language VHDL-AMS
    Baouche, Fatima Zohra
    Hobar, Farida
    Moussaoui, M.
    Rachida, Saoudi
    AFRICAN REVIEW OF PHYSICS, 2008, 2 : 113 - 115
  • [25] Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
    Riddell, Martin
    Ni, Ansong
    Cohan, Arman
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 14116 - 14137
  • [26] Combining Software and Hardware Test Generation Methods to Verify VHDL Models
    Jusas, Vacius
    Neverdauskas, Tomas
    INFORMATION TECHNOLOGY AND CONTROL, 2013, 42 (04): : 362 - 368
  • [27] Enhancing Automatic Generation of VHDL Descriptions from UML/MARTE Models
    Leite, Marcela
    Vasconcellos, Cristiano D.
    Wehrmeister, Marco Aurelio
    2014 12TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2014, : 152 - +
  • [28] Evaluating Large Language Models on Controlled Generation Tasks
    Sun, Jiao
    Tian, Yufei
    Zhou, Wangchunshu
    Xu, Nan
    Hu, Qian
    Gupta, Rahul
    Wieting, John
    Peng, Nanyun
    Ma, Xuezhe
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 3155 - 3168
  • [29] Automatic generation of VHDL code for self-timed circuits from simulink specifications
    Tranchero, Maurizio
    Reyneri, Leonardo M.
    2007 14TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-4, 2007, : 287 - 290
  • [30] Expectation vs. Experience: Evaluating the Usability of Code Generation Tools Powered by Large Language Models
    Vaithilingam, Priyan
    Zhang, Tianyi
    Glassman, Elena L.
    EXTENDED ABSTRACTS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2022, 2022,