Trusta: Reasoning about assurance cases with formal methods and large language models

被引：0

作者：

Chen, Zezhong ^{[1
]}

Deng, Yuxin ^{[1
]}

Du, Wenjie ^{[2
]}

机构：

[1] East China Normal Univ, Shanghai Key Lab Trustworthy Comp, Shanghai 200062, Peoples R China

[2] Shanghai Normal Univ, Shanghai 200233, Peoples R China

来源：

SCIENCE OF COMPUTER PROGRAMMING | 2025年 / 244卷

基金：

中国国家自然科学基金;

关键词：

Assurance cases; Trustworthiness derivation trees; Large language models; Formal methods; Constraint solving;

D O I：

10.1016/j.scico.2025.103288

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Assurance cases can be used to argue for the safety of products in safety engineering. In safety-critical areas, the construction of assurance cases is indispensable. We introduce the Trustworthiness Derivation Tree Analyzer (Trusta), a tool designed to enhance the development and evaluation of assurance cases by integrating formal methods and large language models (LLMs). The tool incorporates a Prolog interpreter and solvers like Z3 and MONA to handle various constraint types, enhancing the precision and efficiency of assurance case assessment. Beyond traditional formal methods, Trusta harnesses the power of LLMs including ChatGPT-3.5, ChatGPT4, and PaLM 2, assisting humans in the development of assurance cases and the writing of formal constraints. Our evaluation, through qualitative and quantitative analyses, shows Trusta's impact on improving assurance case quality and efficiency. Trusta enables junior engineers to reach the skill level of experienced safety experts, narrowing the expertise gap and greatly benefiting those with limited experience. Case studies, including automated guided vehicles (AGVs), demonstrate Trusta's effectiveness in identifying subtle issues and improving the overall trustworthiness of complex systems.

引用

页数：32

共 50 条

[31] Applying formal methods and representations in a natural language tutor to teach tactical reasoning
Murray, WR
Pease, A
Sams, M
ARTIFICIAL INTELLIGENCE IN EDUCATION: SHAPING THE FUTURE OF LEARNING THROUGH INTELLIGENT TECHNOLOGIES, 2003, 97 : 349 - 356
[32] NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Zhou, Gengze
Hong, Yicong
Wu, Qi
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7641 - 7649
[33] IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
You, Haoxuan
Sun, Rui
Wang, Zhecan
Chen, Long
Wang, Gengyu
Ayyubi, Hammad A.
Chang, Kai-Wei
Chang, Shih-Fu
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11289 - 11303
[34] Towards Analysis and Interpretation of Large Language Models for Arithmetic Reasoning
Akter, Mst Shapna
Shahriar, Hossain
Cuzzocrea, Alfredo
2024 11TH IEEE SWISS CONFERENCE ON DATA SCIENCE, SDS 2024, 2024, : 267 - 270
[35] On Implementing Case-Based Reasoning with Large Language Models
Wilkerson, Kaitlynne
Leake, David
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2024, 2024, 14775 : 404 - 417
[36] Reasoning with Large Language Models on Graph Tasks: The Influence of Temperature
Wang, Yiming
Zhang, Ziyang
Chen, Hanwei
Shen, Huayi
2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 630 - 634
[37] Over-Reasoning and Redundant Calculation of Large Language Models
Chiang, Cheng-Han
Lee, Hung-yi
PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024, : 161 - 169
[38] Exploring Reversal Mathematical Reasoning Ability for Large Language Models
Guo, Pei
You, Wangjie
Li, Juntao
Yan, Bowen
Zhang, Min
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13671 - 13685
[39] The Return of Formal Requirements Engineering in the Era of Large Language Models
Spoletini, Paola
Ferrari, Alessio
REQUIREMENTS ENGINEERING: FOUNDATION FOR SOFTWARE QUALITY, REFSQ 2024, 2024, 14588 : 344 - 353
[40] The System BioC for Reasoning about Biological Models in Action Language C
Dworschak, Steve
Grote, Torsten
Koenig, Ame
Schaub, Torsten
Veber, Philippe
20TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL 1, PROCEEDINGS, 2008, : 11 - 18

← 1 2 3 4 5 →