Fine-Tuning Large Language Models for Ontology Engineering: A Comparative Analysis of GPT-4 and Mistral

Cited by: 0
Authors
Doumanas, Dimitrios [1 ]
Soularidis, Andreas [1 ]
Spiliotopoulos, Dimitris [2 ]
Vassilakis, Costas [3 ]
Kotis, Konstantinos [1 ]
Affiliations
[1] Univ Aegean, Dept Cultural Technol & Commun, Intelligent Syst Lab, Mitilini 81100, Greece
[2] Univ Peloponnese, Dept Management Sci & Technol, Tripolis 22100, Greece
[3] Univ Peloponnese, Dept Informat & Telecommun, Tripolis 22100, Greece
Source
APPLIED SCIENCES-BASEL | 2025, Vol. 15, No. 04
Keywords
large language models (LLMs) fine-tuning; ontology engineering (OE); domain-specific knowledge; search and rescue (SAR)
DOI
10.3390/app15042146
Chinese Library Classification
O6 [Chemistry]
Subject Classification Code
0703
Abstract
Ontology engineering (OE) plays a critical role in modeling and managing structured knowledge across various domains. This study examines the performance of fine-tuned large language models (LLMs), specifically GPT-4 and Mistral 7B, in efficiently automating OE tasks. Foundational OE textbooks are used as the basis for dataset creation and for feeding the LLMs. The methodology involved segmenting texts into manageable chapters, generating question-answer pairs, and translating visual elements into description logic to curate fine-tuned datasets in JSONL format. This research aims to enhance the models' abilities to generate domain-specific ontologies, with hypotheses asserting that fine-tuned LLMs would outperform base models, and that domain-specific datasets would significantly improve their performance. Comparative experiments revealed that GPT-4 demonstrated superior accuracy and adherence to ontology syntax, albeit with higher computational costs. Conversely, Mistral 7B excelled in speed and cost efficiency but struggled with domain-specific tasks, often generating outputs that lacked syntactical precision and relevance. The presented results highlight the necessity of integrating domain-specific datasets to improve contextual understanding and practical utility in specialized applications, such as Search and Rescue (SAR) missions in wildfire incidents. Both models, despite their limitations, exhibited potential in understanding OE principles. However, their performance underscored the importance of aligning training data with domain-specific knowledge to emulate human expertise effectively. This study, based on and extending our previous work on the topic, concludes that fine-tuned LLMs with targeted datasets enhance their utility in OE, offering insights into improving future models for domain-specific applications. The findings advocate further exploration of hybrid solutions to balance accuracy and efficiency.
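The abstract describes curating fine-tuning datasets in JSONL format from question-answer pairs generated per textbook chapter, with visual elements translated into description logic. The paper does not give its exact record schema, so the following is only a minimal sketch of such serialization, assuming a chat-style message layout as accepted by common fine-tuning APIs; the SYSTEM_PROMPT, the file name oe_finetune.jsonl, and the example pairs are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch (assumed schema, not the paper's): serialize question-answer
# pairs derived from OE textbook chapters into a JSONL fine-tuning dataset.
import json

SYSTEM_PROMPT = "You are an assistant specialized in ontology engineering."  # assumed prompt

# (question, answer) pairs; in the paper these are generated per chapter
# segment, with diagrams rewritten as description logic. Examples are invented.
qa_pairs = [
    ("What is an ontology in knowledge engineering?",
     "A formal, explicit specification of a shared conceptualization of a domain."),
    ("Express 'every SAR mission involves at least one rescue team' in description logic.",
     "SARMission ⊑ ∃involves.RescueTeam"),
]

with open("oe_finetune.jsonl", "w", encoding="utf-8") as f:
    for question, answer in qa_pairs:
        record = {
            "messages": [
                {"role": "system", "content": SYSTEM_PROMPT},
                {"role": "user", "content": question},
                {"role": "assistant", "content": answer},
            ]
        }
        # JSONL: one JSON object per line.
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
```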
Pages: 34