CurieLM: Enhancing Large Language Models for Nuclear Domain Applications

被引:0
|
作者
Bouhoun, Zakaria [1 ]
Allah, Ahmed [1 ]
Cocci, Riccardo [1 ]
Assaad, Mohamad Ali [1 ]
Plancon, Alexandra [1 ]
Godest, Frederic [1 ]
Kondratenko, Kirill [1 ]
Rodriguez, Julien [1 ]
Vitillo, Francesco [1 ]
Malhomme, Olivier [1 ]
Bechet, Lies Benmiloud [1 ]
Plana, Robert [1 ]
机构
[1] Assyst EOS, F-92400 Courbevoie, France
关键词
D O I
10.1051/epjconf/202430217006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Large Language Models (LLMs), such as the Mistral model, have exhibited remarkable performance across diverse tasks. However, their efficacy in nuclear applications remains constrained by a lack of domain-specific knowledge and an inability to effectively leverage that knowledge. Nuclear-related tasks, including safety assessments and requirement analyses, pose unique challenges due to the intricate domain expertise and diverse constraints involved. To address these limitations, we introduce CurieLM, an LLM specifically tailored for the nuclear domain. CurieLM builds upon the Mistral model, enhancing its capabilities through domain-specific fine-tuning. Our team of nuclear engineers overcame the initial hurdle of accessing high-quality nuclear data, enabling CurieLM to comprehend and accurately respond to nuclear-specific instructions. This manuscript outlines the development and optimization process of CurieLM, marking a significant step toward enhancing nuclear-related natural language processing tasks. Experimental results demonstrate a 13% performance improvement over base LLMs, underscoring the effectiveness of our approach. Domain-specific LLMs like CurieLM hold a great potential across various applications, and this study sets the stage for further exploration in this emerging field.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] Large language models and their applications in bioinformatics
    Sarumi, Oluwafemi A.
    Heider, Dominik
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 3498 - 3505
  • [12] Applications of large language models in oncology
    Loefflert, Chiara M.
    Bressem, Keno K.
    Truhn, Daniel
    ONKOLOGIE, 2024, 30 (05): : 388 - 393
  • [13] Quantifying Domain Knowledge in Large Language Models
    Sayenju, Sudhashree
    Aygun, Ramazan
    Franks, Bill
    Johnston, Sereres
    Lee, George
    Choi, Hansook
    Modgil, Girish
    2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 193 - 194
  • [14] Enhancing Biomedical Question Answering with Large Language Models
    Yang, Hua
    Li, Shilong
    Goncalves, Teresa
    INFORMATION, 2024, 15 (08)
  • [15] PharmaBench: Enhancing ADMET benchmarks with large language models
    Niu, Zhangming
    Xiao, Xianglu
    Wu, Wenfan
    Cai, Qiwei
    Jiang, Yinghui
    Jin, Wangzhen
    Wang, Minhao
    Yang, Guojian
    Kong, Lingkang
    Jin, Xurui
    Yang, Guang
    Chen, Hongming
    SCIENTIFIC DATA, 2024, 11 (01)
  • [16] Large language models: a primer and gastroenterology applications
    Shahab, Omer
    El Kurdi, Bara
    Shaukat, Aasma
    Nadkarni, Girish
    Soroush, Ali
    THERAPEUTIC ADVANCES IN GASTROENTEROLOGY, 2024, 17
  • [17] Looking to Future Applications of Large Language Models
    Liu, Xichong
    Rubin, Samuel J. S.
    Rogalla, Stephan
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2023, 118 (12): : 2306 - 2306
  • [18] Limitations of large language models in medical applications
    Deng, Jiawen
    Zubair, Areeba
    Park, Ye-Jean
    POSTGRADUATE MEDICAL JOURNAL, 2023, 99 (1178) : 1298 - 1299
  • [19] Large Language Models in Healthcare and Medical Domain: A Review
    Nazi, Zabir Al
    Peng, Wei
    INFORMATICS-BASEL, 2024, 11 (03):
  • [20] Enhancing Large Language Models with RAG for Visual Language Navigation in Continuous Environments
    Bao, Xiaoan
    Lv, Zhiqiang
    Wu, Biao
    ELECTRONICS, 2025, 14 (05):