A qualitative assessment of using ChatGPT as large language model for scientific workflow development

Cited by: 1
Authors
Saenger, Mario [1]
De Mecquenem, Ninon [1]
Lewinska, Katarzyna Ewa [2, 3]
Bountris, Vasilis [1]
Lehmann, Fabian [1]
Leser, Ulf [1]
Kosch, Thomas [1]
Affiliations
[1] Humboldt Univ, Dept Comp Sci, D-10099 Berlin, Germany
[2] Humboldt Univ, Dept Geog, D-10099 Berlin, Germany
[3] Univ Wisconsin Madison, Dept Forest & Wildlife Ecol, Madison, WI 53706 USA
Source
GIGASCIENCE | 2024 / Vol. 13
Keywords
large language models; scientific workflows; user support; ChatGPT; END-USER DEVELOPMENT; GENERATION; ALIGNMENT; FUTURE
DOI
10.1093/gigascience/giae030
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Background: Scientific workflow systems are increasingly popular for expressing and executing complex data analysis pipelines over large datasets, as they offer reproducibility, dependability, and scalability of analyses by automatic parallelization on large compute clusters. However, implementing workflows is difficult due to the involvement of many black-box tools and the deep infrastructure stack necessary for their execution. Simultaneously, user-supporting tools are rare, and the number of available examples is much lower than in classical programming languages.
Results: To address these challenges, we investigate the efficiency of large language models (LLMs), specifically ChatGPT, to support users when dealing with scientific workflows. We performed 3 user studies in 2 scientific domains to evaluate ChatGPT for comprehending, adapting, and extending workflows. Our results indicate that LLMs efficiently interpret workflows but achieve lower performance for exchanging components or purposeful workflow extensions. We characterize their limitations in these challenging scenarios and suggest future research directions.
Conclusions: Our results show a high accuracy for comprehending and explaining scientific workflows while achieving a reduced performance for modifying and extending workflow descriptions. These findings clearly illustrate the need for further research in this area.
Pages: 19