A qualitative assessment of using ChatGPT as large language model for scientific workflow development

被引:1
|
作者
Saenger, Mario [1 ]
De Mecquenem, Ninon [1 ]
Lewinska, Katarzyna Ewa [2 ,3 ]
Bountris, Vasilis [1 ]
Lehmann, Fabian [1 ]
Leser, Ulf [1 ]
Kosch, Thomas [1 ]
机构
[1] Humboldt Univ, Dept Comp Sci, D-10099 Berlin, Germany
[2] Humboldt Univ, Dept Geog, D-10099 Berlin, Germany
[3] Univ Wisconsin Madison, Dept Forest & Wildlife Ecol, Madison, WI 53706 USA
来源
GIGASCIENCE | 2024年 / 13卷
关键词
large language models; scientific workflows; user support; ChatGPT; END-USER DEVELOPMENT; GENERATION; ALIGNMENT; FUTURE;
D O I
10.1093/gigascience/giae030
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Scientific workflow systems are increasingly popular for expressing and executing complex data analysis pipelines over large datasets, as they offer reproducibility, dependability, and scalability of analyses by automatic parallelization on large compute clusters. However, implementing workflows is difficult due to the involvement of many black-box tools and the deep infrastructure stack necessary for their execution. Simultaneously, user-supporting tools are rare, and the number of available examples is much lower than in classical programming languages.Results To address these challenges, we investigate the efficiency of large language models (LLMs), specifically ChatGPT, to support users when dealing with scientific workflows. We performed 3 user studies in 2 scientific domains to evaluate ChatGPT for comprehending, adapting, and extending workflows. Our results indicate that LLMs efficiently interpret workflows but achieve lower performance for exchanging components or purposeful workflow extensions. We characterize their limitations in these challenging scenarios and suggest future research directions.Conclusions Our results show a high accuracy for comprehending and explaining scientific workflows while achieving a reduced performance for modifying and extending workflow descriptions. These findings clearly illustrate the need for further research in this area.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Large language model, AI and scientific research: why ChatGPT is only the beginning
    Zangrossi, Pietro
    Martini, Massimo
    Guerrini, Francesco
    De Bonis, Pasquale
    Spena, Giannantonio
    JOURNAL OF NEUROSURGICAL SCIENCES, 2024, 68 (02) : 216 - 224
  • [2] ChatGPT for Education Research: Exploring the Potential of Large Language Models for Qualitative Codebook Development
    Barany, Amanda
    Nasiar, Nidhi
    Porter, Chelsea
    Zambrano, Andres Felipe
    Andres, Alexandra L.
    Bright, Dara
    Shah, Mamta
    Liu, Xiner
    Gao, Sabrina
    Zhang, Jiayi
    Mehta, Shruti
    Choi, Jaeyoon
    Giordano, Camille
    Baker, Ryan S.
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT II, AIED 2024, 2024, 14830 : 134 - 149
  • [3] Using ChatGPT for language editing in scientific articles
    Kim, Seong-Gon
    MAXILLOFACIAL PLASTIC AND RECONSTRUCTIVE SURGERY, 2023, 45 (01)
  • [4] Using ChatGPT for language editing in scientific articles
    Seong-Gon Kim
    Maxillofacial Plastic and Reconstructive Surgery, 45
  • [5] Automatic building energy model development and debugging using large language models agentic workflow
    Zhang, Liang
    Ford, Vitaly
    Chen, Zhelun
    Chen, Jianli
    ENERGY AND BUILDINGS, 2025, 327
  • [6] ChatGPT and large language model (LLM) chatbots: Correspondence
    Kleebayoon, Amnuay
    Wiwanitkit, Viroj
    JOURNAL OF PEDIATRIC UROLOGY, 2023, 19 (05) : 605 - 606
  • [7] Plan, Generate and Match: Scientific Workflow Recommendation with Large Language Models
    Gu, Yang
    Cao, Jian
    Guo, Yuan
    Qian, Shiyou
    Guan, Wei
    SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I, 2023, 14419 : 86 - 102
  • [8] Flexible Partitioning of Scientific Workflows Using the JX Workflow Language
    Shaffer, Tim
    Kremer-Herman, Nathaniel
    Thain, Douglas
    PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [9] ChatGPT: Transcending Language Limitations in Scientific Research Using Artificial Intelligence
    Osama, Muhammad
    Afridi, Sabah
    Maaz, Muhammad
    JCPSP-JOURNAL OF THE COLLEGE OF PHYSICIANS AND SURGEONS PAKISTAN, 2023, 33 (10): : 1198 - 1200
  • [10] Using Large Language Models to Document Code: A First Quantitative and Qualitative Assessment
    Guelman, Ian
    Leal, Arthur Gregório
    Xavier, Laerte
    Valente, Marco Tulio
    arXiv,