Using GPT-4 to guide causal machine learning

被引:0
|
作者
Constantinou, Anthony C. [1 ]
Kitson, Neville K. [1 ]
Zanga, Alessio [1 ,2 ,3 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Bayesian AI Res Lab, Machine Intelligence & Decis Syst MInDS Res Grp, London E1 4NS, England
[2] Univ Milano Bicocca, Dept Informat Syst & Commun, Models & Algorithms Data & Text Min Lab MADLab, Milan, Italy
[3] F Hoffmann La Roche Ltd, Data Sci & Adv Analyt, Basel, Switzerland
关键词
Bayesian networks; Causal discovery; ChatGPT; Directed acyclic graphs; Knowledge graphs; LLMs; Structure learning;
D O I
10.1016/j.eswa.2024.126120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since its introduction to the public, ChatGPT has had an unprecedented impact. While some experts praised AI advancements and highlighted their potential risks, others have been critical about the accuracy and usefulness of Large Language Models (LLMs). In this paper, we are interested in the ability of LLMs to identify causal relationships. We focus on the well-established GPT-4 (Turbo) and evaluate its performance under the most restrictive conditions, by isolating its ability to infer causal relationships based solely on the variable labels without being given any other context by humans, demonstrating the minimum level of effectiveness one can expect when it is provided with label-only information. We show that questionnaire participants judge the GPT-4 graphs as the most accurate in the evaluated categories, closely followed by knowledge graphs constructed by domain experts, with causal Machine Learning (ML) far behind. We use these results to highlight the important limitation of causal ML, which often produces causal graphs that violate common sense, affecting trust in them. However, we show that pairing GPT-4 with causal ML overcomes this limitation, resulting in graphical structures learnt from real data that align more closely with those identified by domain experts, compared to structures learnt by causal ML alone. Overall, our findings suggest that despite GPT-4 not being explicitly designed to reason causally, it can still be a valuable tool for causal representation, as it improves the causal discovery process of causal ML algorithms that are designed to do just that.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Using GPT-4 as a guide during inquiry-based learning
    Steinert, Steffen
    Avila, Karina E.
    Kuhn, Jochen
    Kuechemann, Stefan
    PHYSICS TEACHER, 2024, 62 (07): : 618 - 619
  • [2] Will ChatGPT/GPT-4 be a Lighthouse to Guide Spinal Surgeons?
    Yongbin He
    Haifeng Tang
    Dongxue Wang
    Shuqin Gu
    Guoxin Ni
    Haiyang Wu
    Annals of Biomedical Engineering, 2023, 51 : 1362 - 1365
  • [3] Will ChatGPT/GPT-4 be a Lighthouse to Guide Spinal Surgeons?
    He, Yongbin
    Tang, Haifeng
    Wang, Dongxue
    Gu, Shuqin
    Ni, Guoxin
    Wu, Haiyang
    ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (07) : 1362 - 1365
  • [4] Automated Financial Analysis Using GPT-4
    Noels, Sander
    Merlevede, Adriaan
    Fecheyr, Andrew
    Vanhalst, Maarten
    Meerlaen, Nick
    Viaene, Sebastien
    De Bie, Tijl
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VII, 2023, 14175 : 345 - 349
  • [5] Using GPT-4 to Generate Failure Logic
    Clegg, Kester
    Habli, Ibrahim
    McDermid, John
    COMPUTER SAFETY, RELIABILITY, AND SECURITY. SAFECOMP 2024 WORKSHOPS, 2024, 14989 : 148 - 159
  • [6] Uncovering the semantics of concepts using GPT-4
    Le Mens, Gael
    Kovacs, Balazs
    Hannan, Michael T.
    Pros, Guillem
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (49)
  • [7] The potential of GPT-4 advanced data analysis for radiomics-based machine learning models
    Foltyn-Dumitru, Martha
    Rastogi, Aditya
    Cho, Jaeyoung
    Schell, Marianne
    Mahmutoglu, Mustafa Ahmed
    Kessler, Tobias
    Sahm, Felix
    Wick, Wolfgang
    Bendszus, Martin
    Brugnara, Gianluca
    Vollmuth, Philipp
    NEURO-ONCOLOGY ADVANCES, 2025, 7 (01)
  • [8] Is GPT-4 a reliable rater? Evaluating consistency in GPT-4's text ratings
    Hackl, Veronika
    Mueller, Alexandra Elena
    Granitzer, Michael
    Sailer, Maximilian
    FRONTIERS IN EDUCATION, 2023, 8
  • [9] Generative Artificial Intelligence GPT-4 Accelerates Knowledge Mining and Machine Learning for Synthetic Biology
    Xiao, Zhengyang
    Li, Wenyu
    Moon, Hannah
    Roell, Garrett W.
    Chen, Yixin
    Tang, Yinjie J.
    ACS SYNTHETIC BIOLOGY, 2023, 12 (10): : 2973 - 2982
  • [10] GPT-4 as a biomedical simulator
    Schaefer M.
    Reichl S.
    ter Horst R.
    Nicolas A.M.
    Krausgruber T.
    Piras F.
    Stepper P.
    Bock C.
    Samwald M.
    Computers in Biology and Medicine, 2024, 178