Can large language models effectively reason about adverse weather conditions?

被引:0
|
作者
Zafarmomen, Nima [1 ]
Samadi, Vidya [1 ,2 ]
机构
[1] Clemson Univ, Dept Agr Sci, Clemson, SC 29634 USA
[2] Clemson Univ, Artificial Intelligence Res Inst Sci & Engn AIRISE, Sch Comp, Clemson, SC USA
基金
美国国家科学基金会;
关键词
Large language model; Text classification; LLaMA; BART; BERT; Adverse weather conditions;
D O I
10.1016/j.envsoft.2025.106421
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper seeks to answer the question "can Large Language Models (LLMs) effectively reason about adverse weather conditions?". To address this question, we utilized multiple LLMs to harness the US National Weather Service (NWS) flood report data spanning from June 2005 to September 2024. Bidirectional and Auto-Regressive Transformer (BART), Bidirectional Encoder Representations from Transformers (BERT), Large Language Model Meta AI (LLaMA-2), LLaMA-3, and LLaMA-3.1 were employed to categorize data based on predefined labels. The methodology was implemented in Charleston County, South Carolina, USA. Extreme events were unevenly distributed across the training period with the "Cyclonic" category exhibiting significantly fewer instances compared to the "Flood" and "Thunderstorm" categories. Analysis suggests that the LLaMA-3 reached its peak performance at 60% of the dataset size while other LLMs achieved peak performance at approximately 80-100% of the dataset size. This study provided deep insights into the application of LLMs in reasoning adverse weather conditions.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Can large language models replace standardised patients?
    Han, Weipeng
    Lyu, Xiaohong
    Yang, Ji-Jiang
    Yan, Mengsha
    Zhang, Yuelun
    Wang, Tingyan
    Pan, Hui
    Chen, Shi
    Zhu, Jiming
    Huang, Xiaoming
    MEDICAL EDUCATION, 2025,
  • [22] Can Large Language Models Be an Alternative to Human Evaluation?
    Chiang, Cheng-Han
    Lee, Hung-yi
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15607 - 15631
  • [23] CAN LARGE LANGUAGE MODELS GENERATE CONCEPTUAL HEALTH ECONOMIC MODELS?
    Chhatwal, J.
    Yildirim, I
    Balta, D.
    Ermis, T.
    Tenkin, S.
    Samur, S.
    Ayer, T.
    VALUE IN HEALTH, 2024, 27 (06) : S123 - S123
  • [24] Large language models can effectively extract stroke audit data from medical free-text discharge summaries
    Goh, Rudy
    Kleinig, Timothy
    Jannes, Jim
    Vallat, Wilson
    Moey, Andrew
    Kimberly, Taylor
    Bacchi, Stephen
    CEREBROVASCULAR DISEASES, 2024, 53 : 49 - 49
  • [25] Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model
    BehnamGhader, Parishad
    Miret, Santiago
    Reddy, Siva
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15492 - 15509
  • [26] EureCar Turbo: a Self-Driving Car that can Handle Adverse Weather Conditions
    Lee, Unghui
    Jung, Jiwon
    Shin, Seunghak
    Jeong, Yongseop
    Park, Kibaek
    Shim, David Hyunchul
    Kweon, In-so
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2301 - 2306
  • [27] Deformation prediction of large-span spatial structure considering adverse weather conditions
    Wang, Jin-Tao
    Liu, Yu-Fei
    Zhou, Meng
    Nie, Jian-Guo
    Chen, Xuan
    STRUCTURES, 2023, 50 : 508 - 523
  • [28] Restoring Vision in Adverse Weather Conditions With Patch-Based Denoising Diffusion Models
    Oezdenizci, Ozan
    Legenstein, Robert
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 10346 - 10357
  • [29] Can Large Language Models Mine Interpretable Financial Factors More Effectively? A Neural-Symbolic Factor Mining Agent Model
    Li, Zhiwei
    Song, Ran
    Sun, Caihong
    Xu, Wei
    Yu, Zhengtao
    Wen, Ji-Rong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 3891 - 3902
  • [30] Large language models can effectively extract stroke and reperfusion audit data from medical free-text discharge summaries
    Goh, Rudy
    Cook, Benjamin
    Stretton, Brandon
    Booth, Andrew E. C.
    Satheakeerthy, Shrirajh
    Howson, Sarah
    Kovoor, Joshua
    Gupta, Aashray
    Tan, Sheryn
    Kimberly, W. Taylor
    Moey, Andrew
    Vallat, Wilson
    Maddison, John
    Marks, Jarrod
    Gluck, Samuel
    Gilbert, Toby
    Jannes, Jim
    Kleinig, Timothy
    Bacchi, Stephen
    JOURNAL OF CLINICAL NEUROSCIENCE, 2024, 129