OceanGPT: A Large Language Model for Ocean Science Tasks

Cited by: 0
Authors
Bi, Zhen [1 ,2 ,5 ,6 ]
Zhang, Ningyu [1 ,2 ,5 ]
Xue, Yida [1 ]
Ou, Yixin [1 ]
Ji, Daxiong [2 ,3 ]
Zheng, Guozhou [2 ,4 ]
Chen, Huajun [1 ,2 ]
Affiliations
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Zhejiang Univ, Donghai Lab, Hangzhou, Peoples R China
[3] Zhejiang Univ, Ocean Coll, Hangzhou, Peoples R China
[4] Zhoushan Zhejiang Univ, Ocean Res Ctr, Hangzhou, Peoples R China
[5] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China
[6] Huzhou Univ, Huzhou, Peoples R China
Source
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS | 2024
Funding
National Natural Science Foundation of China
DOI
Not available
Abstract
Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed the paradigm in science. Despite their success in other domains, current LLMs often fall short of the needs of domain experts such as oceanographers, and the potential of LLMs for ocean science is under-explored. The intrinsic reasons are the immense and intricate nature of ocean data and the need for finer granularity and richer knowledge. To alleviate these issues, we introduce OCEANGPT, the first-ever large language model in the ocean domain, which is expert in various ocean science tasks. We also propose DOINSTRUCT, a novel framework that automatically obtains a large volume of ocean domain instruction data by generating instructions through multi-agent collaboration. Additionally, we construct the first oceanography benchmark, OCEANBENCH, to evaluate the capabilities of LLMs in the ocean domain. Through comprehensive experiments, OCEANGPT not only shows a higher level of knowledge expertise for ocean science tasks but also gains preliminary embodied intelligence capabilities in ocean technology.
Pages: 3357-3372
Page count: 16