OceanGPT: A Large Language Model for Ocean Science Tasks

被引:0
|
作者
Bi, Zhen [1 ,2 ,5 ,6 ]
Zhang, Ningyu [1 ,2 ,5 ]
Xue, Yida [1 ]
Ou, Yixin [1 ]
Ji, Daxiong [2 ,3 ]
Zheng, Guozhou [2 ,4 ]
Chen, Huajun [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Zhejiang Univ, Donghai Lab, Hangzhou, Peoples R China
[3] Zhejiang Univ, Ocean Coll, Hangzhou, Peoples R China
[4] Zhoushan Zhejiang Univ, Ocean Res Ctr, Hangzhou, Peoples R China
[5] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China
[6] Huzhou Univ, Huzhou, Peoples R China
来源
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS | 2024年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed the paradigm in science. Despite the success in other domains, current LLMs often fall short in catering to the needs of domain experts like oceanographers, and the potential of LLMs for ocean science is under-explored. The intrinsic reasons are the immense and intricate nature of ocean data as well as the necessity for higher granularity and richness in knowledge. To alleviate these issues, we introduce OCEANGPT, the first-ever large language model in the ocean domain, which is expert in various ocean science tasks. We also propose DOINSTRUCT, a novel framework to automatically obtain a large volume of ocean domain instruction data, which generates instructions based on multi-agent collaboration. Additionally, we construct the first oceanography benchmark, OCEANBENCH, to evaluate the capabilities of LLMs in the ocean domain. Though comprehensive experiments, OCEANGPT not only shows a higher level of knowledge expertise for oceans science tasks but also gains preliminary embodied intelligence capabilities in ocean technology.
引用
收藏
页码:3357 / 3372
页数:16
相关论文
共 50 条
  • [31] Challenges in applying large language models to requirements engineering tasks
    Norheim, Johannes J.
    Rebentisch, Eric
    Xiao, Dekai
    Draeger, Lorenz
    Kerbrat, Alain
    de Weck, Olivier L.
    DESIGN SCIENCE, 2024, 10
  • [32] BB-GeoGPT: A framework for learning a large language model for geographic information science
    Zhang, Yifan
    Wang, Zhiyun
    He, Zhengting
    Li, Jingxuan
    Mai, Gengchen
    Lin, Jianfeng
    Wei, Cheng
    Yu, Wenhao
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
  • [33] The Large Language Model GreekLegalRoBERTa
    Saketos, Vasileios
    Pantazi, Despina-Athanasia
    Koubarakis, Manolis
    PROCEEDINGS OF THE 13TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE, SETN 2024, 2024,
  • [34] (sic) Pengi: An Audio Language Model for Audio Tasks
    Deshmukh, Soham
    Elizalde, Benjamin
    Singh, Rita
    Wang, Huaming
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [35] Large language model in electrocatalysis
    Zhang, Chengyi
    Wang, Xingyu
    Wang, Ziyun
    CHINESE JOURNAL OF CATALYSIS, 2024, 59 : 7 - 14
  • [36] Comparative Analysis of Single and Multiagent Large Language Model Architectures for Domain- Specific Tasks in Well Construction
    Sabbagh, V. B.
    Lima, C. B. C.
    Xexeo, G.
    SPE JOURNAL, 2024, 29 (12): : 6869 - 6882
  • [37] Adopting Pre-trained Large Language Models for Regional Language Tasks: A Case Study
    Gaikwad, Harsha
    Kiwelekar, Arvind
    Laddha, Manjushree
    Shahare, Shashank
    INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT I, 2024, 14531 : 15 - 25
  • [38] Large Language Models Can Accomplish Business Process Management Tasks
    Grohs, Michael
    Abb, Luka
    Elsayed, Nourhan
    Rehse, Jana-Rebecca
    BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2023, 2024, 492 : 453 - 465
  • [39] Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks
    Mishra, Aditi
    Rahman, Sajjadur
    Mitra, Kushan
    Kim, Hannah
    Hruschka, Estevam
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 8117 - 8139
  • [40] Large Language Models for Data Extraction in Slot-Filling Tasks
    Bazan, Marek
    Gniazdowski, Tomasz
    Wolkiewicz, Dawid
    Sarna, Juliusz
    Marchwiany, Maciej E.
    SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 1 - 18