OceanGPT: A Large Language Model for Ocean Science Tasks

被引：0

作者：

Bi, Zhen ^{[1
,2
,5
,6
]}

Zhang, Ningyu ^{[1
,2
,5
]}

Xue, Yida ^{[1
]}

Ou, Yixin ^{[1
]}

Ji, Daxiong ^{[2
,3
]}

Zheng, Guozhou ^{[2
,4
]}

Chen, Huajun ^{[1
,2
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China

[2] Zhejiang Univ, Donghai Lab, Hangzhou, Peoples R China

[3] Zhejiang Univ, Ocean Coll, Hangzhou, Peoples R China

[4] Zhoushan Zhejiang Univ, Ocean Res Ctr, Hangzhou, Peoples R China

[5] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China

[6] Huzhou Univ, Huzhou, Peoples R China

来源：

PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS | 2024年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed the paradigm in science. Despite the success in other domains, current LLMs often fall short in catering to the needs of domain experts like oceanographers, and the potential of LLMs for ocean science is under-explored. The intrinsic reasons are the immense and intricate nature of ocean data as well as the necessity for higher granularity and richness in knowledge. To alleviate these issues, we introduce OCEANGPT, the first-ever large language model in the ocean domain, which is expert in various ocean science tasks. We also propose DOINSTRUCT, a novel framework to automatically obtain a large volume of ocean domain instruction data, which generates instructions based on multi-agent collaboration. Additionally, we construct the first oceanography benchmark, OCEANBENCH, to evaluate the capabilities of LLMs in the ocean domain. Though comprehensive experiments, OCEANGPT not only shows a higher level of knowledge expertise for oceans science tasks but also gains preliminary embodied intelligence capabilities in ocean technology.

引用

页码：3357 / 3372

页数：16

共 50 条

[21] Science in the age of large language models
Birhane, Abeba
Kasirzadeh, Atoosa
Leslie, David
Wachter, Sandra
NATURE REVIEWS PHYSICS, 2023, 5 (05) : 277 - 280
[22] Large language models for science and medicine
Telenti, Amalio
Auli, Michael
Hie, Brian L.
Maher, Cyrus
Saria, Suchi
Ioannidis, John P. A.
EUROPEAN JOURNAL OF CLINICAL INVESTIGATION, 2024, 54 (06)
[23] Science in the age of large language models
Abeba Birhane
Atoosa Kasirzadeh
David Leslie
Sandra Wachter
Nature Reviews Physics, 2023, 5 (5) : 277 - 280
[24] Language Learning through Tasks in a Content and Language Integrated Learning (CLIL) Science Classroom
Escobar Urmeneta, Cristina
Sanchez Sola, Antonio
PORTA LINGUARUM, 2009, (11) : 65 - 83
[25] Large Language Models in der WissenschaftLarge language models in science
Karl-Friedrich Kowalewski
Severin Rodler
Die Urologie, 2024, 63 (9) : 860 - 866
[26] VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Wang, Wenhai
Chen, Zhe
Chen, Xiaokang
Wu, Jiannan
Zhu, Xizhou
Zeng, Gang
Luo, Ping
Lu, Tong
Zhou, Jie
Qiao, Yu
Dai, Jifeng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[27] Towards an understanding of large language models in software engineering tasks
Zheng, Zibin
Ning, Kaiwen
Zhong, Qingyuan
Chen, Jiachi
Chen, Wenqing
Guo, Lianghong
Wang, Weicheng
Wang, Yanlin
EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (02)
[28] Reasoning with Large Language Models on Graph Tasks: The Influence of Temperature
Wang, Yiming
Zhang, Ziyang
Chen, Hanwei
Shen, Huayi
2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 630 - 634
[29] Multimodal large language models for inclusive collaboration learning tasks
Lewis, Armanda
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 202 - 210
[30] Evaluation of Pretrained Large Language Models in Embodied Planning Tasks
Sarkisyan, Christina
Korchemnyi, Alexandr
Kovalev, Alexey K.
Panov, Aleksandr, I
ARTIFICIAL GENERAL INTELLIGENCE, AGI 2023, 2023, 13921 : 222 - 232

← 1 2 3 4 5 →