OceanGPT: A Large Language Model for Ocean Science Tasks

被引：0

作者：

Bi, Zhen ^{[1
,2
,5
,6
]}

Zhang, Ningyu ^{[1
,2
,5
]}

Xue, Yida ^{[1
]}

Ou, Yixin ^{[1
]}

Ji, Daxiong ^{[2
,3
]}

Zheng, Guozhou ^{[2
,4
]}

Chen, Huajun ^{[1
,2
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China

[2] Zhejiang Univ, Donghai Lab, Hangzhou, Peoples R China

[3] Zhejiang Univ, Ocean Coll, Hangzhou, Peoples R China

[4] Zhoushan Zhejiang Univ, Ocean Res Ctr, Hangzhou, Peoples R China

[5] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China

[6] Huzhou Univ, Huzhou, Peoples R China

来源：

PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS | 2024年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed the paradigm in science. Despite the success in other domains, current LLMs often fall short in catering to the needs of domain experts like oceanographers, and the potential of LLMs for ocean science is under-explored. The intrinsic reasons are the immense and intricate nature of ocean data as well as the necessity for higher granularity and richness in knowledge. To alleviate these issues, we introduce OCEANGPT, the first-ever large language model in the ocean domain, which is expert in various ocean science tasks. We also propose DOINSTRUCT, a novel framework to automatically obtain a large volume of ocean domain instruction data, which generates instructions based on multi-agent collaboration. Additionally, we construct the first oceanography benchmark, OCEANBENCH, to evaluate the capabilities of LLMs in the ocean domain. Though comprehensive experiments, OCEANGPT not only shows a higher level of knowledge expertise for oceans science tasks but also gains preliminary embodied intelligence capabilities in ocean technology.

引用

页码：3357 / 3372

页数：16

共 50 条

[31] Challenges in applying large language models to requirements engineering tasks
Norheim, Johannes J.
Rebentisch, Eric
Xiao, Dekai
Draeger, Lorenz
Kerbrat, Alain
de Weck, Olivier L.
DESIGN SCIENCE, 2024, 10
[32] BB-GeoGPT: A framework for learning a large language model for geographic information science
Zhang, Yifan
Wang, Zhiyun
He, Zhengting
Li, Jingxuan
Mai, Gengchen
Lin, Jianfeng
Wei, Cheng
Yu, Wenhao
INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
[33] The Large Language Model GreekLegalRoBERTa
Saketos, Vasileios
Pantazi, Despina-Athanasia
Koubarakis, Manolis
PROCEEDINGS OF THE 13TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE, SETN 2024, 2024,
[34] (sic) Pengi: An Audio Language Model for Audio Tasks
Deshmukh, Soham
Elizalde, Benjamin
Singh, Rita
Wang, Huaming
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[35] Large language model in electrocatalysis
Zhang, Chengyi
Wang, Xingyu
Wang, Ziyun
CHINESE JOURNAL OF CATALYSIS, 2024, 59 : 7 - 14
[36] Comparative Analysis of Single and Multiagent Large Language Model Architectures for Domain- Specific Tasks in Well Construction
Sabbagh, V. B.
Lima, C. B. C.
Xexeo, G.
SPE JOURNAL, 2024, 29 (12): : 6869 - 6882
[37] Adopting Pre-trained Large Language Models for Regional Language Tasks: A Case Study
Gaikwad, Harsha
Kiwelekar, Arvind
Laddha, Manjushree
Shahare, Shashank
INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT I, 2024, 14531 : 15 - 25
[38] Large Language Models Can Accomplish Business Process Management Tasks
Grohs, Michael
Abb, Luka
Elsayed, Nourhan
Rehse, Jana-Rebecca
BUSINESS PROCESS MANAGEMENT WORKSHOPS, BPM 2023, 2024, 492 : 453 - 465
[39] Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks
Mishra, Aditi
Rahman, Sajjadur
Mitra, Kushan
Kim, Hannah
Hruschka, Estevam
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 8117 - 8139
[40] Large Language Models for Data Extraction in Slot-Filling Tasks
Bazan, Marek
Gniazdowski, Tomasz
Wolkiewicz, Dawid
Sarna, Juliusz
Marchwiany, Maciej E.
SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 1 - 18

← 1 2 3 4 5 →