Opening a conversation on responsible environmental data science in the age of large language models

被引:0
|
作者
Oliver, Ruth Y. [1 ]
Chapman, Melissa [2 ]
Emery, Nathan [3 ]
Gillespie, Lauren [4 ]
Gownaris, Natasha [5 ]
Leiker, Sophia [1 ]
Nisi, Anna C. [6 ]
Ayers, David [7 ]
Breckheimer, Ian [8 ]
Blondin, Hannah [9 ]
Hoffman, Ava [10 ]
Pagniello, Camille M. L. S. [11 ]
Raisle, Megan [12 ]
Zimmerman, Naupaka [12 ]
机构
[1] Univ Calif Santa Barbara, Bren Sch Environm Sci & Management, Santa Barbara, CA USA
[2] Univ Calif Santa Barbara, Natl Ctr Ecol Anal & Synth, Santa Barbara, CA USA
[3] Univ Calif Santa Barbara, Ctr Innovat Teaching Res & Learning, Santa Barbara, CA USA
[4] Stanford Univ, Dept Comp Sci, Palo Alto, CA USA
[5] Gettysburg Coll, Dept Environm Studies, Gettysburg, PA USA
[6] Univ Washington, Dept Biol, Ctr Ecosyst Sentinels, Seattle, WA USA
[7] Univ Calif Davis, Wildlife Fish & Conservat Biol Dept, Davis, CA USA
[8] Rocky Mt Biol Labs, Crested Butte, CO USA
[9] Univ Miami, Cooperat Inst Marine & Atmospher Studies CIMAS, Miami, FL USA
[10] Fred Hutchinson Canc Ctr, Data Sci Lab, Seattle, WA USA
[11] Univ Hawaii Manoa, Hawaii Inst Marine Biol, Kaneohe, HI USA
[12] Univ San Francisco, Dept Biol, San Francisco, CA USA
来源
关键词
bias; ChatGPT; data ethics; generative AI; pedagogy;
D O I
10.1017/eds.2024.12
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The general public and scientific community alike are abuzz over the release of ChatGPT and GPT-4. Among many concerns being raised about the emergence and widespread use of tools based on large language models (LLMs) is the potential for them to propagate biases and inequities. We hope to open a conversation within the environmental data science community to encourage the circumspect and responsible use of LLMs. Here, we pose a series of questions aimed at fostering discussion and initiating a larger dialogue. To improve literacy on these tools, we provide background information on the LLMs that underpin tools like ChatGPT. We identify key areas in research and teaching in environmental data science where these tools may be applied, and discuss limitations to their use and points of concern. We also discuss ethical considerations surrounding the use of LLMs to ensure that as environmental data scientists, researchers, and instructors, we can make well-considered and informed choices about engagement with these tools. Our goal is to spark forward-looking discussion and research on how as a community we can responsibly integrate generative AI technologies into our work. Impact Statement With the recent release of ChatGPT and similar tools based on large language models, there is considerable enthusiasm and substantial concern over how these tools should be used. We pose a series of questions aimed at unpacking important considerations in the responsible use of large language models within environmental data science.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Materials science in the era of large language models: a perspective
    Lei, Ge
    Docherty, Ronan
    Cooper, Samuel J.
    DIGITAL DISCOVERY, 2024, 3 (07): : 1257 - 1272
  • [22] Leveraging Cognitive Science for Testing Large Language Models
    Srinivasan, Ramya
    Inakoshi, Hiroya
    Uchino, Kanji
    2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 169 - 171
  • [23] Large language models for causal hypothesis generation in science
    Cohrs, Kai-Hendrik
    Diaz, Emiliano
    Sitokonstantinou, Vasileios
    Varando, Gherardo
    Camps-Valls, Gustau
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2025, 6 (01):
  • [24] Mining experimental data from materials science literature with large language models: an evaluation study
    Foppiano, Luca
    Lambard, Guillaume
    Amagasa, Toshiyuki
    Ishii, Masashi
    SCIENCE AND TECHNOLOGY OF ADVANCED MATERIALS-METHODS, 2024, 4 (01):
  • [25] Smart Science Needs Linked Open Data with a Dash of Large Language Models and Extended Relations
    Jamil, Hasan M.
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL WORKSHOP ON EXPLOITING ARTIFICIAL INTELLIGENCE TECHNIQUES FOR DATA MANAGEMENT, AIDM 2024, 2024,
  • [26] Demystifying Data Management for Large Language Models
    Miao, Xupeng
    Jia, Zhihao
    Cui, Bin
    COMPANION OF THE 2024 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, SIGMOD-COMPANION 2024, 2024, : 547 - 555
  • [27] OffensEval 2023: Offensive language identification in the age of Large Language Models
    Zampieri, Marcos
    Rosenthal, Sara
    Nakov, Preslav
    Dmonte, Alphaeus
    Ranasinghe, Tharindu
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1416 - 1435
  • [28] RESPONSIBLE SCIENCE - ON SCIENTIFIC ETHICS IN THE AGE OF BIG-SCIENCE AND LARGE-SCALE TECHNOLOGY
    LENK, H
    GEGENWARTSKUNDE GESELLSCHAFT STAAT ERZIEHUNG, 1989, 38 (02): : 181 - 191
  • [29] The long but necessary road to responsible use of large language models in healthcare research
    Kwong, Jethro C. C.
    Wang, Serena C. Y.
    Nickel, Grace C.
    Cacciamani, Giovanni E.
    Kvedar, Joseph C.
    NPJ DIGITAL MEDICINE, 2024, 7 (01):
  • [30] Large-scale environmental data science with ExaGeoStatR
    Abdulah, Sameh
    Li, Yuxiao
    Cao, Jian
    Ltaief, Hatem
    Keyes, David E.
    Genton, Marc G.
    Sun, Ying
    ENVIRONMETRICS, 2023, 34 (01)