Opening a conversation on responsible environmental data science in the age of large language models

Authors
Oliver, Ruth Y. [1]
Chapman, Melissa [2]
Emery, Nathan [3]
Gillespie, Lauren [4]
Gownaris, Natasha [5]
Leiker, Sophia [1]
Nisi, Anna C. [6]
Ayers, David [7]
Breckheimer, Ian [8]
Blondin, Hannah [9]
Hoffman, Ava [10]
Pagniello, Camille M. L. S. [11]
Raisle, Megan [12]
Zimmerman, Naupaka [12]
Affiliations
[1] Univ Calif Santa Barbara, Bren Sch Environm Sci & Management, Santa Barbara, CA USA
[2] Univ Calif Santa Barbara, Natl Ctr Ecol Anal & Synth, Santa Barbara, CA USA
[3] Univ Calif Santa Barbara, Ctr Innovat Teaching Res & Learning, Santa Barbara, CA USA
[4] Stanford Univ, Dept Comp Sci, Palo Alto, CA USA
[5] Gettysburg Coll, Dept Environm Studies, Gettysburg, PA USA
[6] Univ Washington, Dept Biol, Ctr Ecosyst Sentinels, Seattle, WA USA
[7] Univ Calif Davis, Wildlife Fish & Conservat Biol Dept, Davis, CA USA
[8] Rocky Mt Biol Labs, Crested Butte, CO USA
[9] Univ Miami, Cooperat Inst Marine & Atmospher Studies CIMAS, Miami, FL USA
[10] Fred Hutchinson Canc Ctr, Data Sci Lab, Seattle, WA USA
[11] Univ Hawaii Manoa, Hawaii Inst Marine Biol, Kaneohe, HI USA
[12] Univ San Francisco, Dept Biol, San Francisco, CA USA
Keywords
bias; ChatGPT; data ethics; generative AI; pedagogy
DOI
10.1017/eds.2024.12
Chinese Library Classification number
X [Environmental Science, Safety Science]
Subject classification code
08; 0830
Abstract
The general public and scientific community alike are abuzz over the release of ChatGPT and GPT-4. Among many concerns being raised about the emergence and widespread use of tools based on large language models (LLMs) is the potential for them to propagate biases and inequities. We hope to open a conversation within the environmental data science community to encourage the circumspect and responsible use of LLMs. Here, we pose a series of questions aimed at fostering discussion and initiating a larger dialogue. To improve literacy on these tools, we provide background information on the LLMs that underpin tools like ChatGPT. We identify key areas in research and teaching in environmental data science where these tools may be applied, and discuss limitations to their use and points of concern. We also discuss ethical considerations surrounding the use of LLMs to ensure that as environmental data scientists, researchers, and instructors, we can make well-considered and informed choices about engagement with these tools. Our goal is to spark forward-looking discussion and research on how as a community we can responsibly integrate generative AI technologies into our work.

Impact Statement
With the recent release of ChatGPT and similar tools based on large language models, there is considerable enthusiasm and substantial concern over how these tools should be used. We pose a series of questions aimed at unpacking important considerations in the responsible use of large language models within environmental data science.
Pages: 17