Action Guidance and AI Alignment

Author
Robinson, Pamela [1]
Affiliation
[1] Australian National University, School of Philosophy, Canberra, ACT, Australia
Keywords
Value alignment; AI safety; Artificial intelligence; Machine ethics; Abilities; Action guidance
DOI
10.1145/3600211.3604714
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
I offer a preliminary conceptual framework for evaluating AI alignment projects. It is based on the concept of action guidance. In §1 and §2, I explain action guidance and its importance to AI alignment. I introduce the 'Guidance Framework' in §3. In §4, I show how it can be applied to two different sorts of questions: the practical question of how to design a specific AI agent (my example is a fictional ocean-cleaning robot), and the theoretical question of how to evaluate a specific AI alignment proposal (my example is Stuart Russell's 'binary approach'). In §5, I discuss limitations of the framework and opportunities for further research.
Pages: 387-395
Page count: 9