Generating Deontic Obligations From Utility-Maximizing Systems

被引：0

作者：

Shea-Blymyer, Colin ^{[1
]}

Abbas, Houssam ^{[1
]}

机构：

[1] Oregon State Univ, Corvallis, OR 97331 USA

来源：

PROCEEDINGS OF THE 2022 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2022 | 2022年

关键词：

Machine ethics; normative systems; deontic logic; model checking; explainability;

D O I：

10.1145/3514094.3534163

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work gives a logical characterization of the (ethical and social) obligations of an agent trained with Reinforcement Learning (RL). An RL agent takes actions by following a utility-maximizing policy. We maintain that the choice of utility function embeds ethical and social values implicitly, and that it is necessary to make these values explicit. This work provides a basis for doing so. First, we propose a probabilistic deontic logic that is suited for formally specifying the obligations of a stochastic system, including its ethical obligations. We prove some useful validities about this logic, and how its semantics are compatible with those of Markov Decision Processes (MDPs). Second, we show that model checking allows us to prove that an agent has a given obligation to bring about some state of affairs - meaning that by acting optimally, it is seeking to reach that state of affairs. We develop a model checker for our logic against MDPs. Third, we observe that it is useful for a system designer to obtain a logical characterization of her system's obligations, which is potentially more interpretable and helpful in debugging than the expression of a utility function. Enumerating all the obligations of an agent is impractical, so we propose a Bayesian optimization routine that learns to generate a system's obligations that the system designer deems interesting. We implement the model checking and Bayesian optimization routines, and demonstrate their effectiveness with an initial pilot study. This work provides a rigorous method to characterize utility-maximizing agents in terms of the (ethical and social) obligations that they implicitly seek to satisfy.

引用

页码：653 / 663

页数：11

共 50 条

[1] Utility-maximizing Server Selection
Truong Khoa Phan
Griffin, David
Maini, Elisa
Rio, Miguel
[J]. 2016 IFIP NETWORKING CONFERENCE (IFIP NETWORKING) AND WORKSHOPS, 2016, : 413 - 421
[2] UNIVERSALLY UTILITY-MAXIMIZING PRIVACY MECHANISMS
Ghosh, Arpita
Roughgarden, Tim
Sundararajan, Mukund
[J]. SIAM JOURNAL ON COMPUTING, 2012, 41 (06) : 1673 - 1693
[3] Utility-Maximizing Task Scheduling for Partially Observable Multiagent Systems
Ji, Qi-jin
Yang, Zhe
Zhu, Yan-qin
[J]. MECHANICAL ENGINEERING AND TECHNOLOGY, 2012, 125 : 387 - 394
[4] STAGGERED PRICES IN A UTILITY-MAXIMIZING FRAMEWORK
CALVO, GA
[J]. JOURNAL OF MONETARY ECONOMICS, 1983, 12 (03) : 383 - 398
[5] Universally Utility-Maximizing Privacy Mechanisms
Ghosh, Arpita
Roughgarden, Tim
Sundararajan, Mukund
[J]. STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 351 - 359
[6] Causal Feature Learning for Utility-Maximizing Agents
Kinney, David
Watson, David
[J]. INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 : 257 - 268
[7] OPTIMAL-CONTRACTS WITH A UTILITY-MAXIMIZING AUDITOR
BAIMAN, S
EVANS, JH
NOEL, J
[J]. JOURNAL OF ACCOUNTING RESEARCH, 1987, 25 (02) : 217 - 244
[8] Model selection in utility-maximizing binary prediction
Su, Jiun-Hua
[J]. JOURNAL OF ECONOMETRICS, 2021, 223 (01) : 96 - 124
[9] A UTILITY-MAXIMIZING MECHANISM FOR VICARIOUS REWARD - COMMENTS
AINSLIE, G
[J]. RATIONALITY AND SOCIETY, 1995, 7 (04) : 393 - 403
[10] The utility-maximizing self-employed physician
Thornton, J
Eakin, BK
[J]. JOURNAL OF HUMAN RESOURCES, 1997, 32 (01) : 98 - 128

← 1 2 3 4 5 →