Generating Deontic Obligations From Utility-Maximizing Systems

被引:0
|
作者
Shea-Blymyer, Colin [1 ]
Abbas, Houssam [1 ]
机构
[1] Oregon State Univ, Corvallis, OR 97331 USA
关键词
Machine ethics; normative systems; deontic logic; model checking; explainability;
D O I
10.1145/3514094.3534163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work gives a logical characterization of the (ethical and social) obligations of an agent trained with Reinforcement Learning (RL). An RL agent takes actions by following a utility-maximizing policy. We maintain that the choice of utility function embeds ethical and social values implicitly, and that it is necessary to make these values explicit. This work provides a basis for doing so. First, we propose a probabilistic deontic logic that is suited for formally specifying the obligations of a stochastic system, including its ethical obligations. We prove some useful validities about this logic, and how its semantics are compatible with those of Markov Decision Processes (MDPs). Second, we show that model checking allows us to prove that an agent has a given obligation to bring about some state of affairs - meaning that by acting optimally, it is seeking to reach that state of affairs. We develop a model checker for our logic against MDPs. Third, we observe that it is useful for a system designer to obtain a logical characterization of her system's obligations, which is potentially more interpretable and helpful in debugging than the expression of a utility function. Enumerating all the obligations of an agent is impractical, so we propose a Bayesian optimization routine that learns to generate a system's obligations that the system designer deems interesting. We implement the model checking and Bayesian optimization routines, and demonstrate their effectiveness with an initial pilot study. This work provides a rigorous method to characterize utility-maximizing agents in terms of the (ethical and social) obligations that they implicitly seek to satisfy.
引用
收藏
页码:653 / 663
页数:11
相关论文
共 50 条
  • [1] Utility-maximizing Server Selection
    Truong Khoa Phan
    Griffin, David
    Maini, Elisa
    Rio, Miguel
    [J]. 2016 IFIP NETWORKING CONFERENCE (IFIP NETWORKING) AND WORKSHOPS, 2016, : 413 - 421
  • [2] UNIVERSALLY UTILITY-MAXIMIZING PRIVACY MECHANISMS
    Ghosh, Arpita
    Roughgarden, Tim
    Sundararajan, Mukund
    [J]. SIAM JOURNAL ON COMPUTING, 2012, 41 (06) : 1673 - 1693
  • [3] Utility-Maximizing Task Scheduling for Partially Observable Multiagent Systems
    Ji, Qi-jin
    Yang, Zhe
    Zhu, Yan-qin
    [J]. MECHANICAL ENGINEERING AND TECHNOLOGY, 2012, 125 : 387 - 394
  • [4] STAGGERED PRICES IN A UTILITY-MAXIMIZING FRAMEWORK
    CALVO, GA
    [J]. JOURNAL OF MONETARY ECONOMICS, 1983, 12 (03) : 383 - 398
  • [5] Universally Utility-Maximizing Privacy Mechanisms
    Ghosh, Arpita
    Roughgarden, Tim
    Sundararajan, Mukund
    [J]. STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 351 - 359
  • [6] Causal Feature Learning for Utility-Maximizing Agents
    Kinney, David
    Watson, David
    [J]. INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 : 257 - 268
  • [7] OPTIMAL-CONTRACTS WITH A UTILITY-MAXIMIZING AUDITOR
    BAIMAN, S
    EVANS, JH
    NOEL, J
    [J]. JOURNAL OF ACCOUNTING RESEARCH, 1987, 25 (02) : 217 - 244
  • [8] Model selection in utility-maximizing binary prediction
    Su, Jiun-Hua
    [J]. JOURNAL OF ECONOMETRICS, 2021, 223 (01) : 96 - 124
  • [9] A UTILITY-MAXIMIZING MECHANISM FOR VICARIOUS REWARD - COMMENTS
    AINSLIE, G
    [J]. RATIONALITY AND SOCIETY, 1995, 7 (04) : 393 - 403
  • [10] The utility-maximizing self-employed physician
    Thornton, J
    Eakin, BK
    [J]. JOURNAL OF HUMAN RESOURCES, 1997, 32 (01) : 98 - 128