Explaining Reinforcement Learning with Shapley Values

被引：0

作者：

Beechey, Daniel ^{[1
]}

Smith, Thomas M. S. ^{[1
]}

Simsek, Ozgur ^{[1
]}

机构：

[1] Univ Bath, Dept Comp Sci, Bath, Avon, England

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷

基金：

英国工程与自然科学研究理事会;

关键词：

CLASSIFICATIONS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For reinforcement learning systems to be widely adopted, their users must understand and trust them. We present a theoretical analysis of explaining reinforcement learning using Shapley values, following a principled approach from game theory for identifying the contribution of individual players to the outcome of a cooperative game. We call this general framework Shapley Values for Explaining Reinforcement Learning (SVERL). Our analysis exposes the limitations of earlier uses of Shapley values in reinforcement learning. We then develop an approach that uses Shapley values to explain agent performance. In a variety of domains, SVERL produces meaningful explanations that match and supplement human intuition.

引用

页数：12

共 50 条

[21] Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning
Li, Jiahui
Kuang, Kun
Wang, Baoxiang
Liu, Furui
Chen, Long
Wu, Fei
Xiao, Jun
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 934 - 942
[22] EDGE: Explaining Deep Reinforcement Learning Policies
Guo, Wenbo
Wu, Xian
Khan, Usmann
Xing, Xinyu
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[23] EXPLAINING DEEP LEARNING MODELS FOR SPOOFING AND DEEPFAKE DETECTION WITH SHAPLEY ADDITIVE EXPLANATIONS
Ge, Wanying
Patino, Jose
Todisco, Massimiliano
Evans, Nicholas
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6387 - 6391
[24] Explaining Deep Q-Learning Experience Replay with SHapley Additive exPlanations
Sullivan, Robert S.
Longo, Luca
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (04): : 1433 - 1455
[25] ON WEIGHTED SHAPLEY VALUES
KALAI, E
SAMET, D
INTERNATIONAL JOURNAL OF GAME THEORY, 1987, 16 (03) : 205 - 222
[26] Explaining Reinforcement Learning to Mere Mortals: An Empirical Study
Anderson, Andrew
Dodge, Jonathan
Sadarangani, Amrita
Juozapaitis, Zoe
Newman, Evan
Irvine, Jed
Chattopadhyay, Souti
Fern, Alan
Burnett, Margaret
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 1328 - 1334
[27] EXACT SHAPLEY VALUES FOR EXPLAINING COMPLEX MACHINE LEARNING BASED MOLECULAR TESTS OF CHECKPOINT INHIBITORS: POTENTIAL UTILITY FOR PATIENTS, PHYSICIANS, AND TRANSLATIONAL RESEARCH
Roder, Heinrich
Net, Lelia
Roder, Joanna
Campbell, Thomas
McCleland, Mark
Zou, Wei
Srivastava, Minu
Shames, David
Maguire, Laura
Georgantas, Robert, III
JOURNAL FOR IMMUNOTHERAPY OF CANCER, 2021, 9 : A870 - A871
[28] Shapley Chains: Extending Shapley Values to Classifier Chains
Ayad, Celia Wafa
Bonnier, Thomas
Bosch, Benjamin
Read, Jesse
DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 541 - 555
[29] P-Shapley: Shapley Values on Probabilistic Classifiers
Xia, Haocheng
Li, Xiang
Pang, Junyuan
Liu, Jinfei
Ren, Kui
Xiong, Li
PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (07): : 1737 - 1750
[30] ON AXIOMATIZATIONS OF THE WEIGHTED SHAPLEY VALUES
NOWAK, AS
RADZIK, T
GAMES AND ECONOMIC BEHAVIOR, 1995, 8 (02) : 389 - 405

← 1 2 3 4 5 →