Learning Fairness from Demonstrations via Inverse Reinforcement Learning

被引:0
|
作者
Blandin, Jack [1 ]
Kash, Ian [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
关键词
inverse reinforcement learning; group fairness in classification; fairness transfer learning;
D O I
10.1145/3630106.3658539
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Defining fairness in algorithmic contexts is challenging, particularly when adapting to new domains. Our research introduces a novel method for learning and applying group fairness preferences across different classification domains, without the need for manual fine-tuning. Utilizing concepts from inverse reinforcement learning (IRL), our approach enables the extraction and application of fairness preferences from human experts or established algorithms. We propose the first technique for using IRL to recover and adapt group fairness preferences to new domains, offering a low-touch solution for implementing fair classifiers in settings where expert-established fairness tradeoffs are not yet defined.
引用
收藏
页码:51 / 61
页数:11
相关论文
共 50 条
  • [41] Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
    Rajeswaran, Aravind
    Kumar, Vikash
    Gupta, Abhishek
    Vezzani, Giulia
    Schulman, John
    Todorov, Emanuel
    Levine, Sergey
    [J]. ROBOTICS: SCIENCE AND SYSTEMS XIV, 2018,
  • [42] Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations
    Wang, Xingyu
    Klabjan, Diego
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [43] Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
    Xu, Kelvin
    Ratner, Ellis
    Dragan, Anca
    Levine, Sergey
    Finn, Chelsea
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [44] Learning Human-Aware Robot Navigation from Physical Interaction via Inverse Reinforcement Learning
    Kollmitz, Marina
    Koller, Torsten
    Boedecker, Joschka
    Burgard, Wolfram
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 11025 - 11031
  • [45] Inverse reinforcement learning from summary data
    Kangasraasio, Antti
    Kaski, Samuel
    [J]. MACHINE LEARNING, 2018, 107 (8-10) : 1517 - 1535
  • [46] Inverse reinforcement learning from summary data
    Antti Kangasrääsiö
    Samuel Kaski
    [J]. Machine Learning, 2018, 107 : 1517 - 1535
  • [47] UAV reinforcement learning control algorithm with demonstrations
    Sun D.
    Gao D.
    Zheng J.
    Han P.
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2023, 49 (06): : 1424 - 1433
  • [48] Learning from Successful and Failed Demonstrations via Optimization
    Hertel, Brendan
    Ahmadzadeh, S. Reza
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 7807 - 7812
  • [49] Learning Behavior Styles with Inverse Reinforcement Learning
    Lee, Seong Jae
    popovic, Zoran
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
  • [50] Robust Imitation via Mirror Descent Inverse Reinforcement Learning
    Han, Dong-Sig
    Kim, Hyunseo
    Lee, Hyundo
    Ryu, Je-Hwan
    Zhang, Byoung-Tak
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,