Adversarial learning for counterfactual fairness

被引:8
|
作者
Grari, Vincent [1 ,2 ]
Lamprier, Sylvain [1 ]
Detyniecki, Marcin [2 ,3 ]
机构
[1] Sorbonne Univ, CNRS, ISIR, F-75005 Paris, France
[2] AXA, Paris, France
[3] Polish Acad Sci, IBS PAN, Warsaw, Poland
关键词
Counterfactual fairness; Adversarial neural network; Causal inference;
D O I
10.1007/s10994-022-06206-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, fairness has become an important topic in the machine learning research community. In particular, counterfactual fairness aims at building prediction models which ensure fairness at the most individual level. Rather than globally considering equity over the entire population, the idea is to imagine what any individual would look like with a variation of a given attribute of interest, such as a different gender or race for instance. Existing approaches rely on Variational Auto-encoding of individuals, using Maximum Mean Discrepancy (MMD) penalization to limit the statistical dependence of inferred representations with their corresponding sensitive attributes. This enables the simulation of counterfactual samples used for training the target fair model, the goal being to produce similar outcomes for every alternate version of any individual. In this work, we propose to rely on an adversarial neural learning approach, that enables more powerful inference than with MMD penalties, and is particularly better fitted for the continuous setting, where values of sensitive attributes cannot be exhaustively enumerated. Experiments show significant improvements in term of counterfactual fairness for both the discrete and the continuous settings.
引用
收藏
页码:741 / 763
页数:23
相关论文
共 50 条
  • [1] Adversarial learning for counterfactual fairness
    Vincent Grari
    Sylvain Lamprier
    Marcin Detyniecki
    [J]. Machine Learning, 2023, 112 : 741 - 763
  • [2] Counterfactual Adversarial Learning for Recommendation
    Liu, Jialin
    Zhang, Zijian
    Zhao, Xiangyu
    Li, Jun
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4115 - 4119
  • [3] Counterfactual Adversarial Learning with Representation Interpolation
    Wang, Wei
    Wang, Boxin
    Shi, Ning
    Li, Jinfeng
    Zhu, Bingyu
    Liu, Xiangyu
    Zhang, Rong
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4809 - 4820
  • [4] Learning for Counterfactual Fairness from Observational Data
    Ma, Jing
    Guo, Ruocheng
    Zhang, Aidong
    Li, Jundong
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1620 - 1630
  • [5] Adversarial Counterfactual Learning and Evaluation for Recommender System
    Xu, Da
    Ruan, Chuanwei
    Korpeoglu, Evren
    Kumar, Sushant
    Achan, Kannan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [6] Counterfactual Fairness
    Kusner, Matt
    Loftus, Joshua
    Russell, Chris
    Silva, Ricardo
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [7] Fairness by Explicability and Adversarial SHAP Learning
    Hickey, James M.
    Di Stefano, Pietro G.
    Vasileiou, Vlasios
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT III, 2021, 12459 : 174 - 190
  • [8] Learning Fair Node Representations with Graph Counterfactual Fairness
    Ma, Jing
    Guo, Ruocheng
    Wan, Mengting
    Yang, Longqi
    Zhang, Aidong
    Li, Jundong
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 695 - 703
  • [9] On Learning and Testing of Counterfactual Fairness through Data Preprocessing
    Chen, Haoyu
    Lu, Wenbin
    Song, Rui
    Ghosh, Pulak
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) : 1286 - 1296
  • [10] Measuring Fairness in Machine Learning Models via Counterfactual Examples
    Haffar, Rami
    Singh, Ashneet Khandpur
    Domingo-Ferrer, Josep
    Jebreel, Najeeb
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2022, 2022, 13408 : 119 - 131