Model Debiasing via Gradient-based Explanation on Representation

被引:0
|
作者
Zhang, Jindi [1 ]
Wang, Luning [1 ]
Su, Dan [3 ]
Huang, Yongxiang [1 ]
Cao, Caleb Chen [2 ]
Chen, Lei [2 ]
机构
[1] Huawei, Hong Kong Res Ctr, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci, Hong Kong, Peoples R China
[3] NVIDIA Res, Hong Kong, Peoples R China
关键词
fairness; model debiasing; representation learning; gradient-based explanation;
D O I
10.1145/3600211.3604668
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning systems produce biased results towards certain demographic groups, known as the fairness problem. Recent approaches to tackle this problem learn a latent code (i.e., representation) through disentangled representation learning and then discard the latent code dimensions correlated with sensitive attributes (e.g., gender). Nevertheless, these approaches may suffer from incomplete disentanglement and overlook proxy attributes (proxies for sensitive attributes) when processing real-world data, especially for unstructured data, causing performance degradation in fairness and loss of useful information for downstream tasks. In this paper, we propose a novel fairness framework that performs debiasing with regard to both sensitive attributes and proxy attributes, which boosts the prediction performance of downstream task models without complete disentanglement. The main idea is to, first, leverage gradient-based explanation to find two model focuses, 1) one focus for predicting sensitive attributes and 2) the other focus for predicting downstream task labels, and second, use them to perturb the latent code that guides the training of downstream task models towards fairness and utility goals. We show empirically that our frameworkworks with both disentangled and non-disentangled representation learning methods and achieves better fairness-accuracy trade-off on unstructured and structured datasets than previous state-of-the-art approaches.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [1] Robust face recognition via gradient-based sparse representation
    Ma, Peng
    Yang, Dan
    Ge, Yongxin
    Zhang, Xiaohong
    Qu, Ying
    Huang, Sheng
    Lu, Jiwen
    JOURNAL OF ELECTRONIC IMAGING, 2013, 22 (01)
  • [2] On Spectral Properties of Gradient-Based Explanation Methods
    Mehrpanah, Amir
    Englesson, Erik
    Azizpoure, Hossein
    COMPUTER VISION - ECCV 2024, PT LXXXVII, 2025, 15145 : 282 - 299
  • [3] Orthogonal Gradient-Based Binary Image Representation for Vehicle Detection
    Czapla, Zbigniew
    COMPUTER VISION AND GRAPHICS, ICCVG 2016, 2016, 9972 : 453 - 461
  • [4] A gradient-based calibration method for the Heston model
    Clevenhaus, Anna
    Totzeck, Claudia
    Ehrhardt, Matthias
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2024, 101 (9-10) : 1094 - 1112
  • [5] A gradient-based model of the peripheral drift illusion
    Ashida, H
    Kitaoka, A
    PERCEPTION, 2003, 32 : 106 - 106
  • [6] A Variational Model for Gradient-Based Video Editing
    Rida Sadek
    Gabriele Facciolo
    Pablo Arias
    Vicent Caselles
    International Journal of Computer Vision, 2013, 103 : 127 - 162
  • [7] MODEL STRUCTURE ADAPTATION: A GRADIENT-BASED APPROACH
    La Cava, William G.
    Danai, Kourosh
    PROCEEDINGS OF THE ASME 8TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2015, VOL 1, 2016,
  • [8] Gradient-Based Language Model Red Teaming
    Wichers, Nevan
    Denison, Carson
    Beirami, Ahmad
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 2862 - 2881
  • [9] A Variational Model for Gradient-Based Video Editing
    Sadek, Rida
    Facciolo, Gabriele
    Arias, Pablo
    Caselles, Vicent
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 103 (01) : 127 - 162
  • [10] ACOUSTIC CLOAK DESIGN VIA GRADIENT-BASED OPTIMIZATION
    Avina, Angel
    Gerges, Samer
    Amirkulova, Feruza A.
    Du, Winncy
    PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 4, 2023,