Model Debiasing via Gradient-based Explanation on Representation

被引：0

作者：

Zhang, Jindi ^{[1
]}

Wang, Luning ^{[1
]}

Su, Dan ^{[3
]}

Huang, Yongxiang ^{[1
]}

Cao, Caleb Chen ^{[2
]}

Chen, Lei ^{[2
]}

机构：

[1] Huawei, Hong Kong Res Ctr, Hong Kong, Peoples R China

[2] Hong Kong Univ Sci, Hong Kong, Peoples R China

[3] NVIDIA Res, Hong Kong, Peoples R China

来源：

PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023 | 2023年

关键词：

fairness; model debiasing; representation learning; gradient-based explanation;

D O I：

10.1145/3600211.3604668

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Machine learning systems produce biased results towards certain demographic groups, known as the fairness problem. Recent approaches to tackle this problem learn a latent code (i.e., representation) through disentangled representation learning and then discard the latent code dimensions correlated with sensitive attributes (e.g., gender). Nevertheless, these approaches may suffer from incomplete disentanglement and overlook proxy attributes (proxies for sensitive attributes) when processing real-world data, especially for unstructured data, causing performance degradation in fairness and loss of useful information for downstream tasks. In this paper, we propose a novel fairness framework that performs debiasing with regard to both sensitive attributes and proxy attributes, which boosts the prediction performance of downstream task models without complete disentanglement. The main idea is to, first, leverage gradient-based explanation to find two model focuses, 1) one focus for predicting sensitive attributes and 2) the other focus for predicting downstream task labels, and second, use them to perturb the latent code that guides the training of downstream task models towards fairness and utility goals. We show empirically that our frameworkworks with both disentangled and non-disentangled representation learning methods and achieves better fairness-accuracy trade-off on unstructured and structured datasets than previous state-of-the-art approaches.

引用

页码：193 / 204

页数：12

共 50 条

[1] Robust face recognition via gradient-based sparse representation
Ma, Peng
Yang, Dan
Ge, Yongxin
Zhang, Xiaohong
Qu, Ying
Huang, Sheng
Lu, Jiwen
JOURNAL OF ELECTRONIC IMAGING, 2013, 22 (01)
[2] On Spectral Properties of Gradient-Based Explanation Methods
Mehrpanah, Amir
Englesson, Erik
Azizpoure, Hossein
COMPUTER VISION - ECCV 2024, PT LXXXVII, 2025, 15145 : 282 - 299
[3] Orthogonal Gradient-Based Binary Image Representation for Vehicle Detection
Czapla, Zbigniew
COMPUTER VISION AND GRAPHICS, ICCVG 2016, 2016, 9972 : 453 - 461
[4] A gradient-based calibration method for the Heston model
Clevenhaus, Anna
Totzeck, Claudia
Ehrhardt, Matthias
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2024, 101 (9-10) : 1094 - 1112
[5] A gradient-based model of the peripheral drift illusion
Ashida, H
Kitaoka, A
PERCEPTION, 2003, 32 : 106 - 106
[6] A Variational Model for Gradient-Based Video Editing
Rida Sadek
Gabriele Facciolo
Pablo Arias
Vicent Caselles
International Journal of Computer Vision, 2013, 103 : 127 - 162
[7] MODEL STRUCTURE ADAPTATION: A GRADIENT-BASED APPROACH
La Cava, William G.
Danai, Kourosh
PROCEEDINGS OF THE ASME 8TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2015, VOL 1, 2016,
[8] Gradient-Based Language Model Red Teaming
Wichers, Nevan
Denison, Carson
Beirami, Ahmad
PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 2862 - 2881
[9] A Variational Model for Gradient-Based Video Editing
Sadek, Rida
Facciolo, Gabriele
Arias, Pablo
Caselles, Vicent
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 103 (01) : 127 - 162
[10] ACOUSTIC CLOAK DESIGN VIA GRADIENT-BASED OPTIMIZATION
Avina, Angel
Gerges, Samer
Amirkulova, Feruza A.
Du, Winncy
PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 4, 2023,

← 1 2 3 4 5 →