Latent Imitator: Generating Natural Individual Discriminatory Instances for Black-Box Fairness Testing

被引:6
|
作者
Xiao, Yisong [1 ,2 ]
Liu, Aishan [1 ,3 ]
Li, Tianlin [4 ]
Liu, Xianglong [1 ,3 ,5 ]
机构
[1] Beihang Univ, NLSDE, Beijing, Peoples R China
[2] Beihang Univ, Shen Yuan Honors Coll, Beijing, Peoples R China
[3] Inst Dataspace, Hefei, Anhui, Peoples R China
[4] Nanyang Technol Univ, Singapore, Singapore
[5] Zhongguancun Lab, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Fairness Testing; Individual Discrimination; Latent Space; Natural Individual Discriminatory Instances;
D O I
10.1145/3597926.3598099
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Machine learning (ML) systems have achieved remarkable performance across a wide area of applications. However, they frequently exhibit unfair behaviors in sensitive application domains (e.g., employment and loan), raising severe fairness concerns. To evaluate and test fairness, engineers often generate individual discriminatory instances to expose unfair behaviors before model deployment. However, existing baselines ignore the naturalness of generation and produce instances that deviate from the real data distribution, which may fail to reveal the actual model fairness since these unnatural discriminatory instances are unlikely to appear in practice. To address the problem, this paper proposes a framework named Latent Imitator (LIMI) to generate more natural individual discriminatory instances with the help of a generative adversarial network (GAN), where we imitate the decision boundary of the target model in the semantic latent space of GAN and further samples latent instances on it. Specifically, we first derive a surrogate linear boundary to coarsely approximate the decision boundary of the target model, which reflects the nature of the original data distribution. Subsequently, to obtain more natural instances, we manipulate random latent vectors to the surrogate boundary with a one-step movement, and further conduct vector calculation to probe two potential discriminatory candidates that may be more closely located in the real decision boundary. Extensive experiments on various datasets demonstrate that our LIMI outperforms other baselines largely in effectiveness (x9.42 instances), efficiency (x8.71 speeds), and naturalness (+19.65%) on average. In addition, we empirically demonstrate that retraining on test samples generated by our approach can lead to improvements in both individual fairness (45.67% on IFr and 32.81% on IFo) and group fairness (9.86% on SPD and 28.38% on AOD). Our codes can be found on our website.
引用
收藏
页码:829 / 841
页数:13
相关论文
共 50 条
  • [41] RESTTESTGEN: Automated Black-Box Testing of RESTful APIs
    Viglianisi, Emanuele
    Dallago, Michael
    Ceccato, Mariano
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VALIDATION AND VERIFICATION (ICST 2020), 2020, : 142 - 152
  • [42] Automated black-box testing with abstract VDM oracles
    Aichernig, BK
    COMPUTER SAFETY, RELIABILITY AND SECURITY, 1999, 1698 : 250 - 259
  • [43] Constructive Membership Testing in Black-Box Classical Groups
    Ambrose, Sophie
    Murray, Scott H.
    Praeger, Cheryl E.
    Schneider, Csaba
    MATHEMATICAL SOFTWARE - ICMS 2010, 2010, 6327 : 54 - +
  • [44] Black-Box Testing and Auditing of Bias in ADM Systems
    Krafft, Tobias D.
    Hauer, Marc P.
    Zweig, Katharina
    MINDS AND MACHINES, 2024, 34 (02)
  • [45] An Evolutionary, Gradient-Free, Query-Efficient, Black-Box Algorithm for Generating Adversarial Instances in Deep Convolutional Neural Networks
    Lapid, Raz
    Haramaty, Zvika
    Sipper, Moshe
    ALGORITHMS, 2022, 15 (11)
  • [46] CovDroid: A Black-Box Testing Coverage System for Android
    Yeh, Chao-Chun
    Huang, Shih-Kun
    IEEE 39TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC 2015), VOL 3, 2015, : 447 - 452
  • [47] Black Box Fairness Testing of Machine Learning Models
    Aggarwal, Aniya
    Lohia, Pranay
    Nagar, Seema
    Dey, Kuntal
    Saha, Diptikalyan
    ESEC/FSE'2019: PROCEEDINGS OF THE 2019 27TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2019, : 625 - 635
  • [48] Black-Box Optimizer with Stochastic Implicit Natural Gradient
    Lyu, Yueming
    Tsang, Ivor W.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 217 - 232
  • [49] Black-Box Inference for Non-Linear Latent Force Models
    Ward, Wil O. C.
    Ryder, Tom
    Prangle, Dennis
    Alvarez, Mauricio A.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [50] Revizor: Testing Black-Box CPUs against Speculation Contracts
    Oleksenko, Oleksii
    Fetzer, Christof
    Kopf, Boris
    Silberstein, Mark
    ASPLOS '22: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2022, : 226 - 239