Latent Imitator: Generating Natural Individual Discriminatory Instances for Black-Box Fairness Testing

Cited by: 6
Authors
Xiao, Yisong [1, 2]
Liu, Aishan [1, 3]
Li, Tianlin [4]
Liu, Xianglong [1, 3, 5]
Affiliations
[1] Beihang University, NLSDE, Beijing, People's Republic of China
[2] Beihang University, Shen Yuan Honors College, Beijing, People's Republic of China
[3] Institute of Dataspace, Hefei, Anhui, People's Republic of China
[4] Nanyang Technological University, Singapore, Singapore
[5] Zhongguancun Laboratory, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Fairness Testing; Individual Discrimination; Latent Space; Natural Individual Discriminatory Instances;
DOI
10.1145/3597926.3598099
Chinese Library Classification number
TP31 [Computer Software]
Subject classification codes
081202; 0835
Abstract
Machine learning (ML) systems have achieved remarkable performance across a wide range of applications. However, they frequently exhibit unfair behaviors in sensitive application domains (e.g., employment and loans), raising severe fairness concerns. To evaluate and test fairness, engineers often generate individual discriminatory instances to expose unfair behaviors before model deployment. However, existing baselines ignore the naturalness of generation and produce instances that deviate from the real data distribution. Such unnatural discriminatory instances are unlikely to appear in practice, so they may fail to reveal the actual fairness of the model. To address this problem, this paper proposes a framework named Latent Imitator (LIMI) that generates more natural individual discriminatory instances with the help of a generative adversarial network (GAN): it imitates the decision boundary of the target model in the semantic latent space of the GAN and then samples latent instances on that boundary. Specifically, we first derive a surrogate linear boundary to coarsely approximate the decision boundary of the target model, which reflects the nature of the original data distribution. Subsequently, to obtain more natural instances, we move random latent vectors onto the surrogate boundary with a one-step movement, and then use vector calculation to probe two potential discriminatory candidates that may lie closer to the real decision boundary. Extensive experiments on various datasets demonstrate that LIMI substantially outperforms other baselines in effectiveness (×9.42 instances), efficiency (×8.71 speed), and naturalness (+19.65%) on average. In addition, we empirically demonstrate that retraining on test samples generated by our approach can improve both individual fairness (45.67% on IFr and 32.81% on IFo) and group fairness (9.86% on SPD and 28.38% on AOD). Our code can be found on our website.
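The geometric steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the surrogate boundary is a hyperplane w·z + b = 0 in latent space, that the "one-step movement" is an orthogonal projection onto that hyperplane, and that the two candidates are probes at a fixed distance on either side of it. The function names, the `step` value, and the toy classifiers are all hypothetical, and decoding latent vectors through the GAN generator is omitted.

```python
import numpy as np


def project_to_hyperplane(z, w, b):
    """Move z onto the hyperplane w.z + b = 0 in a single step (assumed reading)."""
    return z - ((z @ w + b) / (w @ w)) * w


def probe_candidates(z, w, b, step=0.3):
    """Return the projected point plus two probes, one on each side of the boundary.

    `step` is a hypothetical probing distance, not a value from the paper.
    """
    normal = w / np.linalg.norm(w)  # unit normal of the surrogate boundary
    z0 = project_to_hyperplane(z, w, b)
    return z0, z0 + step * normal, z0 - step * normal


def is_discriminatory(predict, x, sensitive_idx, sensitive_values):
    """True if changing only the sensitive attribute changes the prediction."""
    base = predict(x)
    for value in sensitive_values:
        if value == x[sensitive_idx]:
            continue
        x_alt = x.copy()
        x_alt[sensitive_idx] = value
        if predict(x_alt) != base:
            return True
    return False


if __name__ == "__main__":
    w, b = np.array([1.0, -2.0, 0.5]), 0.25  # toy surrogate boundary
    z = np.array([0.3, 1.2, -0.7])           # toy latent vector
    z0, z_plus, z_minus = probe_candidates(z, w, b)
    print("signed scores:", z0 @ w + b, z_plus @ w + b, z_minus @ w + b)

    # Toy classifier whose output depends on feature 1, treated as sensitive.
    biased = lambda x: int(x[0] + 2.0 * x[1] > 1.0)
    print(is_discriminatory(biased, np.array([0.5, 0.0]), 1, [0.0, 1.0]))
```

In a full pipeline, each probed latent vector would be decoded into a data instance by the generator and then passed to a check such as `is_discriminatory` against the black-box target model.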
Pages: 829-841
Number of pages: 13