Fair Feature Subset Selection using Multiobjective Genetic Algorithm

被引:5
|
作者
Rehman, Ayaz Ur [1 ]
Nadeem, Anas [1 ]
Malik, Muhammad Zubair [1 ]
机构
[1] North Dakota State Univ, Fargo, ND 58105 USA
关键词
Fairness; Data-sets; Genetic Algorithms; Feature Selection;
D O I
10.1145/3520304.3529061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The feature subset selection problem aims at selecting the relevant subset of features to improve the performance of a Machine Learning (ML) algorithm on training data. Some features in data can be inherently noisy, costly to compute, improperly scaled, or correlated to other features, and they can adversely affect the accuracy, cost, and complexity of the induced algorithm. The goal of traditional feature selection approaches has been to remove such irrelevant features. In recent years ML is making a noticeable impact on the decision-making processes of our everyday lives. We want to ensure that these decisions do not reflect biased behavior towards certain groups or individuals based on the protected attributes such as age, sex, or race. In this paper, we present a feature subset selection approach that improves both fairness and accuracy objectives and computes Pareto-optimal solutions using the NSGA-II algorithm. We use statistical disparity as a fairness metric and F1-Score as a metric for model performance. Our experiments on the most commonly used fairness benchmark datasets with three different machine learning algorithms show that using the evolutionary algorithm we can effectively explore the trade-off between fairness and accuracy.
引用
收藏
页码:360 / 363
页数:4
相关论文
共 50 条
  • [21] Feature subset selection via multi-objective genetic algorithm
    Lac, HC
    Stacey, DA
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 1349 - 1354
  • [22] A hybrid genetic algorithm for feature subset selection in rough set theory
    Jing, Si-Yuan
    [J]. SOFT COMPUTING, 2014, 18 (07) : 1373 - 1382
  • [23] Optimized Feature Subset Selection Using Genetic Algorithm for Preterm Labor Prediction Based on Electrohysterography
    Nieto-del-Amor, Felix
    Prats-Boluda, Gema
    Martinez-De-Juan, Jose Luis
    Diaz-Martinez, Alba
    Monfort-Ortiz, Rogelio
    Jose Diago-Almela, Vicente
    Ye-Lin, Yiyao
    [J]. SENSORS, 2021, 21 (10)
  • [24] Fair Feature Selection with a Lexicographic Multi-objective Genetic Algorithm
    Brookhouse, James
    Freitas, Alex
    [J]. PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVII, PPSN 2022, PT II, 2022, 13399 : 151 - 163
  • [25] Gabor filter subset selection using a genetic algorithm
    Mandriota, C
    Ancona, N
    Stella, E
    Distante, A
    [J]. OPTOMECHATRONIC SYSTEMS III, 2002, 4902 : 707 - 714
  • [26] Novel multiobjective TLBO algorithms for the feature subset selection problem
    Kiziloz, Hakan Ezgi
    Deniz, Ayca
    Dokeroglu, Tansel
    Cosar, Ahmet
    [J]. NEUROCOMPUTING, 2018, 306 : 94 - 107
  • [27] Face feature selection using genetic algorithm
    Yin Hongtao
    Fu Ping
    Sha Xuejun
    [J]. ISTM/2009: 8TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, 2009, : 980 - 983
  • [28] Feature Selection Using Diploid Genetic Algorithm
    Jasuja A.
    [J]. Annals of Data Science, 2020, 7 (01): : 33 - 43
  • [29] Modified genetic algorithm based feature subset selection in intrusion detection system
    Zhu, YX
    Shan, X
    Guo, J
    [J]. INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 9 - 12
  • [30] Multi-objective Genetic Algorithm setup for Feature Subset Selection in Clustering
    Kashyap, Himanshu
    Das, Sohini
    Bhattacharjee, Jayee
    Halder, Ritu
    Goswami, Saptarsi
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN INFORMATION TECHNOLOGY (RAIT), 2016, : 243 - 247