Fair Feature Subset Selection using Multiobjective Genetic Algorithm

被引:5
|
作者
Rehman, Ayaz Ur [1 ]
Nadeem, Anas [1 ]
Malik, Muhammad Zubair [1 ]
机构
[1] North Dakota State Univ, Fargo, ND 58105 USA
关键词
Fairness; Data-sets; Genetic Algorithms; Feature Selection;
D O I
10.1145/3520304.3529061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The feature subset selection problem aims at selecting the relevant subset of features to improve the performance of a Machine Learning (ML) algorithm on training data. Some features in data can be inherently noisy, costly to compute, improperly scaled, or correlated to other features, and they can adversely affect the accuracy, cost, and complexity of the induced algorithm. The goal of traditional feature selection approaches has been to remove such irrelevant features. In recent years ML is making a noticeable impact on the decision-making processes of our everyday lives. We want to ensure that these decisions do not reflect biased behavior towards certain groups or individuals based on the protected attributes such as age, sex, or race. In this paper, we present a feature subset selection approach that improves both fairness and accuracy objectives and computes Pareto-optimal solutions using the NSGA-II algorithm. We use statistical disparity as a fairness metric and F1-Score as a metric for model performance. Our experiments on the most commonly used fairness benchmark datasets with three different machine learning algorithms show that using the evolutionary algorithm we can effectively explore the trade-off between fairness and accuracy.
引用
收藏
页码:360 / 363
页数:4
相关论文
共 50 条
  • [31] Toward an Optimal and Structured Feature Subset Selection for Multi-Target Regression Using Genetic Algorithm
    Syed, Farrukh Hasan
    Tahir, Muhammad Atif
    Frnda, Jaroslav
    Rafi, Muhammad
    Anwar, Muhammad Shahid
    Nedoma, Jan
    [J]. IEEE ACCESS, 2023, 11 : 121966 - 121977
  • [32] A neuro fuzzy algorithm for feature subset selection
    Chakraborty, B
    Chakraborty, G
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2001, E84A (09): : 2182 - 2188
  • [33] BRANCH AND BOUND ALGORITHM FOR FEATURE SUBSET SELECTION
    NARENDRA, P
    FUKUNAGA, K
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1977, 26 (09) : 917 - 922
  • [34] An advanced ACO algorithm for feature subset selection
    Kashef, Shima
    Nezamabadi-pour, Hossein
    [J]. NEUROCOMPUTING, 2015, 147 : 271 - 279
  • [35] A thermodynamical search algorithm for feature subset selection
    Gonzalez, Felix F.
    Belanche, Lluis A.
    [J]. NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 683 - 692
  • [36] A hierarchy reduct algorithm for feature subset selection
    Qu, BB
    Lu, YS
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1157 - 1161
  • [37] Experimental comparison of feature subset selection using GA and ACO algorithm
    Lee, Keunjoon
    Joo, Jinu
    Yang, Jihoon
    Honavar, Vasant
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 465 - 472
  • [38] Enhanced Feature Subset Selection Using Niche Based Bat Algorithm
    Saleem, Noman
    Zafar, Kashif
    Sabzwari, Alizaa Fatima
    [J]. COMPUTATION, 2019, 7 (03)
  • [39] Feature subset selection using improved binary gravitational search algorithm
    Rashedi, Esmat
    Nezamabadi-pour, Hossein
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (03) : 1211 - 1221
  • [40] Feature Subset Selection Using Generalized Steepest Ascent Search Algorithm
    Nakariyakul, Songyot
    [J]. 2009 EIGHTH INTERNATIONAL SYMPOSIUM ON NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2009, : 147 - 151