Efficient fair principal component analysis

被引:6
|
作者
Kamani, Mohammad Mahdi [1 ]
Haddadpour, Farzin [2 ]
Forsati, Rana [3 ]
Mahdavi, Mehrdad [2 ]
机构
[1] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
[2] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
[3] Microsoft Bing, Bellevue, WA USA
关键词
Fairness; Pareto efficient; Dimension reduction; PCA;
D O I
10.1007/s10994-021-06100-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It has been shown that dimension reduction methods such as Principal Component Analysis (PCA) may be inherently prone to unfairness and treat data from different sensitive groups such as race, color, sex, etc., unfairly. In pursuit of fairness-enhancing dimensionality reduction, using the notion of Pareto optimality, we propose an adaptive first-order algorithm to learn a subspace that preserves fairness, while slightly compromising the reconstruction loss. Theoretically, we provide sufficient conditions that the solution of the proposed algorithm belongs to the Pareto frontier for all sensitive groups; thereby, the optimal trade-off between overall reconstruction loss and fairness constraints is guaranteed. We also provide the convergence analysis of our algorithm and show its efficacy through empirical studies on different datasets, which demonstrates superior performance in comparison with state-of-the-art algorithms. The proposed fairness-aware PCA algorithm can be efficiently generalized to multiple group sensitive features and effectively reduce the unfairness decisions in downstream tasks such as classification.
引用
收藏
页码:3671 / 3702
页数:32
相关论文
共 50 条
  • [31] Kernel principal component analysis for efficient, differentiable parameterization of multipoint geostatistics
    Sarma, Pallav
    Durlofsky, Louis J.
    Aziz, Khalid
    MATHEMATICAL GEOSCIENCES, 2008, 40 (01) : 3 - 32
  • [32] Projection sparse principal component analysis: An efficient least squares method
    Merola, Giovanni Maria
    Chen, Gemai
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 173 : 366 - 382
  • [33] An efficient algorithm for Kernel two-dimensional principal component analysis
    Ning Sun
    Hai-xian Wang
    Zhen-hai Ji
    Cai-rong Zou
    Li Zhao
    Neural Computing and Applications, 2008, 17 : 59 - 64
  • [34] An efficient algorithm for Kernel two-dimensional principal component analysis
    Sun, Ning
    Wang, Hai-xian
    Ji, Zhen-hai
    Zou, Cai-rong
    Zhao, Li
    NEURAL COMPUTING & APPLICATIONS, 2008, 17 (01): : 59 - 64
  • [35] An efficient incremental Kernel Principal Component Analysis for online feature selection
    Takeuchi, Yohei
    Wawa, Seiichi
    Abe, Shigeo
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2346 - 2351
  • [36] Efficient Optimization Algorithms for Robust Principal Component Analysis and Its Variants
    Ma, Shiqian
    Aybat, Necdet Serhat
    PROCEEDINGS OF THE IEEE, 2018, 106 (08) : 1411 - 1426
  • [37] Kernel Principal Component Analysis for Efficient, Differentiable Parameterization of Multipoint Geostatistics
    Pallav Sarma
    Louis J. Durlofsky
    Khalid Aziz
    Mathematical Geosciences, 2008, 40 : 3 - 32
  • [38] Efficient tools for principal component analysis of complex data- a tutorial
    Rodionova, Oxana
    Kucheryavskiy, Sergey
    Pomerantsev, Alexey
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 213
  • [39] An Efficient and Reliable Tolerance-Based Algorithm for Principal Component Analysis
    Yeh, Michael
    Gu, Ming
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 642 - 649
  • [40] An Efficient Point Cloud Registration Algorithm Based on Principal Component Analysis
    Chen Yi
    Wang Yong
    Li Jinlong
    Liu Dengzhi
    Gao Xiaorong
    Zhang Yu
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (14)