Improved Feature Importance Computation for Tree Models Based on the Banzhaf Value

被引:0
|
作者
Karczmarz, Adam [1 ,2 ]
Michalak, Tomasz [1 ,2 ]
Mukherjee, Anish [1 ,2 ]
Sankowski, Piotr [1 ,2 ,3 ]
Wygocki, Piotr [1 ,3 ]
机构
[1] Univ Warsaw, Inst Informat, Warsaw, Poland
[2] IDEAS NCBR, Warsaw, Poland
[3] MIM Solut, Warsaw, Poland
基金
欧洲研究理事会;
关键词
EXPLAINABLE AI; SHAPLEY VALUE; EXPLANATIONS; TRACTABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Shapley value - a fundamental game-theoretic solution concept - has recently become one of the main tools used to explain predictions of tree ensemble models. Another well-known game-theoretic solution concept is the Banzhaf value. Although the Banzhaf value is closely related to the Shapley value, its properties w.r.t. feature attribution have not been understood equally well. This paper shows that, for tree ensemble models, the Banzhaf value offers some crucial advantages over the Shapley value while providing similar feature attributions. In particular, we first give an optimal O(TL + n) time algorithm for computing the Banzhaf value-based attribution of a tree ensemble model's output. Here, T is the number of trees, L is the maximum number of leaves in a tree, and n is the number of features. In comparison, the state-of-the-art Shapley value-based algorithm runs in O(TLD2 + n) time, where D denotes the maximum depth of a tree in the ensemble. Next, we experimentally compare the Banzhaf and Shapley values for tree ensemble models. Both methods deliver essentially the same average importance scores for the studied datasets using two different tree ensemble models (the sklearn implementation of Decision Trees or xgboost implementation of Gradient Boosting Decision Trees). However, our results indicate that, on top of being computable faster, the Banzhaf is more numerically robust than the Shapley value.
引用
收藏
页码:969 / 979
页数:11
相关论文
共 50 条
  • [1] Interpretation with baseline shapley value for feature groups on tree models
    Xu, Fan
    Zhou, Zhi-Jian
    Ni, Jie
    Gao, Wei
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (05)
  • [2] Hollow-tree super: A directional and scalable approach for feature importance in boosted tree models
    Doyen, Stephane
    Taylor, Hugh
    Nicholas, Peter
    Crawford, Lewis
    Young, Isabella
    Sughrue, Michael E.
    PLOS ONE, 2021, 16 (10):
  • [3] Iterative missing value imputation based on feature importance
    Guo, Cong
    Yang, Wei
    Liu, Chun
    Li, Zheng
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (10) : 6387 - 6414
  • [4] Unbiased Measurement of Feature Importance in Tree-Based Methods
    Zhou, Zhengze
    Hooker, Giles
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (02)
  • [5] ON MARGINAL FEATURE ATTRIBUTIONS OF TREE-BASED MODELS
    Filom, Khashayar
    Miroshnikov, Alexey
    Kotsiopoulos, Konstandinos
    Kannan, Arjun ravi
    FOUNDATIONS OF DATA SCIENCE, 2024, 6 (04): : 395 - 467
  • [6] Construction of Decision Trees by Using Feature Importance Value for Improved Learning Performance
    Al Iqbal, Md. Ridwan
    Rahaman, Mohammad Saiedur
    Nabil, Syed Irfan
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT II, 2012, 7664 : 242 - 249
  • [7] Feature Selection Based on a Novel Improved Tree Growth Algorithm
    Zhong, Changkang
    Chen, Yu
    Peng, Jian
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 247 - 258
  • [8] Feature Selection Based on a Novel Improved Tree Growth Algorithm
    Changkang Zhong
    Yu Chen
    Jian Peng
    International Journal of Computational Intelligence Systems, 2020, 13 : 247 - 258
  • [9] An improved tree model based on ensemble feature selection for classification
    Mohan, Chandralekha
    Nagarajan, Shenbagavadivu
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1290 - 1307
  • [10] Knowledge Based Decision Tree Construction with Feature Importance Domain Knowledge
    Al Iqbal, Md. Ridwan
    Rahman, Saiedur
    Nabil, Syed Irfan
    Chowdhury, Ijaz Ul Amin
    2012 7TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (ICECE), 2012,