On Explaining Random Forests with SAT

被引:0
|
作者
Izza, Yacine [1 ]
Marques-Silva, Joao [2 ]
机构
[1] Univ Toulouse, Toulouse, France
[2] CNRS, IRIT, Toulouse, France
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Forests (RFs) are among the most widely used Machine Learning (ML) classifiers. Even though RFs are not interpretable, there are no dedicated non-heuristic approaches for computing explanations of RFs. Moreover, there is recent work on polynomial algorithms for explaining ML models, including naive Bayes classifiers. Hence, one question is whether finding explanations of RFs can be solved in polynomial time. This paper answers this question negatively, by proving that deciding whether a set of literals is a PI-explanation of an RF is DP-complete. Furthermore, the paper proposes a propositional encoding for computing explanations of RFs, thus enabling finding PI-explanations with a SAT solver. This contrasts with earlier work on explaining boosted trees (BTs) and neural networks (NNs), which requires encodings based on SMT/MILP. Experimental results, obtained on a wide range of publicly available datasets, demonstrate that the proposed SAT-based approach scales to RFs of sizes common in practical applications. Perhaps more importantly, the experimental results demonstrate that, for the vast majority of examples considered, the SAT-based approach proposed in this paper significantly outperforms existing heuristic approaches.
引用
收藏
页码:2584 / 2591
页数:8
相关论文
共 50 条
  • [1] Explaining Cautious Random Forests via Counterfactuals
    Zhang, Haifei
    Quost, Benjamin
    Masson, Marie-Helene
    BUILDING BRIDGES BETWEEN SOFT AND STATISTICAL METHODOLOGIES FOR DATA SCIENCE, 2023, 1433 : 390 - 397
  • [2] Explaining the success of adaboost and random forests as interpolating classifiers
    Wyner, Abraham J.
    Olson, Matthew
    Bleich, Justin
    Mease, David
    Journal of Machine Learning Research, 2017, 18 : 1 - 33
  • [3] Explaining the Success of AdaBoost and Random Forests as Interpolating Classifiers
    Wyner, Abraham J.
    Olson, Matthew
    Bleich, Justin
    Mease, David
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18 : 1 - 33
  • [4] Explaining Misclassification and Attacks in Deep Learning via Random Forests
    Haffar, Rami
    Domingo-Ferrer, Josep
    Sanchez, David
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2020), 2020, 12256 : 273 - 285
  • [5] Explaining predictions and attacks in federated learning via random forests
    Rami Haffar
    David Sánchez
    Josep Domingo-Ferrer
    Applied Intelligence, 2023, 53 : 169 - 185
  • [6] Explaining Random Forests Using Bipolar Argumentation and Markov Networks
    Potyka, Nico
    Yin, Xiang
    Toni, Francesca
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9453 - 9460
  • [7] Explaining predictions and attacks in federated learning via random forests
    Haffar, Rami
    Sanchez, David
    Domingo-Ferrer, Josep
    APPLIED INTELLIGENCE, 2023, 53 (01) : 169 - 185
  • [8] Random Forests with Economic Roots: Explaining Machine Learning in Hedonic Imputation
    Zeng, Shipei
    Rao, Deyu
    COMPUTATIONAL ECONOMICS, 2024,
  • [9] Explaining the black-white SAT gap
    Schmid, M
    JOURNAL OF BLACKS IN HIGHER EDUCATION, 1999, (25): : 5 - 5
  • [10] Locality in Random SAT Instances
    Giraldez-Cru, Jesus
    Levy, Jordi
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 638 - 644