On Explaining Random Forests with SAT

被引:0
|
作者
Izza, Yacine [1 ]
Marques-Silva, Joao [2 ]
机构
[1] Univ Toulouse, Toulouse, France
[2] CNRS, IRIT, Toulouse, France
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Forests (RFs) are among the most widely used Machine Learning (ML) classifiers. Even though RFs are not interpretable, there are no dedicated non-heuristic approaches for computing explanations of RFs. Moreover, there is recent work on polynomial algorithms for explaining ML models, including naive Bayes classifiers. Hence, one question is whether finding explanations of RFs can be solved in polynomial time. This paper answers this question negatively, by proving that deciding whether a set of literals is a PI-explanation of an RF is DP-complete. Furthermore, the paper proposes a propositional encoding for computing explanations of RFs, thus enabling finding PI-explanations with a SAT solver. This contrasts with earlier work on explaining boosted trees (BTs) and neural networks (NNs), which requires encodings based on SMT/MILP. Experimental results, obtained on a wide range of publicly available datasets, demonstrate that the proposed SAT-based approach scales to RFs of sizes common in practical applications. Perhaps more importantly, the experimental results demonstrate that, for the vast majority of examples considered, the SAT-based approach proposed in this paper significantly outperforms existing heuristic approaches.
引用
收藏
页码:2584 / 2591
页数:8
相关论文
共 50 条
  • [31] Unsupervised random forests
    Mantero, Alejandro
    Ishwaran, Hemant
    STATISTICAL ANALYSIS AND DATA MINING, 2021, 14 (02) : 144 - 167
  • [32] Extremal Random Forests
    Gnecco, Nicola
    Terefe, Edossa Merga
    Engelke, Sebastian
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 3059 - 3072
  • [33] Enriched random forests
    Amaratunga, Dhammika
    Cabrera, Javier
    Lee, Yung-Seop
    BIOINFORMATICS, 2008, 24 (18) : 2010 - 2014
  • [34] Joints in Random Forests
    Correia, Alvaro H. C.
    Peharz, Robert
    de Campos, Cassio
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [35] Random Forests with R
    Maindonald, John H.
    INTERNATIONAL STATISTICAL REVIEW, 2021, 89 (02) : 422 - 423
  • [36] RANDOM SURVIVAL FORESTS
    Ishwaran, Hemant
    Kogalur, Udaya B.
    Blackstone, Eugene H.
    Lauer, Michael S.
    ANNALS OF APPLIED STATISTICS, 2008, 2 (03): : 841 - 860
  • [37] Evidential Random Forests
    Hoarau, Arthur
    Martin, Arnaud
    Dubois, Jean-Christophe
    Le Gall, Yolande
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [38] GENERALIZED RANDOM FORESTS
    Athey, Susan
    Tibshirani, Julie
    Wager, Stefan
    ANNALS OF STATISTICS, 2019, 47 (02): : 1148 - 1178
  • [39] RANDOM RECURSIVE FORESTS
    BALINSKA, KT
    QUINTAS, LV
    SZYMANSKI, J
    RANDOM STRUCTURES & ALGORITHMS, 1994, 5 (01) : 3 - 12
  • [40] Calibrating Random Forests
    Bostrom, Henrik
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 121 - 126