On Explaining Random Forests with SAT

被引:0
|
作者
Izza, Yacine [1 ]
Marques-Silva, Joao [2 ]
机构
[1] Univ Toulouse, Toulouse, France
[2] CNRS, IRIT, Toulouse, France
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Random Forests (RFs) are among the most widely used Machine Learning (ML) classifiers. Even though RFs are not interpretable, there are no dedicated non-heuristic approaches for computing explanations of RFs. Moreover, there is recent work on polynomial algorithms for explaining ML models, including naive Bayes classifiers. Hence, one question is whether finding explanations of RFs can be solved in polynomial time. This paper answers this question negatively, by proving that deciding whether a set of literals is a PI-explanation of an RF is DP-complete. Furthermore, the paper proposes a propositional encoding for computing explanations of RFs, thus enabling finding PI-explanations with a SAT solver. This contrasts with earlier work on explaining boosted trees (BTs) and neural networks (NNs), which requires encodings based on SMT/MILP. Experimental results, obtained on a wide range of publicly available datasets, demonstrate that the proposed SAT-based approach scales to RFs of sizes common in practical applications. Perhaps more importantly, the experimental results demonstrate that, for the vast majority of examples considered, the SAT-based approach proposed in this paper significantly outperforms existing heuristic approaches.
引用
收藏
页码:2584 / 2591
页数:8
相关论文
共 50 条
  • [41] Critical random forests
    Martin, James B.
    Yeo, Dominic
    ALEA-LATIN AMERICAN JOURNAL OF PROBABILITY AND MATHEMATICAL STATISTICS, 2018, 15 (02): : 913 - 960
  • [42] On the asymptotics of random forests
    Scornet, Erwan
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 146 : 72 - 83
  • [43] Multivariate random forests
    Segal, Mark
    Xiao, Yuanyuan
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) : 80 - 87
  • [44] Random Tessellation Forests
    Ge, Shufei
    Wang, Shijia
    Teh, Yee Whye
    Wang, Liangliang
    Elliott, Lloyd T.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [45] Random Similarity Forests
    Piernik, Maciej
    Brzezinski, Dariusz
    Zawadzki, Pawel
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT V, 2023, 13717 : 53 - 69
  • [46] Random Forests in Chapel
    Albrecht, Benjamin
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 676 - 676
  • [47] Neural Random Forests
    Biau, Gerard
    Scornet, Erwan
    Welbl, Johannes
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2019, 81 (02): : 347 - 386
  • [48] Random Survival Forests
    Taylor, Jeremy M. G.
    JOURNAL OF THORACIC ONCOLOGY, 2011, 6 (12) : 1974 - 1975
  • [49] Dynamic Random Forests
    Bernard, Simon
    Adam, Sebastien
    Heutte, Laurent
    PATTERN RECOGNITION LETTERS, 2012, 33 (12) : 1580 - 1586
  • [50] Random Kernel Forests
    Devyatkin, A. Dmitry
    Grigoriev, G. Oleg
    IEEE ACCESS, 2022, 10 : 77962 - 77979