Discovering Rule Lists with Preferred Variables

被引：0

作者：

Papagianni, Ioanna ^{[1
]}

van Leeuwen, Matthijs ^{[1
]}

机构：

[1] Leiden Univ, LIACS, Leiden, Netherlands

来源：

ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023 | 2023年 / 13876卷

基金：

荷兰研究理事会;

关键词：

Classification; Probabilistic rule lists; Minimum description length (MDL) principle; Human-guided machine learning;

D O I：

10.1007/978-3-031-30047-9_27

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Interpretable machine learning focuses on learning models that are inherently understandable by humans. Even such interpretable models, however, must be trustworthy for domain experts to adopt them. This requires not only accurate predictions, but also reliable explanations that do not contradict a domain expert's knowledge. When considering rule-based models, for example, rules may include certain variables either due to artefacts in the data, or due to the search heuristics used. When such rules are provided as explanations, this may lead to distrust. We investigate whether human guidance could benefit interpretable machine learning when it comes to learning models that provide both accurate predictions and reliable explanations. The form of knowledge that we consider is that of preferred variables, i.e., variables that the domain expert deems important enough to be given higher priority than the other variables. We study this question for the task of multiclass classification, use probabilistic rule lists as interpretable models, and use the minimum description length (MDL) principle for model selection. We propose S-Classy, an algorithm based on beam search that learns rule lists and takes preferred variables into account. We compare S-Classy to its baseline method, i.e., without using preferred variables, and empirically demonstrate that adding preferred variables does not harm predictive performance, while it does result in the preferred variables being used in rules higher up in the learned rule lists.

引用

页码：340 / 352

页数：13

共 50 条

[21] Discovering Diverse Top-K Characteristic Lists
Lopez-Martinez-Carrasco, Antonio
Proenca, Hugo M.
Juarez, Jose M.
van Leeuwen, Matthijs
Campos, Manuel
ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023, 2023, 13876 : 262 - 273
[22] How Similar Are States' Medicaid Preferred Drug Lists?
Ketcham, Jonathan D.
Ngai, Jeffrey K.
AMERICAN JOURNAL OF MANAGED CARE, 2008, 14 (11): : SP46 - SP52
[23] Discovering and characterizing Hidden Variables
Ray, Soumi
Oates, Tim
ARTRIFICIAL GENERAL INTELLIGENCE, AGI 2010, 2010, 10 : 127 - 132
[24] DIVINE: DIscovering variables IN executables
Balakrishnan, Gogul
Reps, Thomas
VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, PROCEEDINGS, 2007, 4349 : 1 - +
[25] REDS: Rule Extraction for Discovering Scenarios
Arzamasov, Vadim
Bohm, Klemens
SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 115 - 128
[26] Medicaid Preferred Drug Lists: Cost Containment and Side Effects
Alvin E. Headen
PharmacoEconomics, 2006, 24 (Suppl 3) : 1 - 3
[27] Applying the Essential Medicines Concept to US Preferred Drug Lists
Millar, Timothy P.
Wong, Shirley
Odierna, Donna H.
Bero, Lisa A.
AMERICAN JOURNAL OF PUBLIC HEALTH, 2011, 101 (08) : 1444 - 1448
[28] Robust subgroup discovery Discovering subgroup lists using MDL
Proenca, Hugo M.
Grunwald, Peter
Back, Thomas
van Leeuwen, Matthijs
DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (05) : 1885 - 1970
[29] An Optimization Approach to Learning Falling Rule Lists
Chen, Chaofan
Rudin, Cynthia
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
[30] MMS lists options for royalty rule changes
不详
OIL & GAS JOURNAL, 1997, 95 (39) : 44 - &

← 1 2 3 4 5 →