Discovering Rule Lists with Preferred Variables

被引:0
|
作者
Papagianni, Ioanna [1 ]
van Leeuwen, Matthijs [1 ]
机构
[1] Leiden Univ, LIACS, Leiden, Netherlands
来源
ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023 | 2023年 / 13876卷
基金
荷兰研究理事会;
关键词
Classification; Probabilistic rule lists; Minimum description length (MDL) principle; Human-guided machine learning;
D O I
10.1007/978-3-031-30047-9_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interpretable machine learning focuses on learning models that are inherently understandable by humans. Even such interpretable models, however, must be trustworthy for domain experts to adopt them. This requires not only accurate predictions, but also reliable explanations that do not contradict a domain expert's knowledge. When considering rule-based models, for example, rules may include certain variables either due to artefacts in the data, or due to the search heuristics used. When such rules are provided as explanations, this may lead to distrust. We investigate whether human guidance could benefit interpretable machine learning when it comes to learning models that provide both accurate predictions and reliable explanations. The form of knowledge that we consider is that of preferred variables, i.e., variables that the domain expert deems important enough to be given higher priority than the other variables. We study this question for the task of multiclass classification, use probabilistic rule lists as interpretable models, and use the minimum description length (MDL) principle for model selection. We propose S-Classy, an algorithm based on beam search that learns rule lists and takes preferred variables into account. We compare S-Classy to its baseline method, i.e., without using preferred variables, and empirically demonstrate that adding preferred variables does not harm predictive performance, while it does result in the preferred variables being used in rules higher up in the learned rule lists.
引用
收藏
页码:340 / 352
页数:13
相关论文
共 50 条
  • [21] Discovering Diverse Top-K Characteristic Lists
    Lopez-Martinez-Carrasco, Antonio
    Proenca, Hugo M.
    Juarez, Jose M.
    van Leeuwen, Matthijs
    Campos, Manuel
    ADVANCES IN INTELLIGENT DATA ANALYSIS XXI, IDA 2023, 2023, 13876 : 262 - 273
  • [22] How Similar Are States' Medicaid Preferred Drug Lists?
    Ketcham, Jonathan D.
    Ngai, Jeffrey K.
    AMERICAN JOURNAL OF MANAGED CARE, 2008, 14 (11): : SP46 - SP52
  • [23] Discovering and characterizing Hidden Variables
    Ray, Soumi
    Oates, Tim
    ARTRIFICIAL GENERAL INTELLIGENCE, AGI 2010, 2010, 10 : 127 - 132
  • [24] DIVINE: DIscovering variables IN executables
    Balakrishnan, Gogul
    Reps, Thomas
    VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, PROCEEDINGS, 2007, 4349 : 1 - +
  • [25] REDS: Rule Extraction for Discovering Scenarios
    Arzamasov, Vadim
    Bohm, Klemens
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 115 - 128
  • [26] Medicaid Preferred Drug Lists: Cost Containment and Side Effects
    Alvin E. Headen
    PharmacoEconomics, 2006, 24 (Suppl 3) : 1 - 3
  • [27] Applying the Essential Medicines Concept to US Preferred Drug Lists
    Millar, Timothy P.
    Wong, Shirley
    Odierna, Donna H.
    Bero, Lisa A.
    AMERICAN JOURNAL OF PUBLIC HEALTH, 2011, 101 (08) : 1444 - 1448
  • [28] Robust subgroup discovery Discovering subgroup lists using MDL
    Proenca, Hugo M.
    Grunwald, Peter
    Back, Thomas
    van Leeuwen, Matthijs
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (05) : 1885 - 1970
  • [29] An Optimization Approach to Learning Falling Rule Lists
    Chen, Chaofan
    Rudin, Cynthia
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [30] MMS lists options for royalty rule changes
    不详
    OIL & GAS JOURNAL, 1997, 95 (39) : 44 - &