Extract interpretability-accuracy balanced rules from artificial neural networks: A review

被引:62
|
作者
He, Congjie [1 ]
Ma, Meng [2 ]
Wang, Ping [1 ,2 ,3 ]
机构
[1] Peking Univ, Sch Software & Microelect, Beijing 102600, Peoples R China
[2] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
[3] Minist Educ, Key Lab High Confidence Software Technol PKU, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Rule extraction; Accuracy; Interpretability; Multilayer Perceptron; Deep neural network; CLASSIFICATION PROBLEMS; DECISION RULES; INDUCTION; ISSUES; TREE;
D O I
10.1016/j.neucom.2020.01.036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Artificial neural networks (ANN) have been widely used and have achieved remarkable achievements. However, neural networks with high accuracy and good performance often have extremely complex internal structures such as deep neural networks (DNN). This shortcoming makes the neural networks as incomprehensible as a black box, which is unacceptable in some practical applications. But pursuing excessive interpretation of the neural networks will make the performance of the model worse. Based on this contradictory issue, we first summarize the mainstream methods about quantitatively evaluating the accuracy and interpretability of rule set. And then review existing methods on extracting rules from Multilayer Perceptron (MLP) and DNN in three categories: Decomposition Approach (Extract rules in neuron level such as visualizing the structure of network), Pedagogical Approach (By studying the correspondence between input and output such as by computing gradient) and Eclectics Approach (Combine the above two ideas). Some potential research directions about extracting rules from DNN are discussed in the last. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:346 / 358
页数:13
相关论文
共 50 条
  • [21] Extraction of similarity based fuzzy rules from artificial neural networks
    Mantas, C.J.
    Puche, J.M.
    Mantas, J.M.
    International Journal of Approximate Reasoning, 2006, 43 (02): : 202 - 221
  • [22] NEWRON: A New Generalization of the Artificial Neuron to Enhance the Interpretability of Neural Networks
    Siciliano, Federico
    Bucarelli, Maria Sofia
    Tolomei, Gabriele
    Silvestri, Fabrizio
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [23] Artificial neural networks:: Review
    Yazici, Ayse Canan
    Oegues, Ersin
    Ankarali, Seyit
    Canan, Sinan
    Ankarali, Handan
    Akkus, Zeki
    TURKIYE KLINIKLERI TIP BILIMLERI DERGISI, 2007, 27 (01): : 65 - 71
  • [24] Interpretability of deep neural networks: A review of methods, classification and hardware
    Antamis, Thanasis
    Drosou, Anastasis
    Vafeiadis, Thanasis
    Nizamis, Alexandros
    Ioannidis, Dimosthenis
    Tzovaras, Dimitrios
    NEUROCOMPUTING, 2024, 601
  • [25] Interpretability for Neural Networks from the Perspective of Probability Density
    Lu, Lu
    Pan, Tingting
    Zhao, Junhong
    Yang, Jie
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 1502 - 1507
  • [26] Extracting rules from artificial Neural Networks with kernel-based representations
    Ramírez, JM
    ENGINEERING APPLICATIONS OF BIO-INSPIRED ARTIFICIAL NEURAL NETWORKS, VOL II, 1999, 1607 : 68 - 77
  • [27] Extraction of the association rules from artificial neural networks based on the multiobjective optimization
    Yedjour, Dounia
    Yedjour, Hayat
    Chouraqui, Samira
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2022, 33 (3-4) : 233 - 252
  • [28] Extraction of fuzzy logic rules from data by means of artificial neural networks
    Holeña, M
    KYBERNETIKA, 2005, 41 (03) : 297 - 314
  • [29] Interpretation of artificial neural networks by means of fuzzy rules
    Castro, JL
    Mantas, CJ
    Benítez, JM
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (01): : 101 - 116
  • [30] Survey and critique of techniques for extracting rules from trained artificial neural networks
    Andrews, R
    Diederich, J
    Tickle, AB
    KNOWLEDGE-BASED SYSTEMS, 1995, 8 (06) : 373 - 389