Feature redundancy assessment framework for subject matter experts

Cited by: 2
Authors
Lee, Kee Khoon Gary [1]
Kasim, Henry [1]
Zhou, Weigui Jair [2]
Sirigina, Rajendra Prasad [2]
Hung, Gih Guang Terence [1]
Affiliations
[1] Rolls Royce Singapore, 1 Seletar Aerosp Crescent, Singapore, Singapore
[2] Nanyang Technol Univ, 50 Nanyang Ave, Singapore, Singapore
Keywords
Feature redundancy; Feature selection; Clustering; Guided feature; Feature swap assessment; Retained information; Unsupervised task; Human-in-the-loop; Subject matter expert in the loop; SELECTION;
DOI
10.1016/j.engappai.2022.105456
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Traditional feature removal techniques focus on how well the selected subset of features performs in terms of model accuracy, while neglecting the elimination of redundant features and the incorporation of Subject Matter Experts' (SMEs') prior knowledge. Incorporating this knowledge matters because it lets SMEs include actionable or controllable features and so build a downstream model with confidence and practical applicability. Furthermore, feature removal should provide evidence of how similar the redundant features are to the selected features. We propose a framework that incorporates SME prior knowledge to assess and augment the relevance of the features to the domain-specific problem. First, we use the Variance Inflation Factor (VIF) to iteratively remove redundant features and measure the resulting information loss. Quantifying this loss helps the SME decide how many features to select. Next, Partitioning Around Medoids (PAM) is used to cluster each redundant feature with its closest selected feature. These clusters guide the SME in the augmentation process, where the SME can retain, add, or swap preferred features with those deemed non-redundant by the algorithm. On four commonly used benchmark datasets (Alate Adelges, Sonar, Wisconsin Diagnostic Breast Cancer, and Wine), we compare our results with the features selected by domain experts, examine how the features are grouped, and identify the possible options for feature swaps. Our results show the similarity between redundant features and their corresponding selected features. We also demonstrate that our framework retains information comparable to that of supervised feature selection methods, with overall retained information higher by up to 3%.
Pages: 8
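
The two steps named in the abstract (iterative VIF-based removal, then grouping each redundant feature with its closest selected feature) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the VIF threshold of 5.0, the synthetic data, and the use of absolute Pearson correlation as the similarity measure are all assumptions made for illustration. The grouping shown is only the assignment step with the selected features held fixed as medoids, not a full PAM run, and the information-loss measurement that the framework reports to the SME is not reproduced here.

```python
import numpy as np

def vif(X):
    """VIF of each column: 1 / (1 - R^2) from regressing it on the other columns."""
    Xc = X - X.mean(axis=0)  # centering removes the need for an intercept term
    scores = np.empty(X.shape[1])
    for j in range(X.shape[1]):
        y = Xc[:, j]
        others = np.delete(Xc, j, axis=1)
        coef, *_ = np.linalg.lstsq(others, y, rcond=None)
        resid = y - others @ coef
        unexplained = (resid @ resid) / (y @ y)  # equals 1 - R^2
        scores[j] = np.inf if unexplained < 1e-12 else 1.0 / unexplained
    return scores

def remove_redundant(X, names, threshold=5.0):
    """Drop the highest-VIF feature repeatedly until all VIFs are <= threshold.

    The threshold is an assumed rule of thumb, not a value from the paper.
    """
    keep = list(range(X.shape[1]))
    removed = []
    while len(keep) > 1:
        scores = vif(X[:, keep])
        worst = int(np.argmax(scores))
        if scores[worst] <= threshold:
            break
        removed.append(keep.pop(worst))
    return [names[i] for i in keep], [names[i] for i in removed]

def assign_to_selected(X, names, selected, removed):
    """Group each removed feature with the selected feature it correlates with most.

    Simplified stand-in for the PAM step: selected features act as fixed medoids.
    """
    corr = np.abs(np.corrcoef(X, rowvar=False))
    idx = {n: i for i, n in enumerate(names)}
    groups = {s: [] for s in selected}
    for r in removed:
        closest = max(selected, key=lambda s: corr[idx[r], idx[s]])
        groups[closest].append(r)
    return groups

# Synthetic data (hypothetical): two independent signals, each with a noisy copy.
rng = np.random.default_rng(0)
a = rng.normal(size=200)
b = rng.normal(size=200)
names = ["a", "a_copy", "b", "b_copy"]
X = np.column_stack([
    a, a + 0.1 * rng.normal(size=200),
    b, b + 0.1 * rng.normal(size=200),
])

selected, removed = remove_redundant(X, names)
groups = assign_to_selected(X, names, selected, removed)
print("selected:", selected)
print("groups:", groups)
```

The resulting groups are what an SME would inspect when deciding whether to swap a selected feature for a redundant one in the same cluster, for example to prefer a feature that is controllable in practice.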