Feature selection with prior knowledge improves interpretability of chemometrics models

被引:1
|
作者
des Touches, Thomas [1 ]
Munda, Marco [1 ]
Cornet, Thomas [1 ]
Gerkens, Pascal [2 ]
Hellepute, Thibault [1 ]
机构
[1] DNAlytics SA, Ave Jean Monnet 1, B-1348 Louvain La Neuve, Belgium
[2] GlaxoSmithKline Vaccines, Global Proc Sci, Rue Inst 89, B-1330 Rixensart, Belgium
关键词
Spectrometry; Prior knowledge; Feature selection; Linear models;
D O I
10.1016/j.chemolab.2023.104905
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses feature selection for regression of high dimensional data, typically spectrometry measure-ments. In some contexts, prior (but partial) knowledge may be available to guide the selection towards some dimensions a priori assumed to be more relevant. We propose a feature selection method making use of this partial supervision. It extends previous works on feature selection with sparsity-enforcing regularised linear models for classification. In the current regularisation context, a practical approximation of this technique reduces to standard Support Vector Regression learning with iterative re-scaling of the inputs. The scaling factors depend here on the prior knowledge but the final selection may depart from it. Practical results on two data sets show the benefits of the proposed approach on the stability, relevance and interpretability of the selected features, as well as regression performances.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] On the Interpretability of Machine Learning Models and Experimental Feature Selection in Case of Multicollinear Data
    Drobnic, Franc
    Kos, Andrej
    Pustisek, Matevz
    [J]. ELECTRONICS, 2020, 9 (05)
  • [2] New Prior Knowledge Based Extensions for Stable Feature Selection
    Ben Brahim, Afef
    Limam, Mohamed
    [J]. 2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 306 - 311
  • [3] Incorporating prior domain knowledge into a kernel based feature selection algorithm
    Yu, Ting
    Simoff, Simeon J.
    Stokes, Donald
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 : 1064 - +
  • [4] Informative variable identifier: Expanding interpretability in feature selection
    Munoz-Romero, Sergio
    Gorostiaga, Arantza
    Soguero-Ruiz, Cristina
    Mora-Jimenez, Inmaculada
    Rojo-Alvarez, Jose Luis
    [J]. PATTERN RECOGNITION, 2020, 98
  • [5] Ensembling improves stability and power of feature selection for deep learning models
    Gyawali, Prashnna K.
    Liu, Xiaoxia
    Zou, James
    He, Zihuai
    [J]. MACHINE LEARNING IN COMPUTATIONAL BIOLOGY, VOL 200, 2022, 200
  • [6] Feature fusion improves performance and interpretability of machine learning models in identifying soil pollution of potentially contaminated sites
    Lu, Xiaosong
    Du, Junyang
    Zheng, Liping
    Wang, Guoqing
    Li, Xuzhi
    Sun, Li
    Huang, Xinghua
    [J]. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 2023, 259
  • [7] Automatic Feature Selection for Improved Interpretability on Whole Slide Imaging
    Pirovano, Antoine
    Heuberger, Hippolyte
    Berlemont, Sylvain
    Ladjal, SaId
    Bloch, Isabelle
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2021, 3 (01): : 243 - 262
  • [8] Using prior knowledge to improve the performance of an estimation of distribution algorithm applied to feature selection
    Emmendorfer, LR
    Traleski, R
    Pozo, ATR
    [J]. HIS 2005: 5th International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 393 - 398
  • [9] Bandit Interpretability of Deep Models via Confidence Selection
    Duan, Xiaoyue
    Li, Hong
    Wang, Panpan
    Wang, Tiancheng
    Liu, Boyu
    Zhang, Baochang
    [J]. NEUROCOMPUTING, 2023, 544
  • [10] The Influence of Feature Selection Methods on Accuracy, Stability and Interpretability of Molecular Signatures
    Haury, Anne-Claire
    Gestraud, Pierre
    Vert, Jean-Philippe
    [J]. PLOS ONE, 2011, 6 (12):