Mutual information for the selection of relevant variables in spectrometric nonlinear modelling

被引:156
|
作者
Rossi, F
Lendasse, A
François, D
Wertz, V
Verleysen, M
机构
[1] Catholic Univ Louvain, Machine Learning Grp, DICE, B-1348 Louvain, Belgium
[2] Univ Paris 01, SAMOS, MATISSE, F-75634 Paris 13, France
[3] INRIA Rocquencourt, Projet AxIS, F-78153 Le Chesnay, France
[4] Aalto Univ, Lab Comp & Informat Sci, Neural Networks Res Ctr, FIN-02015 Helsinki, Finland
[5] Catholic Univ Louvain, Machine Learning Grp, CESAME, B-1348 Louvain, Belgium
关键词
D O I
10.1016/j.chemolab.2005.06.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data from spectropbotometers form vectors of a large number of exploitable variables. Building quantitative models using these variables most often requires using a smaller set of variables than the initial one. Indeed, a too large number of input variables to a model results in a too large number of parameters, leading to overfitting and poor generalization abilities. In this paper, we suggest the use of the mutual information measure to select variables from the initial set. The mutual information measures the information content in input variables with respect to the model output, without making any assumption on the model that will be used; it is thus suitable for nonlinear modelling. In addition, it leads to the selection of variables among the initial set, and not to linear or nonlinear combinations of them. Without decreasing the model performances compared to other variable projection methods, it allows therefore a greater interpretability of the results. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:215 / 226
页数:12
相关论文
共 50 条
  • [1] Relevant harmonics selection based on mutual information for electrical appliances identification
    Hacine-Gharbi, Abdenour
    Ravier, Philippe
    Nait-Meziane, Mohamed
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 62 (02) : 102 - 115
  • [2] A Mutual Information estimator for continuous and discrete variables applied to Feature Selection and Classification problems
    Frederico Coelho
    Antonio P. Braga
    Michel Verleysen
    [J]. International Journal of Computational Intelligence Systems, 2016, 9 : 726 - 733
  • [3] A Mutual Information estimator for continuous and discrete variables applied to Feature Selection and Classification problems
    Coelho, Frederico
    Braga, Antonio P.
    Verleysen, Michel
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2016, 9 (04) : 726 - 733
  • [4] Multilabel all-relevant feature selection using lower bounds of conditional mutual information
    Teisseyre, Pawel
    Lee, Jaesung
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 216
  • [5] Modelling the quality of enantiomeric separations using Mutual Information as an alternative variable selection technique
    Caetano, Sonia
    Krier, Catherine
    Verleysen, Michel
    Heyden, Yvan Vander
    [J]. ANALYTICA CHIMICA ACTA, 2007, 602 (01) : 37 - 46
  • [6] Soft Sensor Modelling Based on Mutual Information Variable Selection and Partial Least Squares
    Li, Qi
    Du, Xiaodong
    Liu, Wenya
    Ba, Wei
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 3649 - 3654
  • [7] A New Variable Selection Method Based on Mutual Information Maximization by Replacing Collinear Variables for Nonlinear Quantitative Structure-Property Relationship Models
    Ghasemi, Jahan B.
    Zolfonoun, Ehsan
    [J]. BULLETIN OF THE KOREAN CHEMICAL SOCIETY, 2012, 33 (05) : 1527 - 1535
  • [8] Modeling Categorical Variables by Mutual Information Decomposition
    Liou, Jiun-Wei
    Liou, Michelle
    Cheng, Philip E.
    [J]. ENTROPY, 2023, 25 (05)
  • [9] On the Mutual Information between Random Variables in Networks
    Xu, Xiaoli
    Thakor, Satyajit
    Guan, Yong Liang
    [J]. 2013 IEEE INFORMATION THEORY WORKSHOP (ITW), 2013,
  • [10] Measuring distances between variables by mutual information
    Steuer, R
    Daub, CO
    Selbig, J
    Kurths, J
    [J]. INNOVATIONS IN CLASSIFICATION, DATA SCIENCE, AND INFORMATION SYSTEMS, 2005, : 81 - 90