New method for instance or prototype selection using mutual information in time series prediction

被引:35
|
作者
Guillen, A. [1 ]
Herrera, L. J. [1 ]
Rubio, G. [1 ]
Pomares, H. [1 ]
Lendasse, A. [2 ]
Rojas, I. [1 ]
机构
[1] Univ Granada, Dept Comp Technol & Architecture, E-18071 Granada, Spain
[2] Aalto Univ, Dept Informat & Comp Sci, FIN-02150 Espoo, Finland
关键词
Time series; Regression; Prediction; Mutual information; Prototype; Instance; Selection; VARIABLE SELECTION; OUTLIER DETECTION; ALGORITHM; DESIGN;
D O I
10.1016/j.neucom.2009.11.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of selecting the patterns to be learned by any model is usually not considered by the time of designing the concrete model but as a preprocessing step. Information theory provides a robust theoretical framework for performing input variable selection thanks to the concept of mutual information. Recently the computation of the mutual information for regression tasks has been proposed so this paper presents a new application of the concept of mutual information not to select the variables but to decide which prototypes should belong to the training data set in regression problems. The proposed methodology consists in deciding if a prototype should belong to or not to the training set using as criteria the estimation of the mutual information between the variables. The novelty of the approach is to focus in prototype selection for regression problems instead of classification as the majority of the literature deals only with the last one. Other element that distinguishes this work from others is that it is not proposed as an outlier detector but as an algorithm that determines the best subset of input vectors by the time of building a model to approximate it. As the experiment section shows, this new method is able to identify a high percentage of the real data set when it is applied to highly distorted data sets. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:2030 / 2038
页数:9
相关论文
共 50 条
  • [41] Mutual information: a measure of dependency for nonlinear time series
    Dionisio, A
    Menezes, R
    Mendes, DA
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2004, 344 (1-2) : 326 - 329
  • [42] Variable Selection Method Based on Partial Mutual Information and Its Application to NOx Emission Prediction
    Qin Tianmu
    Zhang Jinzhe
    You Mo
    Yang Tingling
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 1017 - 1021
  • [44] Time series online prediction method based on information perception weight and error prediction
    Wang, Hao
    Liu, Zhen
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2020, 41 (11): : 31 - 41
  • [45] A new density-based subspace selection method using mutual information for high dimensional outlier detection
    Riahi-Madvar, Mahboobeh
    Azirani, Ahmad Akbari
    Nasersharif, Babak
    Raahemi, Bijan
    KNOWLEDGE-BASED SYSTEMS, 2021, 216
  • [46] TIME-SERIES - INFORMATION AND PREDICTION
    TEODORESCU, D
    BIOLOGICAL CYBERNETICS, 1990, 63 (06) : 477 - 485
  • [47] Improved Mutual Information Method For Text Feature Selection
    Ding Xiaoming
    Tang Yan
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 163 - 166
  • [48] Selection and Prediction of the Trend of a Time Series Using a Recurrent Neural Network
    Trufanov, N. N.
    Churikov, D., V
    Kravchenko, O., V
    2021 PHOTONICS & ELECTROMAGNETICS RESEARCH SYMPOSIUM (PIERS 2021), 2021, : 2878 - 2884
  • [49] INSIGHT: Efficient and Effective Instance Selection for Time-Series Classification
    Buza, Krisztian
    Nanopoulos, Alexandros
    Schmidt-Thieme, Lars
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6635 : 149 - 160
  • [50] Data partition and variable selection for time series prediction using wrappers
    Puma-Villanueva, Wilfredo J.
    dos Santos, Euripedes P.
    Von Zuben, Fernando J.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 4740 - +