New method for instance or prototype selection using mutual information in time series prediction

被引:35
|
作者
Guillen, A. [1 ]
Herrera, L. J. [1 ]
Rubio, G. [1 ]
Pomares, H. [1 ]
Lendasse, A. [2 ]
Rojas, I. [1 ]
机构
[1] Univ Granada, Dept Comp Technol & Architecture, E-18071 Granada, Spain
[2] Aalto Univ, Dept Informat & Comp Sci, FIN-02150 Espoo, Finland
关键词
Time series; Regression; Prediction; Mutual information; Prototype; Instance; Selection; VARIABLE SELECTION; OUTLIER DETECTION; ALGORITHM; DESIGN;
D O I
10.1016/j.neucom.2009.11.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of selecting the patterns to be learned by any model is usually not considered by the time of designing the concrete model but as a preprocessing step. Information theory provides a robust theoretical framework for performing input variable selection thanks to the concept of mutual information. Recently the computation of the mutual information for regression tasks has been proposed so this paper presents a new application of the concept of mutual information not to select the variables but to decide which prototypes should belong to the training data set in regression problems. The proposed methodology consists in deciding if a prototype should belong to or not to the training set using as criteria the estimation of the mutual information between the variables. The novelty of the approach is to focus in prototype selection for regression problems instead of classification as the majority of the literature deals only with the last one. Other element that distinguishes this work from others is that it is not proposed as an outlier detector but as an algorithm that determines the best subset of input vectors by the time of building a model to approximate it. As the experiment section shows, this new method is able to identify a high percentage of the real data set when it is applied to highly distorted data sets. (C) 2010 Elsevier B.V. All rights reserved.
引用
下载
收藏
页码:2030 / 2038
页数:9
相关论文
共 50 条
  • [31] A causal discovery method for time series data based on mutual information measurement
    Li, De-Zhi
    Lu, Yun-Jun
    Wu, Jian-Ping
    Li, Qiang
    Kongzhi yu Juece/Control and Decision, 2024, 39 (09): : 3151 - 3159
  • [32] A new method for prediction of stationary time series using the Riemann sum approximation
    Mohammadi, Mohammad
    DIGITAL SIGNAL PROCESSING, 2022, 123
  • [33] Time Series Prediction of Retirement Mutual Fund Values using Optimal Window Size Selection and Support Vector Regression
    Sukkachart, Piyawadee
    Surapholchai, Chotiros
    Lipikorn, Rajalida
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2017, : 332 - 335
  • [34] Joint mutual information-based input variable selection for multivariate time series modeling
    Han, Min
    Ren, Weijie
    Liu, Xiaoxin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 250 - 257
  • [35] Genetic Algorithm for the Mutual Information-Based Feature Selection in Univariate Time Series Data
    Siddiqi, Umair F.
    Sait, Sadiq M.
    Kaynak, Okyay
    IEEE ACCESS, 2020, 8 (08): : 9597 - 9609
  • [36] Time series analysis of categorical data using auto-mutual information
    Biswas, Atanu
    Guha, Apratim
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (09) : 3076 - 3087
  • [37] Efficient Temporal Pattern Mining in Big Time Series Using Mutual Information
    Ho, Van Long
    Ho, Nguyen
    Pedersen, Torben Bach
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 15 (03): : 673 - 685
  • [38] An Automatic Method of Autoregression Part Selection for Gaussian Time Series Prediction
    Tang, Lijuan
    Sun, Siyu
    Wang, Xigang
    Shen, Qihao
    Ren, Jia
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 5572 - 5579
  • [39] Channel Selection Method for EEG Emotion Recognition Using Normalized Mutual Information
    Wang, Zhong-Min
    Hu, Shu-Yuan
    Song, Hui
    IEEE ACCESS, 2019, 7 : 143303 - 143311
  • [40] The development of the significant peak selection method using the mutual information and its completeness
    Fujikura, T
    Sakamoto, K
    Shimozawa, JT
    ANALYTICA CHIMICA ACTA, 1997, 351 (1-3) : 387 - 396