Feature subset selection and feature ranking for multivariate time series

被引:123
|
作者
Yoon, H [1 ]
Yang, KY [1 ]
Shahabi, C [1 ]
机构
[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
data mining; feature evaluation and selection; feature extraction or construction; time series analysis; feature representation;
D O I
10.1109/TKDE.2005.144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature subset selection (FSS) is a known technique to preprocess the data before performing any data mining tasks, e.g., classification and clustering. FSS provides both cost-effective predictors and a better understanding of the underlying process that generated the data. We propose a family of novel unsupervised methods for feature subset selection from Multivariate Time Series (MTS) based on Common Principal Component Analysis, termed CLeVer. Traditional FSS techniques, such as Recursive Feature Elimination (RFE) and Fisher Criterion (FC), have been applied to MTS data sets, e.g., Brain Computer Interface (BCI) data sets. However, these techniques may lose the correlation information among features, while our proposed techniques utilize the properties of the principal component analysis to retain that information. In order to evaluate the effectiveness of our selected subset of features, we employ classification as the target data mining task. Our exhaustive experiments show that CLeVer outperforms RFE, FC, and random selection by up to a factor of two in terms of the classification accuracy, while taking up to 2 orders of magnitude less processing time than RFE and FC.
引用
收藏
页码:1186 / 1198
页数:13
相关论文
共 50 条
  • [1] CLeVer:: A feature subset selection technique for multivariate time series
    Yang, KY
    Yoon, H
    Shahabi, C
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 516 - 522
  • [2] Feature subset selection on multivariate time series with extremely large spatial features
    Yoon, Hyunjin
    Shahabi, Cyrus
    [J]. ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 337 - +
  • [3] Mutual information based feature subset selection in multivariate time series classification
    Ircio, Josu
    Lojo, Aizea
    Mori, Usue
    Lozano, Jose A.
    [J]. PATTERN RECOGNITION, 2020, 108
  • [4] An Adaptive Multiple Feature Subset Method for Feature Ranking and Selection
    Chang, Fu
    Chen, Jen-Cheng
    [J]. INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 255 - 262
  • [5] Feature ranking based consensus clustering for feature subset selection
    Rani, D. Sandhya
    Rani, T. Sobha
    Bhavani, S. Durga
    Krishna, G. Bala
    [J]. APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8154 - 8169
  • [6] A novel grey-based feature ranking method for feature subset selection
    Huang, Chi-Chun
    Chang, Hsin-Yun
    Yang, Cheng-Hong
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2008, 31 (03) : 509 - 514
  • [7] A novel grey-based feature ranking method for feature subset selection
    Huang, Chi-Chun
    Chang, Hsin-Yun
    Yang, Cheng-Hong
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 129 - 132
  • [8] Feature subset selection and ranking for data dimensionality reduction
    Wei, Hua-Liang
    Billings, Stephen A.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (01) : 162 - 166
  • [9] Feature Selection for Multivariate Time Series via Network Pruning
    Gu, Kang
    Vosoughi, Soroush
    Prioleau, Temiloluwa
    [J]. 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 1017 - 1024
  • [10] Feature selection techniques with class separability for multivariate time series
    Han, Min
    Liu, Xiaoxin
    [J]. NEUROCOMPUTING, 2013, 110 : 29 - 34