Variable Selection based on Maximum Information Coefficient for Data Modeling

被引:0
|
作者
Chu, Fuchang [1 ]
Fan, Zhenping [1 ]
Guo, Baohui [1 ]
Zhi, Dan [1 ]
Yin, Zijian [1 ]
Zhao, Wenjie [1 ]
机构
[1] North China Elect Power Univ, Coll Automat, Baoding, Peoples R China
关键词
mutual information; variable selection; maximum information coefficient; MUTUAL INFORMATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Whether the variable selection is accurate or not affect the accuracy and generalization ability of the model. The traditional variable selection method is difficult to maintain a high stability under high collinearity. In order to solve the problem, we propose a new method MICFS (Feature Select based on Maximal Information Coefficient), which combines the maximum information coefficient with the existing mutual information variable selection method. Firstly, this paper introduces the theory of mutual information and the variable selection algorithm based on mutual information, and then use the maximum information coefficient instead of the original mutual information criterion. Finally, the validity of method is verified by using the Friedman data set. The result shows that this method can meet the requirements of variable selection in a high collinearity and high noise environment.
引用
收藏
页码:1714 / 1717
页数:4
相关论文
共 50 条
  • [21] Unified Variable Selection for Varying Coefficient Models with Longitudinal Data
    XU Xiaoli
    ZHOU Yan
    ZHANG Kongsheng
    ZHAO Mingtao
    JournalofSystemsScience&Complexity, 2023, 36 (02) : 822 - 842
  • [22] Fault detection using grouped support vector data description based on maximum information coefficient
    Zhang Y.
    Wang Z.
    Huagong Xuebao/CIESC Journal, 2023, 74 (09): : 3865 - 3878
  • [23] Empirical Likelihood Based Variable Selection for Varying Coefficient Partially Linear Models with Censored Data
    Peixin ZHAO
    数学研究及应用, 2013, 33 (04) : 493 - 504
  • [24] Empirical Likelihood Based Variable Selection for Varying Coefficient Partially Linear Models with Censored Data
    Peixin ZHAO
    Journal of Mathematical Research with Applications, 2013, (04) : 493 - 504
  • [25] Variable selection for modeling the absolute magnitude at maximum of Type Ia supernovae
    Uemura, Makoto
    Kawabata, Koji S.
    Ikeda, Shiro
    Maeda, Keiichi
    PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF JAPAN, 2015, 67 (03)
  • [26] Information push model-building based on maximum mutual information coefficient
    Tan S.-Q.
    Zhang X.
    Li Q.
    Ai C.
    Ai, Chen (33979639@qq.com), 2018, Editorial Board of Jilin University (48): : 558 - 563
  • [27] Model Detection and Variable Selection for Varying Coefficient Models with Longitudinal Data
    Feng, San Ying
    Hu, Yu Ping
    Xue, Liu Gen
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2016, 32 (03) : 331 - 350
  • [28] Model detection and variable selection for varying coefficient models with longitudinal data
    San Ying Feng
    Yu Ping Hu
    Liu Gen Xue
    Acta Mathematica Sinica, English Series, 2016, 32 : 331 - 350
  • [29] Model Detection and Variable Selection for Varying Coefficient Models with Longitudinal Data
    San Ying FENG
    Yu Ping HU
    Liu Gen XUE
    Acta Mathematica Sinica,English Series, 2016, 32 (03) : 331 - 350
  • [30] Model Detection and Variable Selection for Varying Coefficient Models with Longitudinal Data
    San Ying FENG
    Yu Ping HU
    Liu Gen XUE
    ActaMathematicaSinica, 2016, 32 (03) : 331 - 350