Variable Selection based on Maximum Information Coefficient for Data Modeling

被引:0
|
作者
Chu, Fuchang [1 ]
Fan, Zhenping [1 ]
Guo, Baohui [1 ]
Zhi, Dan [1 ]
Yin, Zijian [1 ]
Zhao, Wenjie [1 ]
机构
[1] North China Elect Power Univ, Coll Automat, Baoding, Peoples R China
关键词
mutual information; variable selection; maximum information coefficient; MUTUAL INFORMATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Whether the variable selection is accurate or not affect the accuracy and generalization ability of the model. The traditional variable selection method is difficult to maintain a high stability under high collinearity. In order to solve the problem, we propose a new method MICFS (Feature Select based on Maximal Information Coefficient), which combines the maximum information coefficient with the existing mutual information variable selection method. Firstly, this paper introduces the theory of mutual information and the variable selection algorithm based on mutual information, and then use the maximum information coefficient instead of the original mutual information criterion. Finally, the validity of method is verified by using the Friedman data set. The result shows that this method can meet the requirements of variable selection in a high collinearity and high noise environment.
引用
收藏
页码:1714 / 1717
页数:4
相关论文
共 50 条
  • [41] MULTIVARIATE NTCP MODELING OF XEROSTOMIA WITH MAXIMUM LIKELIHOOD VARIABLE SELECTION: METHODOLOGY ASSESSMENT
    Xu, C.
    Van der Schaaf, A.
    Schilstra, C.
    Beetz, I.
    Bijl, H. P.
    Hoegen-Chouvalova, O.
    Burlage, F.
    Steenbakkers, R.
    Langendijk, J.
    van 't Veld, A.
    RADIOTHERAPY AND ONCOLOGY, 2010, 96 : S519 - S520
  • [42] Joint mutual information-based input variable selection for multivariate time series modeling
    Han, Min
    Ren, Weijie
    Liu, Xiaoxin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 250 - 257
  • [43] An Information Criterion for Auxiliary Variable Selection in Incomplete Data Analysis
    Imori, Shinpei
    Shimodaira, Hidetoshi
    ENTROPY, 2019, 21 (03):
  • [44] Study on Gas Molecular Structure Parameters Based on Maximum Information Coefficient
    You, Tianpeng
    Dong, Xuzhu
    Zhou, Wenjun
    Zheng, Yu
    Ren, Shubo
    Lei, Hongyu
    IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 2022, 29 (04) : 1633 - 1639
  • [45] A flexible Bayesian variable selection approach for modeling interval data
    Sen, Shubhajit
    Kundu, Damitri
    Das, Kiranmoy
    STATISTICAL METHODS AND APPLICATIONS, 2024, 33 (01): : 267 - 286
  • [46] Walsh-average based variable selection for varying coefficient models
    Kangning Wang
    Lu Lin
    Journal of the Korean Statistical Society, 2015, 44 : 95 - 110
  • [47] Walsh-average based variable selection for varying coefficient models
    Wang, Kangning
    Lin, Lu
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2015, 44 (01) : 95 - 110
  • [48] Robust spline-based variable selection in varying coefficient model
    Long Feng
    Changliang Zou
    Zhaojun Wang
    Xianwu Wei
    Bin Chen
    Metrika, 2015, 78 : 85 - 118
  • [49] Robust spline-based variable selection in varying coefficient model
    Feng, Long
    Zou, Changliang
    Wang, Zhaojun
    Wei, Xianwu
    Chen, Bin
    METRIKA, 2015, 78 (01) : 85 - 118
  • [50] MODEL SELECTION METHOD BASED ON MAXIMAL INFORMATION COEFFICIENT OF RESIDUALS
    Tan, Qiuheng
    Jiang, Hangjin
    Ding, Yiming
    ACTA MATHEMATICA SCIENTIA, 2014, 34 (02) : 579 - 592