Detecting Associations Based on the Multi-Variable Maximum Information Coefficient

被引:10
|
作者
Gu, Taoyong [1 ]
Guo, Jiansheng [1 ]
Li, Zhengxin [1 ,2 ,3 ]
Mao, Sheng [1 ]
机构
[1] Air Force Engn Univ, Equipment Management & UAV Engn Coll, Xian 710051, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710072, Peoples R China
来源
IEEE ACCESS | 2021年 / 9卷
基金
中国国家自然科学基金;
关键词
Microwave integrated circuits; Mutual information; Entropy; Feature extraction; Computer science; Time complexity; Licenses; Data mining; association detection; information entropy; maximum information coefficient; upper confidence bound;
D O I
10.1109/ACCESS.2021.3070925
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The maximum information coefficient (MIC) is a novel and widely-using measure of association detection in large datasets. The most outstanding feature of MIC is that it has both generality and equability. However, MIC can only deal with two variables and cannot precisely estimate coupling associations of multiple variables. In this paper, we propose an extension of MIC to deal with multi-variable datasets, called the multi-variable maximum information coefficient (MMIC). Some inherited and novel properties of MMIC are proved, including generality, equability, monotonicity, and subadditivity. We design an algorithm based on greedy stepwise strategy and upper confidence bound (UCB) for an approximate calculation of MMIC. The tests of MMIC on generated datasets and examples on real datasets are carried out to detect known and novel associations.
引用
收藏
页码:54912 / 54922
页数:11
相关论文
共 50 条
  • [1] Detecting Associations Based on the Multi-Variable Maximum Information Coefficient
    Gu, Taoyong
    Guo, Jiansheng
    Li, Zhengxin
    Mao, Sheng
    IEEE Access, 2021, 9 : 54912 - 54922
  • [2] Identifying multi-variable relationships based on the maximal information coefficient
    Shao, Fubo
    Li, Keping
    Dong, Yulin
    INTELLIGENT DATA ANALYSIS, 2017, 21 (01) : 151 - 166
  • [3] Detecting Novel Multi-variable Associations in Big Data Based on MIC
    Shao, Fubo
    Li, Keping
    PROCEEDINGS OF 2015 IEEE 5TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION, 2015, : 39 - 42
  • [4] A multi-variable detecting sensor and its application
    Koyama, Kenji
    Watanabe, Kajiro
    Kobayashi, Kazuyuki
    Kurihara, Tosuke
    Ishigaki, Tsukasa
    Higuchi, Tomoyuki
    2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 1004 - +
  • [5] Variable Selection based on Maximum Information Coefficient for Data Modeling
    Chu, Fuchang
    Fan, Zhenping
    Guo, Baohui
    Zhi, Dan
    Yin, Zijian
    Zhao, Wenjie
    2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2017, : 1714 - 1717
  • [6] Active Interface for Piezoelectric Harvesters Based on Multi-Variable Maximum Power Point Tracking
    Costanzo, Luigi
    Lo Schiavo, Alessandro
    Vitelli, Massimo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (07) : 2503 - 2515
  • [7] Multi-variable instruments
    Control Instrum, 9 (47):
  • [8] Multi-variable PDE model of evaluation for distributed information retrieval
    Wang, Xiuhong
    Ju, Shiguang
    SCIENTIFIC RESEARCH AND ESSAYS, 2010, 5 (17): : 2574 - 2578
  • [9] Variable selection in multi-variable models of toxicity based on logistic regression
    Van der Schaaf, A.
    Langendijk, J. A.
    RADIOTHERAPY AND ONCOLOGY, 2015, 115 : S108 - S108
  • [10] A multi-variable theta product
    Berndt, BC
    Zaharescu, A
    INDAGATIONES MATHEMATICAE-NEW SERIES, 2002, 13 (01): : 11 - 22