Detecting Three-Dimensional Associations in Large Data Set

被引:1
|
作者
Liu Chuanlu [1 ]
Wang Shuliang [1 ,2 ]
Yuan Hanning [1 ]
Geng Jing [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Inst E Govt, Beijing 100081, Peoples R China
关键词
Associations; Three-dimensional variables; Mutual information; Iterative optimization; MAXIMAL INFORMATION COEFFICIENT; DATA-ANALYTICS; DEPENDENCE;
D O I
10.1049/cje.2021.08.008
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The associations detection among variables in the large dataset is recently important due to the rapid growth rate of data. The interested associations can provide references for solving the problems such as dimension reduction and feature selection. Many methods have done on the associations detection of pairwise variables. The multi-dimensional variables, especially three-dimensional variables, is rarely studied. The relationships among them cannot be revealed by the detection of pairwise variables methods. A new method of Maximal three-dimensional information coefficient (MTDIC) is proposed which is able to indicate the associations of three-dimensional variables. The correlation coefficient is calculated from the three-dimensional mutual information. The World Health Organization (WHO) data and the Tara data are selected to evaluate their associations. The experiment is verified by comparing the coefficient results with the Distance correlation (Dcor). The accurate association strength is obtained by an iterative optimization procedure on sorting descending order of coefficients. The MTDIC performs better than the Dcor in generality and equitability properties.
引用
收藏
页码:1131 / 1140
页数:10
相关论文
共 50 条
  • [1] Detecting Three-Dimensional Associations in Large Data Set
    LIU Chuanlu
    WANG Shuliang
    YUAN Hanning
    GENG Jing
    [J]. Chinese Journal of Electronics, 2021, 30 (06) : 1131 - 1140
  • [2] ROBUST DEPENDENCE MEASURE FOR DETECTING ASSOCIATIONS IN LARGE DATA SET
    蒋杭进
    吴琼莉
    [J]. Acta Mathematica Scientia, 2018, 38 (01) : 57 - 72
  • [3] ROBUST DEPENDENCE MEASURE FOR DETECTING ASSOCIATIONS IN LARGE DATA SET
    Jiang, Hangjin
    Wu, Qiongli
    [J]. ACTA MATHEMATICA SCIENTIA, 2018, 38 (01) : 57 - 72
  • [4] Large three-dimensional data set segmentation using a graphtheoretic energy-minimization approach
    Parker, B
    Feng, DD
    [J]. MEDICAL IMAGING 2003: IMAGE PROCESSING, PTS 1-3, 2003, 5032 : 491 - 499
  • [5] Recursive partitioning analysis of a large structure-activity data set using three-dimensional descriptors
    Chen, X
    Rusinko, A
    Young, SS
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (06): : 1054 - 1062
  • [6] A fast level set framework for large three-dimensional topography simulations
    Ertl, Otmar
    Selberherr, Siegfried
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2009, 180 (08) : 1242 - 1250
  • [7] Maximizing the value of forest restoration for tropical mammals by detecting three-dimensional habitat associations
    Deere, Nicolas J.
    Guillera-Arroita, Gurutzeta
    Swinfield, Tom
    Milodowski, David T.
    Coomes, David A.
    Bernard, Henry
    Reynolds, Glen
    Davies, Zoe G.
    Struebig, Matthew J.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (42) : 26254 - 26262
  • [8] Three-dimensional nanostructure determination from a large diffraction data set recorded using scanning electron nanodiffraction
    Meng, Yifei
    Zuo, Jian-Min
    [J]. IUCRJ, 2016, 3 : 300 - 308
  • [9] Large Viewing Three-Dimensional Hydrodynamical Data by Ray Tracing
    Ruffert, M.
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 1991, 2 (01): : 461 - 467
  • [10] Detecting Novel Associations in Large Data Sets
    Reshef, David N.
    Reshef, Yakir A.
    Finucane, Hilary K.
    Grossman, Sharon R.
    McVean, Gilean
    Turnbaugh, Peter J.
    Lander, Eric S.
    Mitzenmacher, Michael
    Sabeti, Pardis C.
    [J]. SCIENCE, 2011, 334 (6062) : 1518 - 1524