Gaussian Graphical Model Exploration and Selection in High Dimension Low Sample Size Setting

被引:5
|
作者
Lartigue, Thomas [1 ,2 ]
Bottani, Simona [3 ]
Baron, Stephanie [4 ]
Colliot, Olivier [3 ]
Durrleman, Stanley [3 ]
Allassonniere, Stephanie [5 ]
机构
[1] Ecole Polytech, IP, CMAP, CNRS, Paris, France
[2] INRIA, Aramis Project Team, F-91128 Palaiseau, France
[3] Sorbonne Univ, CNRS UMR 7225, Inserm U1127, Aramis Project Team,Inria,Inst Cerveau & Moelle E, F-75004 Paris, France
[4] Hop Europeen Georges Pompidou, AP HP, F-75015 Paris, France
[5] Sorbonne Univ, INSERM, Univ Paris, Ctr Rech Cordeliers, F-75006 Paris, France
基金
欧洲研究理事会;
关键词
Correlation; Covariance matrices; Measurement; Graphical models; Gaussian distribution; Sparse representation; Alzheimer's disease; Gaussian graphical models; model selection; high dimension low sample size; sparse matrices; maximum likelihood estimation; MAXIMUM-LIKELIHOOD-ESTIMATION; COVARIANCE ESTIMATION; SPARSE ESTIMATION; LASSO;
D O I
10.1109/TPAMI.2020.2980542
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gaussian graphical models (GGM) are often used to describe the conditional correlations between the components of a random vector. In this article, we compare two families of GGM inference methods: the nodewise approach and the penalised likelihood maximisation. We demonstrate on synthetic data that, when the sample size is small, the two methods produce graphs with either too few or too many edges when compared to the real one. As a result, we propose a composite procedure that explores a family of graphs with a nodewise numerical scheme and selects a candidate among them with an overall likelihood criterion. We demonstrate that, when the number of observations is small, this selection method yields graphs closer to the truth and corresponding to distributions with better KL divergence with regards to the real distribution than the other two. Finally, we show the interest of our algorithm on two concrete cases: first on brain imaging data, then on biological nephrology data. In both cases our results are more in line with current knowledge in each field.
引用
收藏
页码:3196 / 3213
页数:18
相关论文
共 50 条
  • [21] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    [J]. PATTERN RECOGNITION, 2022, 130
  • [22] Consistency of sparse PCA in High Dimension, Low Sample Size contexts
    Shen, Dan
    Shen, Haipeng
    Marron, J. S.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 115 : 317 - 333
  • [23] Classification for high-dimension low-sample size data
    Shen, Liran
    Er, Meng Joo
    Yin, Qingbo
    [J]. PATTERN RECOGNITION, 2022, 130
  • [24] Deep Neural Networks for High Dimension, Low Sample Size Data
    Liu, Bo
    Wei, Ying
    Zhang, Yu
    Yang, Qiang
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2287 - 2293
  • [25] Boundary behavior in High Dimension, Low Sample Size asymptotics of PCA
    Jung, Sungkyu
    Sen, Arusharka
    Marron, J. S.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 109 : 190 - 203
  • [26] Objective Bayesian model selection in Gaussian graphical models
    Carvalho, C. M.
    Scott, J. G.
    [J]. BIOMETRIKA, 2009, 96 (03) : 497 - 512
  • [27] Bias-corrected support vector machine with Gaussian kernel in high-dimension, low-sample-size settings
    Nakayama, Yugo
    Yata, Kazuyoshi
    Aoshima, Makoto
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2020, 72 (05) : 1257 - 1286
  • [28] Bias-corrected support vector machine with Gaussian kernel in high-dimension, low-sample-size settings
    Yugo Nakayama
    Kazuyoshi Yata
    Makoto Aoshima
    [J]. Annals of the Institute of Statistical Mathematics, 2020, 72 : 1257 - 1286
  • [29] AN ALGORITHM FOR REDUCING THE DIMENSION AND SIZE OF A SAMPLE FOR DATA EXPLORATION PROCEDURES
    Kulczycki, Piotr
    Lukasik, Szymon
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2014, 24 (01) : 133 - 149
  • [30] ESTIMATION OF COVARIANCE MATRIX DISTANCES IN THE HIGH DIMENSION LOW SAMPLE SIZE REGIME
    Tiomoko, Malik
    Couillet, Romain
    [J]. 2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 341 - 345