Maximally Informative Hierarchical Representations of High-Dimensional Data

被引:0
|
作者
Ver Steeg, Greg [1 ]
Galstyan, Aram [1 ]
机构
[1] Univ Southern Calif, Informat Sci Inst, Los Angeles, CA 90007 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider a set of probabilistic functions of some input variables as a representation of the inputs. We present bounds on how informative a representation is about input data. We extend these bounds to hierarchical representations so that we can quantify the contribution of each layer towards capturing the information in the original data. The special form of these bounds leads to a simple, bottom-up optimization procedure to construct hierarchical representations that are also maximally informative about the data. This optimization has linear computational complexity and constant sample complexity in the number of variables. These results establish a new approach to unsupervised learning of deep representations that is both principled and practical. We demonstrate the usefulness of the approach on both synthetic and real-world data.
引用
收藏
页码:1004 / 1012
页数:9
相关论文
共 50 条
  • [21] GACH: a grid-based algorithm for hierarchical clustering of high-dimensional data
    Mansoori, Eghbal G.
    [J]. SOFT COMPUTING, 2014, 18 (05) : 905 - 922
  • [22] GACH: a grid-based algorithm for hierarchical clustering of high-dimensional data
    Eghbal G. Mansoori
    [J]. Soft Computing, 2014, 18 : 905 - 922
  • [23] On Criticality in High-Dimensional Data
    Saremi, Saeed
    Sejnowski, Terrence J.
    [J]. NEURAL COMPUTATION, 2014, 26 (07) : 1329 - 1339
  • [24] High-dimensional data clustering
    Bouveyron, C.
    Girard, S.
    Schmid, C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 502 - 519
  • [26] High-Dimensional Data Bootstrap
    Chernozhukov, Victor
    Chetverikov, Denis
    Kato, Kengo
    Koike, Yuta
    [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 427 - 449
  • [27] High-dimensional data visualization
    Tang, Lin
    [J]. NATURE METHODS, 2020, 17 (02) : 129 - 129
  • [28] High-dimensional data visualization
    Lin Tang
    [J]. Nature Methods, 2020, 17 : 129 - 129
  • [29] Modeling High-Dimensional Data
    Vempala, Santosh S.
    [J]. COMMUNICATIONS OF THE ACM, 2012, 55 (02) : 112 - 112
  • [30] High-dimensional Data Cubes
    John, Sachin Basil
    Koch, Christoph
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (13): : 3828 - 3840