Maximally Informative Hierarchical Representations of High-Dimensional Data

Cited by: 0
Authors
Ver Steeg, Greg [1]
Galstyan, Aram [1]
Affiliations
[1] Univ Southern Calif, Informat Sci Inst, Los Angeles, CA 90007 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We consider a set of probabilistic functions of some input variables as a representation of the inputs. We present bounds on how informative a representation is about input data. We extend these bounds to hierarchical representations so that we can quantify the contribution of each layer towards capturing the information in the original data. The special form of these bounds leads to a simple, bottom-up optimization procedure to construct hierarchical representations that are also maximally informative about the data. This optimization has linear computational complexity and constant sample complexity in the number of variables. These results establish a new approach to unsupervised learning of deep representations that is both principled and practical. We demonstrate the usefulness of the approach on both synthetic and real-world data.
Pages: 1004-1012
Number of pages: 9