Deep kernel dimensionality reduction for scalable data integration

被引:2
|
作者
Sokolovska, Nataliya [1 ,2 ,3 ]
Clement, Karine [1 ,2 ,3 ]
Zucker, Jean-Daniel [1 ,2 ,4 ]
机构
[1] Hop La Pitie Salpetriere, Assistance Publ Hop Paris, ICAN, Inst Cardiometab & Nutr, Paris, France
[2] Univ Paris 06, Sorbonne Univ, UMR S 1166, NutriOm Team,ICAN, Paris, France
[3] INSERM, UMR S U1166, NutriOm Team, Paris, France
[4] UMMISCO, UMI 209, Inst Dev Res, Bondy, France
关键词
Dimensionality reduction; Heterogeneous data integration;
D O I
10.1016/j.ijar.2016.03.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction is used to preserve significant properties of data in a low dimensional space. In particular, data representation in a lower dimension is needed in applications, where information comes from multiple high dimensional sources. Data integration, however, is a challenge in itself. In this contribution, we consider a general framework to perform dimensionality reduction taking into account that data are heterogeneous. We propose a novel approach, called Deep Kernel Dimensionality Reduction which is designed for learning layers of new compact data representations simultaneously. The method can be also used to learn shared representations between modalities. We show by experiments on standard and on real large-scale biomedical data sets that the proposed method embeds data in a new compact meaningful representation, and leads to a lower classification error compared to the state-of-the-art methods. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:121 / 132
页数:12
相关论文
共 50 条
  • [1] A scalable supervised algorithm for dimensionality reduction on streaming data
    Yan, Jun
    Zhang, Benyu
    Yan, Shuicheng
    Liu, Ning
    Yang, Qiang
    Cheng, Qiansheng
    Li, Hua
    Chen, Zheng
    Ma, Wei-Ying
    INFORMATION SCIENCES, 2006, 176 (14) : 2042 - 2065
  • [2] MultiMAP: dimensionality reduction and integration of multimodal data
    Mika Sarkin Jain
    Krzysztof Polanski
    Cecilia Dominguez Conde
    Xi Chen
    Jongeun Park
    Lira Mamanova
    Andrew Knights
    Rachel A. Botting
    Emily Stephenson
    Muzlifah Haniffa
    Austen Lamacraft
    Mirjana Efremova
    Sarah A. Teichmann
    Genome Biology, 22
  • [3] MultiMAP: dimensionality reduction and integration of multimodal data
    Jain, Mika Sarkin
    Polanski, Krzysztof
    Conde, Cecilia Dominguez
    Chen, Xi
    Park, Jongeun
    Mamanova, Lira
    Knights, Andrew
    Botting, Rachel A.
    Stephenson, Emily
    Haniffa, Muzlifah
    Lamacraft, Austen
    Efremova, Mirjana
    Teichmann, Sarah A.
    GENOME BIOLOGY, 2021, 22 (01)
  • [4] Textual data dimensionality reduction - a deep learning approach
    Neetu Kushwaha
    Millie Pant
    Multimedia Tools and Applications, 2020, 79 : 11039 - 11050
  • [5] Twin Kernel Embedding with relaxed constraints on dimensionality reduction for structured data
    Guo, Yi
    Gao, Junbin
    Kwan, Paul W.
    AI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4830 : 659 - +
  • [6] Sparse Kernel Entropy Component Analysis for Dimensionality Reduction of Neuroimaging Data
    Jiang, Qikun
    Shi, Jun
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 3366 - 3369
  • [7] Sparse kernel entropy component analysis for dimensionality reduction of biomedical data
    Shi, Jun
    Jiang, Qikun
    Zhang, Qi
    Huang, Qinghua
    Li, Xuelong
    NEUROCOMPUTING, 2015, 168 : 930 - 940
  • [8] Data visualization and dimensionality reduction using kernel maps with a reference point
    Suykens, Johan A. K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (09): : 1501 - 1517
  • [9] Multiple Kernel Learning for Dimensionality Reduction
    Lin, Yen-Yu
    Liu, Tyng-Luh
    Fuh, Chiou-Shann
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (06) : 1147 - 1160
  • [10] Kernel dimensionality reduction for supervised learning
    Fukumizu, K
    Bach, FR
    Jordan, MI
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 81 - 88