Knowledge-based gene expression classification via matrix factorization

被引:27
|
作者
Schachtner, R. [1 ]
Lutter, D. [1 ,2 ,3 ]
Knollmueller, P. [1 ]
Tome, A. M. [4 ]
Theis, F. J. [1 ,2 ]
Schmitz, G. [3 ]
Stetter, M. [5 ]
Vilda, P. Gomez [6 ]
Lang, E. W. [1 ]
机构
[1] Univ Regensburg, CIML Biophys, D-93040 Regensburg, Germany
[2] GSF Munich, CMB IBI, Munich, Germany
[3] Univ Hosp Regensburg, D-93042 Regensburg, Germany
[4] Univ Aveiro, IEETA DETI, P-3810193 Aveiro, Portugal
[5] Siemens AG, Siemens Corp Technol, D-8000 Munich, Germany
[6] Univ Politecn Madrid, DATSI FI, E-18500 Madrid, Spain
关键词
D O I
10.1093/bioinformatics/btn245
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Modern machine learning methods based on matrix decomposition techniques, like independent component analysis (ICA) or non-negative matrix factorization (NMF), provide new and efficient analysis tools which are currently explored to analyze gene expression profiles. These exploratory feature extraction techniques yield expression modes (ICA) or metagenes (NMF). These extracted features are considered indicative of underlying regulatory processes. They can as well be applied to the classification of gene expression datasets by grouping samples into different categories for diagnostic purposes or group genes into functional categories for further investigation of related metabolic pathways and regulatory networks. Results: In this study we focus on unsupervised matrix factorization techniques and apply ICA and sparse NMF to microarray datasets. The latter monitor the gene expression levels of human peripheral blood cells during differentiation from monocytes to macrophages. We show that these tools are able to identify relevant signatures in the deduced component matrices and extract informative sets of marker genes from these gene expression profiles. The methods rely on the joint discriminative power of a set of marker genes rather than on single marker genes. With these sets of marker genes, corroborated by leave-one-out or random forest cross-validation, the datasets could easily be classified into related diagnostic categories. The latter correspond to either monocytes versus macrophages or healthy vs Niemann Pick C disease patients.
引用
下载
收藏
页码:1688 / 1697
页数:10
相关论文
共 50 条
  • [21] On α-divergence based nonnegative matrix factorization for clustering cancer gene expression data
    Liu, Weixiang
    Yuan, Kehong
    Ye, Datian
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2008, 44 (01) : 1 - 5
  • [22] Knowledge-based control via the Internet
    Tang, KZ
    Goh, HL
    Tan, KK
    Lee, TH
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2004, 2 (02) : 207 - 219
  • [23] Classification and Clustering via Structure-enforced Matrix Factorization
    Xu, Lijun
    Zhou, Yijia
    Yu, Bo
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, ISCIDE 2017, 2017, 10559 : 403 - 411
  • [24] Binary matrix factorization for analyzing gene expression data
    Zhang, Zhong-Yuan
    Li, Tao
    Ding, Chris
    Ren, Xian-Wen
    Zhang, Xiang-Sun
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (01) : 28 - 52
  • [25] Binary matrix factorization for analyzing gene expression data
    Zhong-Yuan Zhang
    Tao Li
    Chris Ding
    Xian-Wen Ren
    Xiang-Sun Zhang
    Data Mining and Knowledge Discovery, 2010, 20 : 28 - 52
  • [26] Development of non-negative matrix factorization with knowledge-based constraints for DNA methylation array analysis.
    Takasawa, Ken
    Hamamoto, Ryuji
    CANCER SCIENCE, 2021, 112 : 674 - 674
  • [27] Leveraging Knowledge-based Inference for Material Classification
    Yu, Jie
    Skaff, Sandra
    Peng, Liang
    Imai, Francisco
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1243 - 1246
  • [28] Knowledge-based classification in automated soil mapping
    Zhou, B
    Wang, RC
    PEDOSPHERE, 2003, 13 (03) : 209 - 218
  • [29] Knowledge-based systems for arrhythmia detection and classification
    Oikonomou, VP
    Tsipouras, MG
    Fotiadis, DI
    Sideris, DA
    ADVANCES IN SCATTERING AND BIOMEDICAL ENGINEERING, PROCEEDINGS, 2004, : 524 - 531
  • [30] On knowledge-based classification of abnormal BGP events
    Li, Jun
    Dou, Dejing
    Kim, Shiwoong
    Qin, Han
    Wang, Yibo
    INFORMATION SYSTEMS SECURITY, PROCEEDINGS, 2007, 4812 : 267 - 271