Multiclass Nonnegative Matrix Factorization for Comprehensive Feature Pattern Discovery

被引:5
|
作者
Li, Yifeng [1 ]
Pan, Youlian [1 ]
Liu, Ziying [1 ]
机构
[1] Natl Res Council Canada, Digital Technol Res Ctr, Ottawa, ON K1A 0R6, Canada
关键词
Big data; cancer; feature pattern discovery; multiclass nonnegative matrix factorization (MC-NMF); stability selection; TUMOR PROGRESSION; CELLS; CATECHOLAMINES; EXPRESSION; SELECTION;
D O I
10.1109/TNNLS.2018.2849932
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this big data era, interpretable machine learning models are strongly demanded for the comprehensive analytics of large-scale multiclass data. Characterizing all features from such data is a key but challenging step to understand the complexity. However, existing feature selection methods do not meet this need. In this paper, to address this problem, we propose a Bayesian multiclass nonnegative matrix factorization (MC-NMF) model with structured sparsity that is able to discover ubiquitous and class-specific features. Variational update rules were derived for efficient decomposition. In order to relieve the need of model selection and stably describe feature patterns, we further propose MC-NMF with stability selection, an ensemble method that collectively detects feature patterns from many runs of MC-NMF using different hyperparameter values and training subsets. We assessed our models on both simulated count data and multitumor ribonucleic acid-seq data. The experiments revealed that our models were able to recover predefined feature patterns from the simulated data and identify biologically meaningful patterns from the pan-cancer data.
引用
收藏
页码:615 / 629
页数:15
相关论文
共 50 条
  • [1] Feature Extraction and Discovery of microRNAs Using Nonnegative Matrix Factorization
    Liu, Weixiang
    Wang, Tianfu
    Chen, Siping
    Tang, Aifa
    [J]. PROCEEDINGS OF THE 11TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2008,
  • [2] Nonnegative Matrix Factorization: A Comprehensive Review
    Wang, Yu-Xiong
    Zhang, Yu-Jin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (06) : 1336 - 1353
  • [3] Nonnegative matrix factorization for pattern recognition
    Okun, O
    Priisalu, H
    [J]. PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON VISUALIZATION, IMAGING, AND IMAGE PROCESSING, 2005, : 546 - 551
  • [4] Nonnegative matrix factorization in polynomial feature space
    Buciu, Ioan
    Nikolaidis, Nikos
    Pitas, Ioannis
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (06): : 1090 - 1100
  • [5] Community discovery using nonnegative matrix factorization
    Wang, Fei
    Li, Tao
    Wang, Xin
    Zhu, Shenghuo
    Ding, Chris
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 22 (03) : 493 - 521
  • [6] Community discovery using nonnegative matrix factorization
    Fei Wang
    Tao Li
    Xin Wang
    Shenghuo Zhu
    Chris Ding
    [J]. Data Mining and Knowledge Discovery, 2011, 22 : 493 - 521
  • [7] Discriminant analysis using nonnegative matrix factorization for nonparametric multiclass classification
    Kim, Hyunsoo
    Park, Haesun
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 182 - +
  • [8] Nonnegative Matrix Factorization with Integrated Graph and Feature Learning
    Peng, Chong
    Kang, Zhao
    Hu, Yunhong
    Cheng, Jie
    Cheng, Qiang
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2017, 8 (03)
  • [9] Nonnegative matrix factorization and its applications in pattern recognition
    LIU Weixiang*
    [J]. Science Bulletin, 2006, (01) : 7 - 18
  • [10] Nonnegative matrix factorization and its applications in pattern recognition
    Liu, WX
    Zheng, N
    You, QB
    [J]. CHINESE SCIENCE BULLETIN, 2006, 51 (01): : 7 - 18