Efficient Clustering Using Nonnegative Matrix Factorization for Gene Expression Dataset

被引:1
|
作者
Kherwa, Pooja [1 ]
Bansal, Poonam [1 ]
Singh, Sukhvinder [1 ]
Gupta, Tanishaq [1 ]
机构
[1] Maharaja Surajmal Inst Technol, C-4 Janak Puri, New Delhi 110058, India
关键词
Dimension reduction; Gene expression; Nonnegative matrix factorization; Clustering; SINGULAR-VALUE DECOMPOSITION; CLASSIFICATION; PREDICTION; DISCOVERY; PATTERNS;
D O I
10.1007/978-981-15-1081-6_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gene expression dataset consists of a complex association of gene patterns consisting of tens or hundreds samples. Finding relevant biological information for different tasks from this complex data is really a tedious job. Text mining approaches like classification and clustering are used in the literature to discover relevant aspects of dataset for many biological applications. Gene expression also contains irrelevant data known as noise. In this paper for efficient clustering results, a very powerful dimension reduction technique is presented as preprocessing step to improve clustering results and also cluster the gene expression samples into relevant classes. In this study, the concept of nonnegative matrix factorization and non-smooth nonnegative matrix factorization, which is an extended algorithm of the basic-NNMF algorithm is used for sparser matrix factorization, and the factorization differences are observed. Later on, the performance and the accuracy of K-means, NNMF, and NS-NNMF are compared, and NS-NNMF has shown highest accuracy.
引用
收藏
页码:179 / 190
页数:12
相关论文
共 50 条
  • [1] Regularized Nonnegative Matrix Factorization for Clustering Gene Expression Data
    Liu, Weixiang
    Wang, Tianfu
    Chen, Siping
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [2] Tumor Clustering Using Nonnegative Matrix Factorization With Gene Selection
    Zheng, Chun-Hou
    Huang, De-Shuang
    Zhang, Lei
    Kong, Xiang-Zhen
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2009, 13 (04): : 599 - 607
  • [3] Distributional Clustering Using Nonnegative Matrix Factorization
    Zhu, Zhenfeng
    Ye, Yangdong
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4705 - 4711
  • [4] Document clustering using nonnegative matrix factorization/
    Shahnaz, F
    Berry, MW
    Pauca, VP
    Plemmons, RJ
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (02) : 373 - 386
  • [5] On α-divergence based nonnegative matrix factorization for clustering cancer gene expression data
    Liu, Weixiang
    Yuan, Kehong
    Ye, Datian
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2008, 44 (01) : 1 - 5
  • [6] Sparse p-norm Nonnegative Matrix Factorization for clustering gene expression data
    Liu, Weixiang
    Yuan, Kehong
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2008, 2 (03) : 236 - 249
  • [7] Clustering Data using a Nonnegative Matrix Factorization (NMF)
    Abdulla, Hussam Dahwa
    Polovincak, Martin
    Snasel, Vaclav
    2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009), 2009, : 749 - 752
  • [8] The Estimation of Dimensionality In Gene Expression Data using Nonnegative Matrix Factorization
    Kelton, Conor J.
    Lee, Waishing
    Rusay, Matthew
    Maxian, Ondrej
    Fertig, Elana J.
    Ochs, Michael F.
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1642 - 1649
  • [9] An efficient network clustering approach using graph-boosting and nonnegative matrix factorization
    Tang, Ji
    Xu, Xiaoru
    Wang, Teng
    Rezaeipanah, Amin
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (11)
  • [10] Constrained Clustering With Nonnegative Matrix Factorization
    Zhang, Xianchao
    Zong, Linlin
    Liu, Xinyue
    Luo, Jiebo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (07) : 1514 - 1526