Automatic Generation of Merge Factor for Clustering Microarray Data

Cited by: 0
Authors
Pavan, K. Karteeka [1]
Rao, Allam Appa [2]
Rao, A. V. Dattatreya [3]
Sridhar, G. R. [4]
Affiliations
[1] RVR & JC Coll Engn, Guntur, Andhra Pradesh, India
[2] Jawaharlal Nehru Technol Univ, Kakinada, Andhra Pradesh, India
[3] Acharya Nagarjuna Univ, Guntur, Andhra Pradesh, India
[4] Endocrine & Diabet Ctr, Visakhapatnam, Andhra Pradesh, India
Keywords
Bioinformatics; Microarray gene expression data; coexpressed genes; clustering; K-means; ISODATA; AGMFI
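DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract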
Microarrays have made it possible to simultaneously monitor the expression profiles of thousands of genes under various experimental conditions. Identifying coexpressed genes and coherent patterns is the central goal of microarray (gene expression) data analysis and an important task in bioinformatics research. Cluster analysis of gene expression data has proved to be a useful tool for identifying coexpressed genes and biologically relevant groupings of genes and samples. In this paper we propose an algorithm, Automatic Generation of Merge Factor for ISODATA (AGMFI), to cluster microarray data on the basis of ISODATA. The main idea of AGMFI is to generate initial values for the merge factor and the maximum number of merges, instead of selecting heuristic values as in ISODATA. One significant advantage of AGMFI over K-means is that the initial clusters may be merged or split, so the final number of clusters may differ from the number specified as part of the input. We evaluate its performance by applying it to well-known, publicly available microarray data sets and to a simulated data set [3], and compare the results with those of K-means clustering. The experiments indicate that the proposed algorithm, AGMFI, increased the enrichment of genes of similar function within each cluster.
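To illustrate the kind of merge step the abstract refers to, the following is a minimal Python sketch of one ISODATA-style merge pass, in which cluster centroids closer than a merge factor are combined. It is not the AGMFI procedure: the abstract does not say how AGMFI derives the merge factor or the maximum number of merges, so the defaults used here (half the mean pairwise centroid distance, and at most half the number of clusters) are placeholder assumptions, and the function name `merge_close_clusters` and its parameters are hypothetical.

```python
from itertools import combinations
from math import dist


def merge_close_clusters(centroids, sizes, merge_factor=None, max_merges=None):
    """One ISODATA-style merge pass: combine centroid pairs closer than merge_factor."""
    pairs = list(combinations(range(len(centroids)), 2))
    if not pairs:
        return [tuple(c) for c in centroids], list(sizes)

    # Euclidean distance between every pair of centroids.
    distances = {(i, j): dist(centroids[i], centroids[j]) for i, j in pairs}

    if merge_factor is None:
        # ASSUMPTION: placeholder heuristic, not the rule used by AGMFI.
        merge_factor = 0.5 * sum(distances.values()) / len(distances)
    if max_merges is None:
        # ASSUMPTION: placeholder cap on merges per pass, not the AGMFI value.
        max_merges = len(centroids) // 2

    merged = set()
    new_centroids, new_sizes = [], []
    merges_done = 0

    # Visit the closest pairs first; stop once the cap or the threshold is hit.
    for i, j in sorted(pairs, key=distances.get):
        if merges_done >= max_merges or distances[(i, j)] >= merge_factor:
            break
        if i in merged or j in merged:
            continue  # each cluster takes part in at most one merge per pass
        n = sizes[i] + sizes[j]
        # The merged centroid is the size-weighted mean of the two centroids.
        new_centroids.append(
            tuple((sizes[i] * a + sizes[j] * b) / n
                  for a, b in zip(centroids[i], centroids[j]))
        )
        new_sizes.append(n)
        merged.update((i, j))
        merges_done += 1

    # Clusters that were not merged are carried over unchanged.
    for idx, c in enumerate(centroids):
        if idx not in merged:
            new_centroids.append(tuple(c))
            new_sizes.append(sizes[idx])

    return new_centroids, new_sizes


if __name__ == "__main__":
    centroids = [(0.0, 0.0), (0.2, 0.0), (10.0, 10.0)]
    sizes = [10, 30, 5]
    print(merge_close_clusters(centroids, sizes))
```

Because clusters can be combined in such a pass, the number of clusters returned may be smaller than the number supplied, which is the behaviour the abstract contrasts with K-means. A full ISODATA loop would also include reassignment and split steps, which are omitted here.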
Pages: 127-131
Number of pages: 5