Evolutionary soft co-clustering: formulations, algorithms, and applications

被引:10
|
作者
Zhang, Wenlu [1 ]
Li, Rongjian [1 ]
Feng, Daming [1 ]
Chernikov, Andrey [1 ]
Chrisochoides, Nikos [1 ]
Osgood, Christopher [2 ]
Ji, Shuiwang [1 ]
机构
[1] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
[2] Old Dominion Univ, Dept Biol Sci, Norfolk, VA 23529 USA
基金
美国国家科学基金会;
关键词
Evolutionary co-clustering; Expectation maximization; Biological image computing; Bioinformatics; GENE-EXPRESSION; GLOBAL ANALYSIS; PATTERNS;
D O I
10.1007/s10618-014-0375-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the co-clustering of time-varying data using evolutionary co-clustering methods. Existing approaches are based on the spectral learning framework, thus lacking a probabilistic interpretation. We overcome this limitation by developing a probabilistic model in this paper. The proposed model assumes that the observed data are generated via a two-step process that depends on the historic co-clusters. This allows us to capture the temporal smoothness in a probabilistically principled manner. To perform maximum likelihood parameter estimation, we present an EM-based algorithm. We also establish the convergence of the proposed EM algorithm. An appealing feature of the proposed model is that it leads to soft co-clustering assignments naturally. We evaluate the proposed method on both synthetic and real-world data sets. Experimental results show that our method consistently outperforms prior approaches based on spectral method. To fully exploit the real-world impact of our methods, we further perform a systematic application study on the analysis of Drosophila gene expression pattern images. We encode the spatial gene expression information at a particular developmental time point into a data matrix using a mesh-generation pipeline. We then co-cluster the embryonic domains and the genes simultaneously for multiple time points using our evolutionary co-clustering method. Results show that the co-clusters of gene and embryonic domains reflect the underlying biology.
引用
收藏
页码:765 / 791
页数:27
相关论文
共 50 条
  • [1] Evolutionary soft co-clustering: formulations, algorithms, and applications
    Wenlu Zhang
    Rongjian Li
    Daming Feng
    Andrey Chernikov
    Nikos Chrisochoides
    Christopher Osgood
    Shuiwang Ji
    [J]. Data Mining and Knowledge Discovery, 2015, 29 : 765 - 791
  • [2] Evolutionary Spectral Co-Clustering
    Green, Nathan
    Rege, Manjeet
    Liu, Xumin
    Bailey, Reynold
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 1074 - 1081
  • [3] Scalable Co-clustering Algorithms
    Kwon, Bongjune
    Cho, Hyuk
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT 1, PROCEEDINGS, 2010, 6081 : 32 - +
  • [4] Co-clustering algorithms for distributional data with automated variable weighting
    De Carvalho, Francisco de A. T.
    Balzanella, Antonio
    Irpino, Antonio
    Verde, Rosanna
    [J]. INFORMATION SCIENCES, 2021, 549 : 87 - 115
  • [6] Joint co-clustering: Co-clustering of genomic and clinical bioimaging data
    Ficarra, Elisa
    De Micheli, Giovanni
    Yoon, Sungroh
    Benini, Luca
    Macii, Enrico
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (05) : 938 - 949
  • [7] Bayesian Co-clustering
    Shan, Hanhuai
    Banerjee, Arindam
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 530 - 539
  • [8] Directional co-clustering
    Aghiles Salah
    Mohamed Nadif
    [J]. Advances in Data Analysis and Classification, 2019, 13 : 591 - 620
  • [9] Co-Clustering on Manifolds
    Gu, Quanquan
    Zhou, Jie
    [J]. KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 359 - 367
  • [10] Directional co-clustering
    Salah, Aghiles
    Nadif, Mohamed
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 591 - 620