OrthoClust: an orthology-based network framework for clustering data across multiple species

被引:37
|
作者
Yan, Koon-Kiu [1 ,2 ]
Wang, Daifeng [1 ,2 ]
Rozowsky, Joel [1 ,2 ]
Zheng, Henry [2 ]
Cheng, Chao [4 ]
Gerstein, Mark [1 ,2 ,3 ]
机构
[1] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[2] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[3] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
[4] Dartmouth Med Sch, Dept Genet, Hanover, NH 03755 USA
来源
GENOME BIOLOGY | 2014年 / 15卷 / 08期
关键词
GENE-COEXPRESSION NETWORK; NONCODING RNAS; BIOLOGICAL NETWORKS; COMMUNITY STRUCTURE; EXPRESSION; CONSERVATION; ONTOLOGY; CLASSIFICATION; TRANSCRIPTION; ANNOTATION;
D O I
10.1186/gb-2014-15-8-r100
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Increasingly, high-dimensional genomics data are becoming available for many organisms. Here, we develop OrthoClust for simultaneously clustering data across multiple species. OrthoClust is a computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species. It outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific. We demonstrate the application of OrthoClust using the RNA-Seq expression profiles of Caenorhabditis elegans and Drosophila melanogaster from the modENCODE consortium. A potential application of cross-species modules is to infer putative analogous functions of uncharacterized elements like non-coding RNAs based on guilt-by-association.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Clustering of cancer data based on Stiefel manifold for multiple views
    Tian, Jing
    Zhao, Jianping
    Zheng, Chunhou
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [32] Data distribution system: clustering based on neural network technologies
    Vikulov, E. O.
    Denisova, L. A.
    [J]. INTERNATIONAL WORKSHOP ADVANCED TECHNOLOGIES IN MATERIAL SCIENCE, MECHANICAL AND AUTOMATION ENGINEERING - MIP: ENGINEERING - 2019, 2019, 537
  • [33] Optimization of network security protection posture based on data clustering
    Zhu, Jiancheng
    [J]. Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [34] Research on Data Collection Protocol Based on Clustering for Sensor Network
    Gan, Li
    Li, Jia
    Du, Fangfang
    [J]. 2011 AASRI CONFERENCE ON INFORMATION TECHNOLOGY AND ECONOMIC DEVELOPMENT (AASRI-ITED 2011), VOL 1, 2011, : 62 - 65
  • [35] An Evolutionary Immune Network based on Kernel method for data clustering
    Wu, Lei
    Peng, Lei
    Ye, Ya-Lan
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1759 - +
  • [36] Optimization of network security protection situation based on data clustering
    Ye, Wei
    Wang, Hongkai
    Zhong, Yijun
    [J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2022,
  • [37] BAYESIAN MODEL-BASED CLUSTERING FOR POPULATIONS OF NETWORK DATA
    Mantziou, Anastasia
    Lunagomez, Simon
    Mitra, Robin
    [J]. ANNALS OF APPLIED STATISTICS, 2024, 18 (01): : 266 - 302
  • [38] Research on Data Collection Protocol Based on Clustering for Sensor Network
    Gan, Li
    Li, Jia
    Du, Fangfang
    [J]. 2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL IV, 2011, : 62 - 65
  • [39] Energy-Efficient Data Gathering Framework-Based Clustering via Multiple UAVs in Deadline-Based WSN Applications
    Albu-Salih, Alaa Taima
    Seno, Seyed Amin Hosseini
    [J]. IEEE ACCESS, 2018, 6 : 72275 - 72286
  • [40] An Efficient Parallel Algorithm for Clustering Big Data based on the Spark Framework
    Faculty of Science of Rabat, Mohammed V University, Rabat, Morocco
    [J]. Intl. J. Adv. Comput. Sci. Appl., 7 (890-896):