Multi-Factored Gene-Gene Proximity Measures Exploiting Biological Knowledge Extracted from Gene Ontology: Application in Gene Clustering

Cited by: 4
Authors
Acharya, Sudipta [1]
Saha, Sriparna [1]
Pradhan, Prasanna [2]
Affiliations
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna 801103, Bihar, India
[2] Sikkim Manipal Inst Technol, Dept Comp Applicat, Rangpo 737132, Sikkim, India
Keywords
Semantics; Integrated circuits; Bioinformatics; Ontologies; Tools; Genomics; Current measurement; Gene ontology (GO); gene clustering; semantic similarity; distance measure; gene-gene similarity matrix; multi-objective clustering; SEMANTIC SIMILARITY; CLASSIFICATION; EXPRESSION; ALGORITHM; CANCER; TOOL;
DOI
10.1109/TCBB.2018.2849362
CLC number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Gene Ontology (GO) is a dynamic vocabulary for describing the cellular functions of proteins and genes; it comprises three sub-ontologies, namely Biological-process, Cellular-component, and Molecular-function. It has several applications in bioinformatics, such as annotating/measuring gene-gene or protein-protein semantic similarity and identifying genes/proteins by their GO annotations for disease gene and target discovery. To determine semantic similarity between genes, several semantic measures have been proposed in the literature, which involve the information content of GO-terms, the GO tree structure, or a combination of both. However, most existing semantic similarity measures do not consider the different topological and information-theoretic aspects of GO-terms collectively. Motivated by this, in this article we first propose three novel semantic similarity/distance measures for genes covering different aspects of the GO-tree. These are then embedded in the frameworks of well-known multi-objective and single-objective clustering algorithms to determine functionally similar genes. For comparative analysis, 10 popular existing GO-based semantic similarity/distance measures and tools are also considered. Experimental results on Mouse genome, Yeast, and Human genome datasets demonstrate the superiority of multi-objective clustering algorithms in association with the proposed multi-factored similarity/distance measures. Clustering outcomes are further validated by biological/statistical significance tests. Supplementary information is available at https://www.iitp.ac.in/sriparna/journals.html.
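As a rough illustration of what an information-content-based semantic measure looks like, the sketch below computes a Resnik-style similarity on a toy ontology. It is not one of the three measures proposed in the article; the term names, gene names, and helper functions are all hypothetical, and the gene-level score simply takes the maximum over term pairs.

```python
import math

# Hypothetical toy ontology: term -> set of parent terms (not real GO IDs).
PARENTS = {
    "root": set(),
    "A": {"root"},
    "B": {"root"},
    "A1": {"A"},
    "A2": {"A"},
}

# Hypothetical direct annotations: gene -> set of terms.
ANNOTATIONS = {
    "g1": {"A1"},
    "g2": {"A2"},
    "g3": {"B"},
    "g4": {"A1"},
}


def ancestors(term):
    """Return the term itself plus all of its ancestors in the DAG."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def information_content(term):
    """IC(t) = -log p(t); p(t) is the share of genes annotated to t or a descendant.

    Assumes at least one gene is annotated under the term (otherwise p(t) = 0).
    """
    hits = sum(
        1
        for terms in ANNOTATIONS.values()
        if any(term in ancestors(t) for t in terms)
    )
    return -math.log(hits / len(ANNOTATIONS))


def resnik(term_a, term_b):
    """Resnik similarity: IC of the most informative common ancestor."""
    common = ancestors(term_a) & ancestors(term_b)
    return max(information_content(t) for t in common)


def gene_similarity(gene_a, gene_b):
    """Gene-level score: maximum Resnik similarity over all annotated term pairs."""
    return max(
        resnik(a, b)
        for a in ANNOTATIONS[gene_a]
        for b in ANNOTATIONS[gene_b]
    )
```

In this toy setup, genes that share a specific term (g1 and g4) score higher than genes that only share a broader ancestor (g1 and g2), and genes that meet only at the root (g1 and g3) score zero.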
Pages: 207-219
Number of pages: 13
Related papers
50 in total
  • [1] Novel symmetry-based gene-gene dissimilarity measures utilizing Gene Ontology: Application in gene clustering
    Acharya, Sudipta
    Saha, Sriparna
    Pradhan, Prasanna
    [J]. GENE, 2018, 679 : 341 - 351
  • [2] Gene function prediction with knowledge from gene ontology
    Shen, Ying
    Zhang, Lin
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 13 (01) : 50 - 62
  • [3] Gene-Environment Interactions and Gene-Gene Interactions on Two Biological Age Measures: Evidence from Taiwan Biobank Participants
    Lin, Wan-Yu
    [J]. ADVANCED BIOLOGY, 2024, 8 (07):
  • [4] Deriving Homogeneous Subsets from Gene Sets by Exploiting the Gene Ontology
    Stier, Quirin
    Thrun, Michael C.
    [J]. INFORMATICA, 2023, 34 (02) : 357 - 386
  • [5] Principal interactions analysis for repeated measures data: application to gene-gene and gene-environment interactions
    Mukherjee, Bhramar
    Ko, Yi-An
    VanderWeele, Tyler
    Roy, Anindya
    Park, Sung Kyun
    Chen, Jinbo
    [J]. STATISTICS IN MEDICINE, 2012, 31 (22) : 2531 - 2551
  • [6] Fuzzy clustering with biological knowledge for gene selection
    Ghosh, Sampreeti
    Mitra, Sushmita
    Dattagupta, Rana
    [J]. APPLIED SOFT COMPUTING, 2014, 16 : 102 - 111
  • [7] Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions
    Hur, Junguk
    Ozgur, Arzucan
    Xiang, Zuoshuang
    He, Yongqun
    [J]. JOURNAL OF BIOMEDICAL SEMANTICS, 2015, 6
  • [8] Unsupervised gene selection using biological knowledge: application in sample clustering
    Acharya, Sudipta
    Saha, Sriparna
    Nikhil, N.
    [J]. BMC BIOINFORMATICS, 2017, 18