DBGSA: a novel method of distance-based gene set analysis

被引:0
|
作者
Jin Li
Limei Wang
Liangde Xu
Ruijie Zhang
Meilin Huang
Ke Wang
Jiankai Xu
Hongchao Lv
Zhenwei Shang
Mingming Zhang
Yongshuai Jiang
Maozu Guo
Xia Li
机构
[1] College of Bioinformatics Science and Technology,
[2] Harbin Medical University,undefined
[3] School of Life Science and Technology,undefined
[4] Harbin Institute of Technology,undefined
[5] Center of Computer,undefined
[6] Harbin Medical University,undefined
[7] School of Computer Science and Technology,undefined
[8] Harbin Institute of Technology,undefined
来源
Journal of Human Genetics | 2012年 / 57卷
关键词
average linkage statistical method; centroid statistical method; distance; gene set analysis; permutation;
D O I
暂无
中图分类号
学科分类号
摘要
When compared with single gene functional analysis, gene set analysis (GSA) can extract more information from gene expression profiles. Currently, several gene set methods have been proposed, but most of the methods cannot detect gene sets with a large number of minor-effect genes. Here, we propose a novel distance-based gene set analysis method. The distance between two groups of genes with different phenotypes based on gene expression should be larger if a certain gene set is significantly associated with the given phenotype. We calculated the distance between two groups with different phenotypes, estimated the significant P-values using two permutation methods and performed multiple hypothesis testing adjustments. This method was performed on one simulated data set and three real data sets. After a comparison and literature verification, we determined that the gene resampling-based permutation method is more suitable for GSA, and the centroid statistical and average linkage statistical distance methods are efficient, especially in detecting gene sets containing more minor-effect genes. We believe that this distance-based method will assist us in finding functional gene sets that are significantly related to a complex trait. Additionally, we have prepared a simple and publically available Perl and R package (http://bioinfo.hrbmu.edu.cn/dbgsa or http://cran.r-project.org/web/packages/DBGSA/).
引用
收藏
页码:642 / 653
页数:11
相关论文
共 50 条
  • [31] A kernel distance-based representative subset selection method
    Gani, Walid
    Limam, Mohamed
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (01) : 135 - 148
  • [32] A distance-based method for evaluating protein-structures
    P-P Heym
    W Brandt
    L Wessjohann
    Chemistry Central Journal, 3 (Suppl 1)
  • [33] IDD: A supervised interval distance-based method for discretization
    Ruiz, Francisco J.
    Angulo, Cecilio
    Agell, Nuria
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (09) : 1230 - 1238
  • [34] A laser distance-based method for measuring standing residue
    Fox, FA
    Wagner, LE
    SOIL EROSION RESEARCH FOR THE 21ST CENTURY, PROCEEDINGS, 2001, : 207 - 210
  • [35] A Distance-based Method for Spatial Prediction in the Presence of Trend
    Melo, Carlos E.
    Mateu, Jorge
    Melo, Oscar O.
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2020, 25 (03) : 315 - 338
  • [36] DSIM: A Distance-Based Indexing Method for genomic sequences
    Cao, X
    Ooi, BC
    Pang, HH
    Tung, AKH
    Tan, KL
    BIBE 2005: 5TH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, 2005, : 97 - 104
  • [37] A Distance-based Method for Spatial Prediction in the Presence of Trend
    Carlos E. Melo
    Jorge Mateu
    Oscar O. Melo
    Journal of Agricultural, Biological and Environmental Statistics, 2020, 25 : 315 - 338
  • [38] Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data
    Huang, DS
    Pan, W
    BIOINFORMATICS, 2006, 22 (10) : 1259 - 1268
  • [39] Gene expression data classification: some distance-based methods
    Makinde, Olusola Samuel
    KUWAIT JOURNAL OF SCIENCE, 2019, 46 (03) : 31 - 39
  • [40] Distance-correlation based gene set analysis in longitudinal studies
    Sun, Jiehuan
    Herazo-Maya, Jose D.
    Huang, Xiu
    Kaminski, Naftali
    Zhao, Hongyu
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2018, 17 (01)