Clustering analysis of SAGE data using a Poisson approach

被引:0
|
作者
Li Cai
Haiyan Huang
Seth Blackshaw
Jun S Liu
Connie Cepko
Wing H Wong
机构
[1] Dana-Farber Cancer Institute,Department of Research Computing
[2] Harvard School of Public Health,Department of Biostatistics
[3] Harvard Medical School,Department of Genetics
[4] Harvard University,Department of Statistics
[5] Science Center,Department of Statistics
[6] University of California,Department of Neuroscience
[7] Johns Hopkins University School of Medicine,undefined
来源
关键词
Additional Data File; Massive Parallel Signature Sequencing; Joint Likelihood; Unknown Biological Function; RIKEN cDNAs;
D O I
暂无
中图分类号
学科分类号
摘要
Serial analysis of gene expression (SAGE) data have been poorly exploited by clustering analysis owing to the lack of appropriate statistical methods that consider their specific properties. We modeled SAGE data by Poisson statistics and developed two Poisson-based distances. Their application to simulated and experimental mouse retina data show that the Poisson-based distances are more appropriate and reliable for analyzing SAGE data compared to other commonly used distances or similarity measures such as Pearson correlation or Euclidean distance.
引用
收藏
相关论文
共 50 条
  • [1] Clustering analysis of SAGE data using a Poisson approach
    Cai, L
    Huang, HY
    Blackshaw, S
    Liu, JS
    Cepko, C
    Wong, WH
    GENOME BIOLOGY, 2004, 5 (07)
  • [2] A Poisson-based adaptive affinity propagation clustering for SAGE data
    Tang, DongMing
    Zhu, QingXin
    Yang, Fan
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2010, 34 (01) : 63 - 70
  • [3] A new clustering approach using data envelopment analysis
    Po, Rung-Wei
    Guh, Yuh-Yuan
    Yang, Miin-Shen
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 199 (01) : 276 - 284
  • [4] CLASSIFICATION AND CLUSTERING OF SEQUENCING DATA USING A POISSON MODEL
    Witten, Daniela M.
    ANNALS OF APPLIED STATISTICS, 2011, 5 (04): : 2493 - 2518
  • [5] Clustering analysis SAGE libraries using maximal information coefficient
    Tang, Dongming
    PROCEEDINGS OF THE 2015 SEVENTH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2015), 2015, : 64 - 69
  • [6] Comment on "A new clustering approach using data envelopment analysis"
    Krueger, Jens J.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 206 (01) : 269 - 270
  • [7] Virtual-SAGE: A new approach to EST data analysis
    Poroyko, V
    Calugaru, V
    Fredricksen, M
    Bohnert, HJ
    DNA RESEARCH, 2004, 11 (02) : 145 - 152
  • [8] A Streaming Clustering Approach Using a Heterogeneous System for Big Data Analysis
    Lee, Dajung
    Althoff, Alric
    Richmond, Dustin
    Kastner, Ryan
    2017 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2017, : 699 - 706
  • [9] Modeling Sage data with a truncated gamma-Poisson model
    Helene H Thygesen
    Aeilko H Zwinderman
    BMC Bioinformatics, 7
  • [10] Modeling Sage data with a truncated gamma-Poisson model
    Thygesen, Helene H.
    Zwinderman, Aeilko H.
    BMC BIOINFORMATICS, 2006, 7 (1)